TL;DR: Given a collection of images under extreme illumination variation, we make the lighting consistent and obtain a consistent NeRF!
Here we show our results on our synthetic dataset, with rendering trajectories of the reconstructions under the reference illumination. Please see the full set of results here.
Here we show our results on real scenes from the NAVI dataset.
Given a collection of photos taken under varying illumination, we select the image with the desired illumination as the reference, then use a multiview diffusion model to relight all the images to match the reference. We then use a reflection-aware NeRF to reconstruct the object from the relit images. Shading embeddings allow us to model per-image normal variations, which we explain below.
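For concreteness, here is a minimal Python sketch of the overall pipeline. The helper names (`relight_with_diffusion`, `train_reflection_aware_nerf`), the toy blending, and all shapes are our own illustrative assumptions, not the released code.

```python
import numpy as np

def relight_with_diffusion(images, reference):
    """Hypothetical stand-in for the multiview relighting diffusion model:
    here we simply blend each image toward the reference illumination."""
    return [0.5 * img + 0.5 * reference for img in images]

def train_reflection_aware_nerf(relit_images, shading_embed_dim=8):
    """Hypothetical stand-in for the reflection-aware NeRF stage.
    One learnable shading embedding per image absorbs the small
    per-image normal variations that remain after relighting."""
    shading_embeddings = [np.zeros(shading_embed_dim) for _ in relit_images]
    # ... optimize NeRF parameters and shading_embeddings jointly ...
    return shading_embeddings

# Toy capture: N images of the same object under different illuminations.
rng = np.random.default_rng(0)
captures = [rng.random((64, 64, 3)) for _ in range(4)]

reference = captures[0]                      # pick the desired illumination
relit = relight_with_diffusion(captures, reference)
embeddings = train_reflection_aware_nerf(relit)
```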
In NeRF, view-dependent effects such as reflections are modeled by feeding the viewing direction into the appearance MLP. However, it is challenging to fit reflections accurately this way. NeRF-Casting addresses this issue by explicitly modeling reflections off surfaces using normals predicted by an MLP. When the input images have inconsistencies, as our relit images do, the typical solution is to use appearance embeddings, but this results in diffuse objects with incorrectly "static" reflections, as we show below. Instead, we model the small remaining inconsistencies in our relit images as variations in the normal vectors, and we handle them by feeding an additional per-image learnable vector, which we call a shading embedding, into the normal-prediction MLP.
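Below is a small PyTorch sketch of what feeding a per-image shading embedding into the normal-prediction MLP could look like. The class name, layer sizes, and embedding dimension are assumptions for illustration, not the architecture used in the paper.

```python
import torch
import torch.nn as nn

class NormalMLPWithShadingEmbedding(nn.Module):
    """Sketch: the normal-prediction MLP receives a per-image learnable
    "shading embedding" alongside the spatial feature, so small per-image
    inconsistencies can be explained as normal perturbations rather than
    being absorbed into a diffuse, view-independent appearance."""
    def __init__(self, num_images, feature_dim=64, embed_dim=8):
        super().__init__()
        self.shading_embeddings = nn.Embedding(num_images, embed_dim)
        self.mlp = nn.Sequential(
            nn.Linear(feature_dim + embed_dim, 64), nn.ReLU(),
            nn.Linear(64, 3),
        )

    def forward(self, features, image_ids):
        emb = self.shading_embeddings(image_ids)        # (B, embed_dim)
        n = self.mlp(torch.cat([features, emb], dim=-1))
        return nn.functional.normalize(n, dim=-1)       # unit normals

# Usage: one embedding per input image, indexed by the image each ray came from.
model = NormalMLPWithShadingEmbedding(num_images=4)
feats = torch.randn(16, 64)
ids = torch.randint(0, 4, (16,))
normals = model(feats, ids)                             # (16, 3) unit vectors
```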
We show that using prior work's appearance embeddings results in a diffuse appearance. In contrast, our shading embeddings preserve reflections and specular highlights.
Since the input images have varying illumination, we can choose any of them as the reference and reconstruct the object under that reference illumination.
Compared to recent state-of-the-art generative relighting, our relighting method is much more consistent across different sampling outputs. While the baseline shows significant lighting variations, the variance in our outputs appears as small displacements of the specular highlights, further motivating our shading embeddings.
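As a rough illustration of what we mean by consistency across samples, one can measure per-pixel variance over repeated relighting samples of the same view; the snippet below is a generic sketch with placeholder data, not the evaluation protocol from the paper.

```python
import numpy as np

def sample_variance(samples):
    """Per-pixel variance across repeated relighting samples of one view.
    Low variance indicates consistent outputs; residual variance that
    concentrates around highlights corresponds to small highlight shifts."""
    stack = np.stack(samples, axis=0)           # (S, H, W, 3)
    return stack.var(axis=0).mean(axis=-1)      # (H, W) variance map

rng = np.random.default_rng(0)
samples = [rng.random((32, 32, 3)) for _ in range(5)]  # placeholder samples
var_map = sample_variance(samples)
print(var_map.mean())
```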
We would like to thank Matthew Burruss and Xiaoming Zhao for their help with the rendering pipeline. We also thank Ben Poole, Alex Trevithick, Stan Szymanowicz, Rundi Wu, David Charatan, Jiapeng Tang, Matthew Levine, Ruiqi Gao, Ricardo Martin-Brualla, and Aleksander Hołyński for fruitful discussions.
@misc{alzayer2024generativemvr,
title={Generative Multiview Relighting for {3D} Reconstruction under Extreme Illumination Variation},
author={Alzayer, Hadi and Henzler, Philipp and Barron, Jonathan T. and Huang, Jia-Bin and Srinivasan, Pratul P. and Verbin, Dor},
year={2024},
eprint={2412.15211},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2412.15211},
}