HAD: Hallucination-Aware Diffusion Priors for 3D Reconstruction

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

Xi Liu¹,²,* Weiwei Sun¹,† Zhou Ren¹ Chris Broaddus¹ Siyu Huang² Laurent Guigues¹

¹Amazon AWS ²Clemson University

† Project lead. * Work done at Amazon.

Method Overview

We train 3D Gaussian Splatting (3DGS) on the input images together with HAD-augmented novel views. HAD combines a pretrained diffusion prior, which generates images from 3DGS-rendered views conditioned on reference input images, with our hallucination score network, which predicts pixel-wise reliability maps. Our multi-sampling strategy fuses multiple generated versions of each view into a refined augmented view. The hallucination scores then guide 3DGS optimization by masking out unreliable content, which improves reconstruction quality in data-sparse scenarios.
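
The two ideas described above, fusing several generated samples and masking unreliable pixels during optimization, can be sketched in NumPy as follows. This is a minimal illustration, not the authors' implementation. The function names `fuse_samples` and `masked_l1_loss`, the reliability-weighted averaging rule, the convention that a higher score means "more likely hallucinated", the 0.5 threshold, and the choice of an L1 loss are all assumptions made for this example.

```python
import numpy as np


def fuse_samples(samples, hallucination_scores, eps=1e-8):
    """Fuse K generated views into one augmented view (hypothetical rule).

    samples:              array of shape (K, H, W, 3), generated images.
    hallucination_scores: array of shape (K, H, W), values in [0, 1].
    Assumption: a higher score means the pixel is more likely hallucinated,
    so each sample is weighted per pixel by (1 - score).
    Returns the fused image (H, W, 3) and a fused score map (H, W).
    """
    samples = np.asarray(samples, dtype=np.float64)
    scores = np.asarray(hallucination_scores, dtype=np.float64)

    weights = 1.0 - scores                 # per-pixel reliability, (K, H, W)
    norm = weights.sum(axis=0) + eps       # (H, W), eps avoids division by zero
    fused = (weights[..., None] * samples).sum(axis=0) / norm[..., None]

    # Assumption: a fused pixel is as trustworthy as its most reliable sample.
    fused_score = scores.min(axis=0)
    return fused, fused_score


def masked_l1_loss(rendered, target, hallucination_score, threshold=0.5):
    """Mean L1 error over pixels whose score is below the threshold.

    rendered, target:    arrays of shape (H, W, 3).
    hallucination_score: array of shape (H, W), values in [0, 1].
    Pixels at or above the threshold are excluded from the loss.
    """
    mask = (np.asarray(hallucination_score) < threshold).astype(np.float64)
    per_pixel = np.abs(np.asarray(rendered) - np.asarray(target)).mean(axis=-1)
    kept = mask.sum()
    if kept == 0:
        return 0.0                         # nothing reliable to supervise on
    return float((mask * per_pixel).sum() / kept)
```

In an actual training loop the same mask would be applied inside a differentiable loss on the 3DGS renders; NumPy is used here only to keep the sketch self-contained.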

Hallucination Detection

Our hallucination scoring network can recognize artifacts introduced by diverse generative priors, including image diffusion, video diffusion, and multi-view diffusion models.

[Figure: qualitative hallucination detection results for Difix3D+, GenFusion, and SVC, with two cases per method. Each case shows three panels: GT, the method's output, and the predicted Hallucination Score.]

Difix3D+ denotes a post-rendering diffusion pipeline, where 3DGS first renders a novel view and an image diffusion prior then refines the rendered image. The resulting hallucinations are not limited to extra objects or texture details that do not exist in the scene; they can also appear as geometric distortions, such as warped structures, inconsistent boundaries, or shape changes that break multi-view consistency.