UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images
arXiv cs.CV / 3/19/2026
📰 NewsModels & Research
Key Points
- UniSem introduces a unified framework for semantic-aware 3D reconstruction from sparse, unposed images, addressing instability and incomplete 3D semantics in prior 3D Gaussian Splatting methods.
- It adds Error-aware Gaussian Dropout (EGD) to suppress redundant Gaussian primitives based on rendering error cues, yielding more stable geometry and improved depth estimation.
- It also proposes Mix-training Curriculum (MTC) to blend 2D segmenter-lifted semantics with emergent 3D semantic priors through object-level prototype alignment, boosting semantic coherence.
- Experiments on ScanNet and Replica show strong depth and open-vocabulary 3D segmentation gains, including a 15.2% reduction in depth error and a 3.7% gain in mAcc with 16 views.




![[Boost]](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D800%252Cheight%3D%252Cfit%3Dscale-down%252Cgravity%3Dauto%252Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Fuser%252Fprofile_image%252F3833034%252F44fa15e0-8eb9-4843-a424-a4a7b3538f43.jpeg&w=3840&q=75)