SIC3D: Style Image Conditioned Text-to-3D Gaussian Splatting Generation
arXiv cs.CV / 4/13/2026
💬 OpinionSignals & Early TrendsModels & Research
Key Points
- SIC3D is a two-stage, image-conditioned text-to-3D generation pipeline that combines 2D diffusion-style guidance with 3D Gaussian Splatting to produce controllable 3D objects from text and a reference image.
- The first stage generates 3D content from text using a text-to-3DGS model, aiming to improve geometry synthesis derived from natural-language input.
- The second stylization stage transfers style from a reference image to the 3DGS representation using a novel Variational Stylized Score Distillation (VSSD) loss that targets both global and local texture patterns.
- SIC3D includes scaling regularization to reduce artifacts and better preserve the intended style patterns during the geometry-appearance alignment process.
- The authors report that SIC3D improves geometric fidelity and style adherence, achieving stronger qualitative and quantitative performance than prior text-to-3D approaches.
Related Articles

Black Hat Asia
AI Business

Apple is building smart glasses without a display to serve as an AI wearable
THE DECODER

Why Fashion Trend Prediction Isn’t Enough Without Generative AI
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

วิธีใช้ AI ทำ SEO ให้เว็บติดอันดับ Google (2026)
Dev.to