Microsoft Presents "TRELLIS.2": An Open-Source, 4b-Parameter, Image-To-3D Model Producing Up To 1536³ PBR Textured Assets, Built On Native 3D VAES With 16× Spatial Compression, Delivering Efficient, Scalable, High-Fidelity Asset Generation.

Reddit r/LocalLLaMA / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • Microsoft has introduced TRELLIS.2, an open-source 4B-parameter image-to-3D generative model aimed at producing high-fidelity 3D assets.
  • The model uses a native 3D VAE approach with 16× spatial compression and an O-Voxel “field-free” sparse voxel structure to reconstruct complex 3D geometry.
  • TRELLIS.2 is designed to generate arbitrary 3D assets with sharp details, complex topology, and complete PBR materials, supporting outputs up to 1536³ textured assets.
  • The announcement includes links to the research paper, the official GitHub code repository, and a Hugging Face live demo for hands-on evaluation.
Microsoft Presents "TRELLIS.2": An Open-Source, 4b-Parameter, Image-To-3D Model Producing Up To 1536³ PBR Textured Assets, Built On Native 3D VAES With 16× Spatial Compression, Delivering Efficient, Scalable, High-Fidelity Asset Generation.

TRELLIS.2 is a state-of-the-art large 3D generative model (4B parameters) designed for high-fidelity image-to-3D generation. It leverages a novel "field-free" sparse voxel structure termed O-Voxel to reconstruct and generate arbitrary 3D assets with complex topologies, sharp features, and full PBR materials.


Link to the Paper: https://arxiv.org/pdf/2512.14692

Link to the Code: https://github.com/microsoft/TRELLIS.2

Link to Try Out A Live Demo: https://huggingface.co/spaces/microsoft/TRELLIS.2
submitted by /u/44th--Hokage
[link] [comments]