3DrawAgent: Teaching LLM to Draw in 3D with Early Contrastive Experience
arXiv cs.CV / 4/10/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces 3DrawAgent, a training-free framework that uses LLMs to sequentially generate 3D sketches represented as Bezier curves from natural-language prompts.
- Instead of using explicit ground-truth supervision, it applies a relative experience optimization approach using pairwise comparisons where one sketch is judged better than another via CLIP-based perceptual rewards plus LLM-based fine-grained qualitative assessment.
- The method adapts the Group Reward Policy Optimization (GRPO) paradigm to improve 3D “spatial awareness” through geometric feedback, enabling black-box reinforcement without updating model parameters.
- Experiments report that 3DrawAgent can produce complex, coherent 3D Bezier sketches, show emergent geometric reasoning, and generalize to novel shapes.
- Overall, the work claims a new paradigm for advancing training-free 3D sketch intelligence by leveraging early contrastive/relative experience signals to guide LLM-driven 3D generation.



