Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling
arXiv cs.LG / 4/8/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that autoregressive SVG generation models suffer from poor performance when they use generic byte-level tokenization borrowed from NLP, which fragments coordinates and breaks spatial relationships.
- It introduces HiVG, a hierarchical SVG tokenization scheme that builds compact “atomic tokens” and compresses valid command–parameter blocks into geometry-constrained “segment tokens.”
- To reduce spatial mismatch and coordinate hallucination, the authors propose a Hierarchical Mean–Noise (HMN) embedding initialization that injects numerical ordering signals and semantic priors.
- A curriculum training strategy that gradually increases program complexity is used to make the model learn executable SVG programs more stably.
- Experiments on text-to-SVG and image-to-SVG show improvements in generation fidelity, spatial consistency, and sequence efficiency versus conventional tokenization approaches.
Related Articles

Black Hat Asia
AI Business
[N] Just found out that Milla Jovovich is a dev, invested in AI, and just open sourced a project
Reddit r/MachineLearning

ALTK‑Evolve: On‑the‑Job Learning for AI Agents
Hugging Face Blog

Context Windows Are Getting Absurd — And That's a Good Thing
Dev.to

Every AI Agent Registry in 2026, Compared
Dev.to