CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers
arXiv cs.CV / 4/22/2026
📰 NewsModels & Research
Key Points
- CreatiParser presents a hybrid generative approach to convert raster graphic design images into editable layers, specifically separating text, background, and sticker components.
- The method parses text regions into a text rendering protocol using a vision-language model for more faithful reconstruction and easier downstream re-editing.
- Background and sticker layers are generated with a multi-branch diffusion architecture that supports RGBA outputs, aiming to improve controllability versus prior multi-stage pipelines.
- To better match human aesthetic preferences, the work introduces ParserReward and trains the system using Group Relative Policy Optimization, reporting an overall 23.7% average improvement across metrics on Parser-40K and Crello datasets.
- Experiments on two challenging datasets show the proposed framework outperforms existing graphic design parsing methods, addressing error accumulation and limited edit control.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles
Autoencoders and Representation Learning in Vision
Dev.to
Google Stitch 2.0: Senior-Level UI in Seconds, But Editing Still Breaks
Dev.to
Context Bloat in AI Agents
Dev.to

We open sourced the AI dev team that builds our product
Dev.to

Intel LLM-Scaler vllm-0.14.0-b8.2 released with official Arc Pro B70 support
Reddit r/artificial