Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
arXiv cs.CV / 3/19/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Omni IIE Bench is introduced to diagnose the editing consistency of image editing models across tasks with varying semantic scales for practical applications.
- The benchmark uses a dual-track diagnostic design: Single-turn Consistency with shared-context task pairs and Multi-turn Coordination involving continuous dialogue tasks across semantic scales.
- It is built through a rigorous multi-stage human filtering process, with quality validation by computer vision graduate students and industry relevance review by professional designers.
- The authors evaluate 8 mainstream IIE models and find a prevalent performance degradation when moving from low-semantic-scale to high-semantic-scale tasks.
- Omni IIE Bench provides diagnostic tools and insights intended to drive the development of next-generation, more reliable and stable IIE models.




