360{\deg} Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
arXiv cs.CV / 3/18/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- 360Bench is introduced as a 7K-resolution 360-degree image VQA benchmark with seven tasks and human-annotated data to evaluate seven MLLMs and six enhancement methods.
- The benchmark reveals current MLLMs struggle with 360-degree perception due to geometric distortion and complex spatial relations.
- The authors propose Free360, a training-free, scene-graph-based framework that decomposes reasoning into modular steps and uses adaptive spherical transformations for 360-degree images to form a unified graph for answer generation.
- Experiments show Free360 consistently improves base MLLMs and provides a strong training-free solution for 360-degree VQA, with source code and dataset to be released upon acceptance.
- The work highlights a new research direction for 360-degree visual reasoning in MLLMs and establishes a public benchmark to drive future improvements.
Related Articles
ベテランの若手育成負担を減らせ、PLC制御の「ラダー図」をAIで生成
日経XTECH
Hey dev.to community – sharing my journey with Prompt Builder, Insta Posts, and practical SEO
Dev.to
Why Regex is Not Enough: Building a Deterministic "Sudo" Layer for AI Agents
Dev.to
Perplexity Hub
Dev.to
How to Build Passive Income with AI in 2026: A Developer's Practical Guide
Dev.to