360{\deg} Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method
arXiv cs.CV / 3/18/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- 360Bench is introduced as a 7K-resolution 360-degree image VQA benchmark with seven tasks and human-annotated data to evaluate seven MLLMs and six enhancement methods.
- The benchmark reveals current MLLMs struggle with 360-degree perception due to geometric distortion and complex spatial relations.
- The authors propose Free360, a training-free, scene-graph-based framework that decomposes reasoning into modular steps and uses adaptive spherical transformations for 360-degree images to form a unified graph for answer generation.
- Experiments show Free360 consistently improves base MLLMs and provides a strong training-free solution for 360-degree VQA, with source code and dataset to be released upon acceptance.
- The work highlights a new research direction for 360-degree visual reasoning in MLLMs and establishes a public benchmark to drive future improvements.
Related Articles
We asked 200 ChatGPT users their biggest frustration. All top 5 answers are problems ChatGPT Toolbox solves.
Reddit r/artificial
I Built an AI That Reviews Every PR for Security Bugs — Here's How (2026)
Dev.to
[R] Combining Identity Anchors + Permission Hierarchies achieves 100% refusal in abliterated LLMs — system prompt only, no fine-tuning
Reddit r/MachineLearning
How I Built an AI SDR Agent That Finds Leads and Writes Personalized Cold Emails
Dev.to
Complete Guide: How To Make Money With Ai
Dev.to