One Model, Two Minds: Task-Conditioned Reasoning for Unified Image Quality and Aesthetic Assessment
arXiv cs.CV / 3/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The authors identify a fundamental mismatch in unifying IQA and IAA with a single reasoning strategy: IQA relies on concise, low-level perceptual cues, while IAA requires deliberative semantic judgment.
- They propose TATAR, a unified framework that shares a visual-language backbone but conditions post-training on each task’s nature.
- TATAR combines fast–slow task-specific reasoning construction, two-stage SFT+GRPO learning to establish task-aware priors before reward refinement, and asymmetric rewards that apply Gaussian score shaping for IQA and Thurstone-style completion ranking for IAA.
- Across in-domain and cross-domain settings, TATAR consistently outperforms prior unified baselines on both tasks, remains competitive with task-specific models, and yields more stable training dynamics for aesthetic assessment.
- The code for TATAR is publicly available at the authors' GitHub repository.
Related Articles
GDPR and AI Training Data: What You Need to Know Before Training on Personal Data
Dev.to
Edge-to-Cloud Swarm Coordination for heritage language revitalization programs with embodied agent feedback loops
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
AI Crawler Management: The Definitive Guide to robots.txt for AI Bots
Dev.to
Data Sovereignty Rules and Enterprise AI
Dev.to