AI Navigate

OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution

arXiv cs.CV / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • OARS is a process-aware online alignment framework for generative real-world image super-resolution that addresses the perception-fidelity trade-off under unknown degradations.
  • It uses COMPASS, an MLLM-based reward that jointly models fidelity preservation and perceptual gain with an input-quality-adaptive trade-off.
  • The authors curate COMPASS-20K spanning synthetic and real degradations and introduce a three-stage perceptual annotation pipeline yielding calibrated, fine-grained training labels.
  • OARS performs progressive online alignment, moving from cold-start flow matching to full-reference and finally reference-free RL via shallow LoRA optimization for on-policy exploration.
  • Experiments and user studies show consistent perceptual improvements while maintaining fidelity and achieving state-of-the-art performance on Real-ISR benchmarks.

Abstract

Aligning generative real-world image super-resolution models with human visual preference is challenging due to the perception--fidelity trade-off and diverse, unknown degradations. Prior approaches rely on offline preference optimization and static metric aggregation, which are often non-interpretable and prone to pseudo-diversity under strong conditioning. We propose OARS, a process-aware online alignment framework built on COMPASS, a MLLM-based reward that evaluates the LR to SR transition by jointly modeling fidelity preservation and perceptual gain with an input-quality-adaptive trade-off. To train COMPASS, we curate COMPASS-20K spanning synthetic and real degradations, and introduce a three-stage perceptual annotation pipeline that yields calibrated, fine-grained training labels. Guided by COMPASS, OARS performs progressive online alignment from cold-start flow matching to full-reference and finally reference-free RL via shallow LoRA optimization for on-policy exploration. Extensive experiments and user studies demonstrate consistent perceptual improvements while maintaining fidelity, achieving state-of-the-art performance on Real-ISR benchmarks.