MatRes: Zero-Shot Test-Time Model Adaptation for Simultaneous Matching and Restoration

arXiv cs.CV / 4/14/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • MatRes is presented as a zero-shot test-time adaptation framework that jointly improves image restoration and geometric correspondence estimation when viewpoint changes and degradations coexist.
  • The method leverages a single low-quality/high-quality image pair at inference time, enforcing conditional similarity at corresponding locations to coordinate restoration and matching.
  • MatRes updates only lightweight modules while freezing all pretrained components, avoiding offline training and any additional supervision.
  • Experiments reported across many restoration–matching combinations show MatRes delivers significant gains in both image quality and geometric alignment versus using restoration-only or matching-only approaches.
  • The authors position MatRes as a practical solution for real-world multi-image capture settings where matching and restoration otherwise interfere if treated independently.

Abstract

Real-world image pairs often exhibit both severe degradations and large viewpoint changes, making image restoration and geometric matching mutually interfering tasks when treated independently. In this work, we propose MatRes, a zero-shot test-time adaptation framework that jointly improves restoration quality and correspondence estimation using only a single low-quality and high-quality image pair. By enforcing conditional similarity at corresponding locations, MatRes updates only lightweight modules while keeping all pretrained components frozen, requiring no offline training or additional supervision. Extensive experiments across diverse combinations show that MatRes yields significant gains in both restoration and geometric alignment compared to using either restoration or matching models alone. MatRes offers a practical and widely applicable solution for real-world scenarios where users commonly capture multiple images of a scene with varying viewpoints and quality, effectively addressing the often-overlooked mutual interference between matching and restoration.