RPBA-Net: An Interpretable Residual Pyramid Bilateral Affine Network for RAW-Domain ISP Enhancement

arXiv cs.CV / 5/6/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces RPBA-Net, an interpretable residual pyramid bilateral affine network aimed at improving RAW-domain ISP tasks such as demosaicing, color correction, and detail enhancement while reducing fragmented modules.
  • RPBA-Net takes packed RAW input and performs residual affine base reconstruction by estimating a base RGB representation and applying identity-guided residual affine corrections to unify demosaicing and enhancement.
  • It uses pyramid bilateral affine grids with guide-driven autoregressive adaptive slicing and adaptive cross-layer fusion to model global tone restoration and local texture enhancement in a hierarchical manner.
  • The method adds smoothness, cross-scale consistency, and magnitude regularization terms to enhance stability, controllability, and structural interpretability.
  • Experiments report that RPBA-Net outperforms representative RAW-to-SRGB approaches, achieving state-of-the-art reconstruction fidelity and perceptual quality with low complexity suitable for mobile and embedded deployment.

Abstract

To address module fragmentation, uninterpretable mappings, and deployment constraints in RAW-domain demosaicing, color correction, and detail enhancement, this paper proposes RPBA-Net, an interpretable residual pyramid bilateral affine network for RAW-domain ISP enhancement. Given packed RAW as input, the method performs residual affine base reconstruction by estimating a base RGB representation and learning identity-guided residual affine corrections, thereby unifying demosaicing and enhancement. It further builds pyramid bilateral affine grids and combines guide-driven autoregressive adaptive slicing with adaptive cross-layer fusion to hierarchically model global tone restoration and local texture enhancement. In addition, smoothness, cross-scale consistency, and magnitude regularization terms are introduced to improve model stability, controllability, and structural interpretability. Extensive experiments demonstrate that RPBA-Net surpasses representative RAW-to-sRGB methods and achieves state-of-the-art performance in reconstruction fidelity and perceptual quality, while maintaining low model complexity and strong deployment potential for mobile and embedded platforms.