AI Navigate

RASLF: Representation-Aware State Space Model for Light Field Super-Resolution

arXiv cs.CV / 3/18/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • RASLF introduces a representation-aware state-space framework for light field super-resolution that leverages cross-representation correlations to improve texture fidelity and view consistency.
  • It features a Progressive Geometric Refinement (PGR) block that uses a panoramic epipolar representation to explicitly encode multi-view parallax differences and enable integration across different LF representations.
  • It proposes a Representation Aware Asymmetric Scanning (RAAS) mechanism that dynamically adjusts scanning paths based on the properties of each representation space and uses path pruning to balance performance and efficiency.
  • It includes a Dual-Anchor Aggregation (DAA) module to improve hierarchical feature flow and reduce redundant deep features, prioritizing key reconstruction information, with experiments showing state-of-the-art accuracy and efficiency.

Abstract

Current SSM-based light field super-resolution (LFSR) methods often fail to fully leverage the complementarity among various LF representations, leading to the loss of fine textures and geometric misalignments across views. To address these issues, we propose RASLF, a representation-aware state-space framework that explicitly models structural correlations across multiple LF representations. Specifically, a Progressive Geometric Refinement (PGR) block is created that uses a panoramic epipolar representation to explicitly encode multi-view parallax differences, thereby enabling integration across different LF representations. Furthermore, we introduce a Representation Aware Asymmetric Scanning (RAAS) mechanism that dynamically adjusts scanning paths based on the physical properties of different representation spaces, optimizing the balance between performance and efficiency through path pruning. Additionally, a Dual-Anchor Aggregation (DAA) module improves hierarchical feature flow, reducing redundant deeplayer features and prioritizing important reconstruction information. Experiments on various public benchmarks show that RASLF achieves the highest reconstruction accuracy while remaining highly computationally efficient.