HyperFM: An Efficient Hyperspectral Foundation Model with Spectral Grouping

arXiv cs.CV / 4/24/2026

📰 NewsSignals & Early TrendsIndustry & Market MovesModels & Research

Key Points

  • HyperFM is introduced as a parameter-efficient hyperspectral foundation model aimed at interpreting NASA PACE mission data, which contains hundreds of densely sampled wavelength bands.
  • The model uses intra-group and inter-group spectral attention plus hybrid parameter decomposition to better capture spectral-spatial relationships while lowering computational cost.
  • Experiments show HyperFM improves performance consistently over prior hyperspectral foundation models and strong task-specific baselines on four benchmark atmospheric cloud property retrieval tasks.
  • The authors also release HyperFM250K, a PACE-derived large-scale dataset covering both clear and cloudy scenes to support further research despite the labeling and scaling challenges of hyperspectral data.

Abstract

The NASA PACE mission provides unprecedented hyperspectral observations of ocean color, aerosols, and clouds, offering new insights into how these components interact and influence Earth's climate and air quality. Its Ocean Color Instrument measures light across hundreds of finely spaced wavelength bands, enabling detailed characterization of features such as phytoplankton composition, aerosol properties, and cloud microphysics. However, hyperspectral data of this scale is large, complex, and difficult to label, requiring specialized processing and analysis techniques. Existing foundation models, which have transformed computer vision and natural language processing, are generally trained on standard RGB imagery and therefore struggle to interpret the continuous spectral signatures captured by PACE. While recent advances have introduced hyperspectral foundation models, they are typically trained on cloud-free observations and often remain limited to single-sensor datasets due to spectral inconsistencies across instruments. Moreover, existing models tend to be parameter-heavy and computationally expensive, limiting scalability and adoption in operational settings. To address these challenges, we introduce HyperFM, a parameter-efficient hyperspectral foundation model that leverages intra-group and inter-group spectral attention along with hybrid parameter decomposition to better capture spectral spatial relationships while reducing computational cost. HyperFM demonstrates consistent performance improvements over existing hyperspectral foundation models and task-specific state-of-the-art methods across four benchmark downstream atmospheric cloud property retrieval tasks. To support further research, we additionally release HyperFM250K, a large-scale hyperspectral dataset from the PACE mission that includes both clear and cloudy scenes.