MixerCA: An Efficient and Accurate Model for High-Performance Hyperspectral Image Classification

arXiv cs.CV / 4/30/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces MixerCA, a lightweight deep learning model designed specifically for hyperspectral image (HSI) classification by combining depthwise convolution with a self-attention mechanism.
  • MixerCA uses a unified architecture with depth-wise convolutions, token/channel mixing, and coordinate attention to separate spatial and spectral (channel) interactions while preserving resolution across the network.
  • The model directly processes HSI patches to better leverage the detailed continuous spectral information characteristic of hyperspectral data.
  • Experiments on four hyperspectral benchmark datasets show MixerCA achieves clear improvements over multiple baselines, including 2D/3D CNN variants and transformer-based approaches like ViT and Swin.
  • The authors provide publicly accessible source code via GitHub, enabling reproducibility and further experimentation.

Abstract

Over the past decade, hyperspectral image (HSI) classification has drawn considerable interest due to HSIs' ability to effectively distinguish terrestrial objects by capturing detailed, continuous spectral information. The strong performance of recent deep learning techniques in tasks like image classification and semantic segmentation has led to their growing use in HSI classification, due to their ability to capture complex spatial and spectral features more effectively than traditional methods. This paper presents MixerCA, a novel lightweight model for HSI classification that leverages depthwise convolution and a self-attention mechanism. MixerCA integrates depth-wise convolutions, token and channel mixing, and coordinate attention into a unified structure to decouple spatial and channel interactions, maintain consistent resolution throughout the network, and directly process HSI patches. Extensive experiments on four hyperspectral benchmark datasets reveal MixerCA's clear advantages over several competing algorithms, including 2D-CNN, 3D-CNN, Tri-CNN, HybridSN, ViT, and Swin Transformer. The source code is publicly available at https://github.com/mqalkhatib/MixerCA.