Hierarchical Spatio-Channel Clustering for Efficient Model Compression in Medical Image Analysis

arXiv cs.CV / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces a hierarchical spatio-channel low-rank compression framework for CNNs that targets both spatial redundancy and localized channel co-activation within convolutional feature maps.
  • Instead of applying a single uniform decomposition to an entire layer, it partitions feature maps into spatial regions, clusters channels by co-activation patterns per region, and performs rank-adaptive SVD for each spatio-channel cluster.
  • Experiments on an AlexNet-based brain tumor MRI classification model show that the proposed method outperforms Global SVD and Tucker decomposition under 3× and 6× compression budgets.
  • Under the reported evaluation, it reduces FLOPs from 8.21G to 1.55G (an 81.1% reduction), yields a 1.38× inference speed-up, and improves classification accuracy from 87.76% to 89.80%.
  • The authors provide analysis of compression–accuracy trade-offs with Pareto-optimal configurations and report bootstrap standard errors across classification metrics, including better performance on difficult classes like meningioma.

Abstract

Convolutional neural networks (CNNs) have become increasingly difficult to deploy in resource-constrained environments due to their large memory and computational requirements. Although low-rank compression methods can reduce this burden, most existing approaches compress spatial and channel redundancy independently and therefore do not fully exploit the localised structure within convolutional feature maps. This paper proposes a hierarchical spatio-channel low-rank compression framework for CNNs that exploits redundancy across spatial regions and channel activations. Unlike conventional methods, which apply a uniform decomposition across an entire layer, the proposed approach first partitions feature maps into spatial regions, then groups channels according to their co-activation patterns within each region, and finally applies rank-adaptive SVD to each resulting spatio-channel cluster. The method is evaluated on an AlexNet-based brain tumour MRI classification model and compared with Global SVD and Tucker decomposition under \(3\times\) and \(6\times\) compression budgets. Our method outperforms both baselines, reducing FLOPs from \(8.21\,\mathrm{G}\) to \(1.55\,\mathrm{G}\) (\(81.1\%\) reduction), achieving a \(1.38\times\) inference speed-up, and increasing classification accuracy from \(87.76\%\) to \(89.80\%\). The method also improves the macro \(F_1\)-score and performance on challenging classes such as meningioma. A hyper-parameter trade-off analysis demonstrates that the framework provides Pareto-optimal configurations, enabling control over the balance between compression and predictive performance. Moderate clustering with adaptive rank selection yields strong results. Bootstrap standard errors are reported for all classification metrics.