Selective Depthwise Separable Convolution for Lightweight Joint Source-Channel Coding in Wireless Image Transmission

arXiv cs.CV / 4/27/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how and where to replace standard convolution layers with depthwise separable convolution (DSConv) layers in deep learning-based joint source-channel coding (JSCC) for wireless image transmission.
  • It proposes a configurable lightweight JSCC framework that lets researchers selectively swap Conv→DSConv at chosen layer positions and replacement ratios, trading off compression against reconstruction quality.
  • The authors find that replacing Conv layers at intermediate encoder/decoder depths provides a better complexity–performance trade-off, indicating redundancy at certain layers in DL-based JSCC.
  • Experimental results show substantial parameter reduction while incurring only slight performance degradation, supporting flexible compute/quality balancing for resource-constrained edge devices.
  • The work extends beyond ratio-only analysis by examining both layer-wise replacement strategies and depth-dependent effects under fixed replacement proportions.

Abstract

Depthwise separable convolutional (DSConv) layers have been successfully applied to deep learning (DL)-based joint source-channel coding (JSCC) schemes to reduce computational complexity. However, a systematic investigation of the layerwise and ratio-wise replacement of standard convolutional (Conv) layers with DSConv layers in JSCC systems for wireless image transmission remains largely unexplored. In this letter, we propose a configurable lightweight JSCC framework that incorporates a selective replacement strategy, enabling flexible substitution of standard Conv layers with DSConv layers at various layer positions and replacement ratios. By adjusting the proportion of layers replaced, we achieve different model compression levels and analyze their impact on reconstruction performance. Furthermore, we investigate how replacements at different encoder and decoder depths influence reconstruction quality under a fixed replacement ratio. Our results show that Conv-to-DSConv replacement at intermediate layers achieves a favorable complexity-performance trade-off, revealing layer-wise redundancy in DL-based JSCC systems. Extensive experiments further demonstrate that the proposed framework achieves substantial parameter reduction with only slight performance degradation, enabling flexible complexity-performance trade-offs for resource-constrained edge devices.