AI Navigate

GAPSL: A Gradient-Aligned Parallel Split Learning on Heterogeneous Data

arXiv cs.LG / 3/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • GAPSL is a gradient-aligned extension of parallel split learning (PSL) designed to mitigate gradient directional inconsistency across heterogeneous clients.
  • It introduces leader gradient identification (LGI) to dynamically select a set of directionally consistent client gradients to construct a leader gradient that captures the global convergence trend.
  • It also introduces gradient direction alignment (GDA), a direction-aware regularization that aligns each client's gradient with the leader gradient to improve convergence.
  • The approach leverages PSL’s server-side computation to reduce client-side load and eliminate client-side model aggregation, potentially lowering deployment costs.
  • Experiments on a prototype testbed show GAPSL achieves higher training accuracy and lower latency than state-of-the-art PSL benchmarks, demonstrating improved convergence on heterogeneous data.

Abstract

The increasing complexity of neural networks poses significant challenges for democratizing FL on resource?constrained client devices. Parallel split learning (PSL) has emerged as a promising solution by offloading substantial computing workload to a server via model partitioning, shrinking client-side computing load, and eliminating the client-side model aggregation for reduced communication and deployment costs. Since PSL is aggregation-free, it suffers from severe training divergence stemming from gradient directional inconsistency across clients. To address this challenge, we propose GAPSL, a gradient-aligned PSL framework that comprises two key components: leader gradient identification (LGI) and gradient direction alignment (GDA). LGI dynamically selects a set of directionally consistent client gradients to construct a leader gradient that captures the global convergence trend. GDA employs a direction-aware regularization to align each client's gradient with the leader gradient, thereby mitigating inter-device gradient directional inconsistency and enhancing model convergence. We evaluate GAPSL on a prototype computing testbed. Extensive experiments demonstrate that GAPSL consistently outperforms state-of-the-art benchmarks in training accuracy and latency.