Federated Transfer Learning with Differential Privacy

arXiv stat.ML / 4/7/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a federated transfer learning framework that tackles both cross-site data heterogeneity and privacy protection for local datasets.
  • It formalizes “federated differential privacy,” providing per-dataset privacy guarantees without relying on a trusted central server.
  • The authors analyze four core statistical tasks (mean estimation, low-/high-dimensional linear regression, and M-estimation) under this privacy model and derive minimax rates.
  • They quantify the trade-offs introduced by privacy and heterogeneity, showing that federated differential privacy sits between local and central differential privacy in terms of privacy strength.
  • The results characterize the fundamental costs of each factor while clarifying when and how knowledge transfer can improve learning in federated settings.

Abstract

Federated learning has emerged as a powerful framework for analysing distributed data, yet two challenges remain pivotal: heterogeneity across sites and privacy of local data. In this paper, we address both challenges within a federated transfer learning framework, aiming to enhance learning on a target data set by leveraging information from multiple heterogeneous source data sets while adhering to privacy constraints. We rigorously formulate the notion of federated differential privacy, which offers privacy guarantees for each data set without assuming a trusted central server. Under this privacy model, we study four statistical problems: univariate mean estimation, low-dimensional linear regression, high-dimensional linear regression, and M-estimation. By investigating the minimax rates and quantifying the cost of privacy, we show that federated differential privacy is an intermediate privacy model between the well-established local and central models of differential privacy. Our analyses account for data heterogeneity and privacy, highlighting the fundamental costs associated with each factor and the benefits of knowledge transfer in federated learning.