FecalFed: Privacy-Preserving Poultry Disease Detection via Federated Learning

arXiv cs.CV / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces FecalFed, a privacy-preserving federated learning framework to classify poultry diseases from fecal imaging without centralizing farm data.
  • It curates and releases the rigorously deduplicated dataset poultry-fecal-fl (8,770 unique images across four disease classes), reporting a 46.89% duplication rate removal from popular public repositories.
  • Experiments under highly heterogeneous, non-IID farm conditions (Dirichlet α=0.5) show that single-farm training collapses (64.86% accuracy) but federated learning recovers strong performance.
  • Using FedAdam with a Swin-Small model yields 90.31% accuracy, close to a centralized upper bound of 95.10%, while an edge-optimized Swin-Tiny achieves 89.74% for on-farm efficiency.
  • The authors position the approach as a scalable, privacy-first blueprint for real-world avian disease monitoring and food security.

Abstract

Early detection of highly pathogenic avian influenza (HPAI) and endemic poultry diseases is critical for global food security. While computer vision models excel at classifying diseases from fecal imaging, deploying these systems at scale is bottlenecked by farm data privacy concerns and institutional data silos. Furthermore, existing open-source agricultural datasets frequently suffer from severe, undocumented data contamination. In this paper, we introduce \textbf{FecalFed}, a privacy-preserving federated learning framework for poultry disease classification. We first curate and release \texttt{poultry-fecal-fl}, a rigorously deduplicated dataset of 8,770 unique images across four disease classes, revealing and eliminating a 46.89\% duplication rate in popular public repositories. To simulate realistic agricultural environments, we evaluate FecalFed under highly heterogeneous, non-IID conditions (Dirichlet \alpha=0.5). While isolated single-farm training collapses under this data heterogeneity, yielding only 64.86\% accuracy, our federated approach recovers performance without centralizing sensitive data. Specifically, utilizing server-side adaptive optimization (FedAdam) with a Swin-Small architecture achieves 90.31\% accuracy, closely approaching the centralized upper bound of 95.10\%. Furthermore, we demonstrate that an edge-optimized Swin-Tiny model maintains highly competitive performance at 89.74\%, establishing a highly efficient, privacy-first blueprint for on-farm avian disease monitoring.