Proofpoint Inc, USA.
Article DOI: 10.30574/wjaets.2025.15.2.0523
Received on 22 March 2025; revised on 03 May 2025; accepted on 05 May 2025
Federated learning enables machine learning across distributed devices without centralizing sensitive data, preserving privacy while creating intelligent systems from collective knowledge. Data heterogeneity, the natural variation in information across participating devices, presents significant challenges including convergence instability, model bias, communication inefficiency, privacy-utility tradeoffs, and computational imbalance. Despite these obstacles, heterogeneity offers advantages like improved model generalization, personalization opportunities, greater real-world applicability, enhanced privacy protection, and better fault tolerance when properly managed. Current solutions address these challenges through personalized federated learning, robust aggregation methods, federated distillation, client clustering, and adaptive participation strategies, while future directions focus on developing advanced heterogeneity metrics, cross-organizational techniques, dynamic adaptation mechanisms, hardware-aware algorithms, theoretical foundations, and standardized benchmarks to further enhance performance in diverse data environments
Adaptation; Decentralization; Heterogeneity; Personalization; Privacy
Preview Article PDF
Kuldeep Deshwal. Understanding data heterogeneity in federated learning. World Journal of Advanced Engineering Technology and Sciences, 2025, 15(02), 530-540. Article DOI: https://doi.org/10.30574/wjaets.2025.15.2.0523.