HeadlinesBriefing favicon HeadlinesBriefing.com

Federated Learning Basics: Training Models Where Data Lives

Towards Data Science •
×

Federated learning (FL) enables machine learning models to be trained on distributed, privacy‑sensitive data without moving the raw information to a central server. The approach addresses the growing challenge that up to 97% of valuable data in sectors such as healthcare remains unused because it is fragmented across hospitals, devices, and edge systems. By sending the model to the data, FL allows each client—whether a smartphone, IoT sensor, or hospital—to perform local training and return only model updates.

This paradigm reduces privacy risk, complies with local regulations, and mitigates bandwidth constraints. Real‑world deployments include Google’s Gboard on Android, where predictive text features improve across hundreds of millions of devices, and Curial AI’s COVID‑screening system that trained across multiple NHS hospitals. Academic collaborations, such as those between UCL and Moorfields Eye Hospital, demonstrate FL’s potential for fine‑tuning vision foundation models on sensitive eye‑scan data.

The article also outlines FL variants—cross‑device versus cross‑silo, horizontal versus vertical—highlighting how different data partitioning strategies suit distinct industry needs. Understanding these fundamentals is essential for organizations seeking to unlock siloed data while preserving confidentiality.