
What is Federated Learning?
Federated learning is a machine learning approach where a model is trained across multiple decentralized devices or servers, each holding local data, without that data ever leaving the device. Instead of centralizing data, only model updates (gradients) are shared and aggregated.
Why It Matters
Federated learning solves the fundamental tension between AI training (which needs lots of data) and privacy (which forbids centralizing sensitive data). It enables AI on medical records, financial data, and personal devices while complying with GDPR and other regulations. Apple uses it for Siri and predictive text, Google for Gboard β your phone's data never leaves your phone.
How It Works
- Initialization β a central server sends a global model to all participating devices/clients
- Local training β each device trains the model on its local data for several epochs
- Update sharing β devices send only model weight updates (gradients) to the server, not the raw data
- Aggregation β the server aggregates updates from all clients (typically using Federated Averaging / FedAvg)
- Distribution β the updated global model is sent back to all devices
- Repeat β the process continues for multiple rounds until convergence
Key challenges: