An Efficient Framework for Clustered Federated Learning

doi:10.1109/tit.2022.3192506

Open AccessJournal Article10.1109/tit.2022.3192506

An Efficient Framework for Clustered Federated Learning

01 Dec 2022

- IEEE Transactions on Information Theory

- Vol. 68, Iss: 12, pp 8076-8091

220

TL;DR: The Iterative Federated Clustering Algorithm (IFCA) as discussed by the authors is proposed to estimate the cluster identities of the users and optimize model parameters for the user clusters via gradient descent.

Abstract: We address the problem of federated learning (FL) where users are distributed and partitioned into clusters. This setup captures settings where different groups of users have their own objectives (learning tasks) but by aggregating their data with others in the same cluster (same learning task), they can leverage the strength in numbers in order to perform more efficient federated learning. For this new framework of clustered federated learning, we propose the Iterative Federated Clustering Algorithm (IFCA), which alternately estimates the cluster identities of the users and optimizes model parameters for the user clusters via gradient descent. We analyze the convergence rate of this algorithm first in a linear model with squared loss and then for generic strongly convex and smooth loss functions. We show that in both settings, with good initialization, IFCA is guaranteed to converge, and discuss the optimality of the statistical error rate. In particular, for the linear model with two clusters, we can guarantee that our algorithm converges as long as the initialization is slightly better than random. When the clustering structure is ambiguous, we propose to train the models by combining IFCA with the weight sharing technique in multi-task learning. In the experiments, we show that our algorithm can succeed even if we relax the requirements on initialization with random initialization and multiple restarts. We also present experimental results showing that our algorithm is efficient in non-convex problems such as neural networks. We demonstrate the benefits of IFCA over the baselines on several clustered FL benchmarks.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

Qinbin Li, +7 more

- 23 Jul 2019

- arXiv: Learning

TL;DR: A comprehensive review of federated learning systems can be found in this paper, where the authors provide a thorough categorization of the existing systems according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation.

...read moreread less

703

•Journal Article•10.1109/tkde.2021.3124599

A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

01 Apr 2023

- IEEE Transactions on Knowledge and Data ...

TL;DR: A comprehensive review on federated learning systems can be found in this paper , where the authors provide a thorough categorization of federated Learning systems according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation.

...read moreread less

587

•Journal Article•10.1016/J.NEUCOM.2021.07.098

Federated learning on non-IID data: A survey

Hangyu Zhu, +3 more

- 20 Nov 2021

- Neurocomputing

TL;DR: In this article, a detailed analysis of the influence of non-IID data on both parametric and non-parametric machine learning models in both horizontal and vertical federated learning is provided.

...read moreread less

504

•Journal Article•10.1109/JSAC.2021.3118346

Distributed Learning in Wireless Networks: Recent Progress and Future Challenges

Mingzhe Chen, +6 more

- 06 Oct 2021

- IEEE Journal on Selected Areas in Commun...

TL;DR: In this paper, the authors provide a comprehensive study of how distributed learning can be efficiently and effectively deployed over wireless edge networks, including federated learning, federated distillation, distributed inference, and multi-agent reinforcement learning.

...read moreread less

336

•Posted Content

Multi-Center Federated Learning

Ming Xie, +5 more

- 03 May 2020

- arXiv: Learning

TL;DR: This paper proposes a novel multi-center aggregation mechanism for federated learning, which learns multiple global models from the non-IID user data and simultaneously derives the optimal matching between users and centers.

...read moreread less

188

...

Expand

References

Journal Article•10.1109/5.726791

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

- 01 Jan 1998

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

53.5K

•Dissertation

Learning Multiple Layers of Features from Tiny Images

Alex Krizhevsky

- 01 Jan 2009

TL;DR: In this paper, the authors describe how to train a multi-layer generative model of natural images, using a dataset of millions of tiny colour images, described in the next section.

...read moreread less

23.7K

•Journal Article•10.1109/TIT.1982.1056489

Least squares quantization in PCM

S. P. Lloyd

- 01 Mar 1982

- IEEE Transactions on Information Theory

TL;DR: In this article, the authors derived necessary conditions for any finite number of quanta and associated quantization intervals of an optimum finite quantization scheme to achieve minimum average quantization noise power.

...read moreread less

16K

•Journal Article•10.1023/A:1007379606734

Multitask Learning

Rich Caruana

- 01 Jul 1997

TL;DR: Multi-task Learning (MTL) as mentioned in this paper is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias.

...read moreread less

8K

Journal Article•10.1364/AO.21.002758

Phase retrieval algorithms: a comparison.

James R. Fienup

- 01 Aug 1982

- Applied Optics

TL;DR: Iterative algorithms for phase retrieval from intensity data are compared to gradient search methods and it is shown that both the error-reduction algorithm for the problem of a single intensity measurement and the Gerchberg-Saxton algorithm forThe problem of two intensity measurements converge.

...read moreread less

6K