Distributed Gradient Descent with Coded Partial Gradient Computations.

Open Access • Posted Content

- 22 Nov 2018

28

TL;DR: This work introduces a hybrid approach, called coded partial gradient computation (CPGC), that combines the advantages of coded and uncoded computation schemes and reduces both the computation time and the decoding complexity.

Citations

•Journal Article•10.1109/TSP.2020.2981904

Machine Learning at the Wireless Edge: Distributed Stochastic Gradient Descent Over-the-Air

Mohammad Mohammadi Amiri, +1 more

- 19 Mar 2020

- IEEE Transactions on Signal Processing

TL;DR: This work introduces a novel analog scheme, called A-DSGD, which exploits the additive nature of the wireless MAC for over-the-air gradient computation, and provides convergence analysis for this approach.

752

•Journal Article•10.1109/JSAIT.2020.2991361

Stochastic Gradient Coding for Straggler Mitigation in Distributed Learning

Rawad Bitar, +2 more

- 29 Apr 2020

TL;DR: Stochastic Gradient Coding (SGC) is an approximate gradient coding scheme for distributed gradient descent in the presence of stragglers, which works when the stragglers are random.

112

•Proceedings Article•10.1109/ISIT.2019.8849684

Speeding Up Distributed Gradient Descent by Utilizing Non-persistent Stragglers

Emre Ozfatura, +2 more

- 07 Jul 2019

TL;DR: The authors propose a coded distributed gradient descent (DGD) technique that trades off the average computation time against the communication load, and show that the average completion time per iteration can be reduced significantly at a reasonable increase in communication load.

107

•Journal Article•10.1109/COMST.2021.3091684

A Comprehensive Survey on Coded Distributed Computing: Fundamentals, Challenges, and Networking Applications

Jer Shyuan Ng, +7 more

- 23 Jun 2021

- IEEE Communications Surveys and Tutorial...

TL;DR: Coded distributed computing (CDC) combines coding-theoretic techniques with distributed computing and has recently been proposed as a promising solution to reduce communication load and straggler effects.

105

•Journal Article•10.1109/jstsp.2021.3137028

Distributed Few-Shot Learning for Intelligent Recognition of Communication Jamming

Mingqian Liu, +5 more

- 01 Apr 2022

- IEEE Journal of Selected Topics in Signa...

TL;DR: A novel jamming recognition method based on distributed few-shot learning that employs a distributed recognition architecture to achieve the global optimization of multiple sub-networks by federated learning and introduces a dense block structure in the sub-network structure to improve network information flow.

70

References

•Journal Article•10.1137/16M1080173

Optimization Methods for Large-Scale Machine Learning

Léon Bottou, +2 more

- 08 May 2018

- SIAM Review

TL;DR: The authors provide a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications, and discuss how optimization problems arise in machine learning and what makes them challenging.

3.7K

•Proceedings Article

Gradient Coding: Avoiding Stragglers in Distributed Learning

Rashish Tandon, +3 more

- 17 Jul 2017

TL;DR: This work proposes a novel coding-theoretic framework for mitigating stragglers in distributed learning and shows how carefully replicating data blocks and coding across gradients can provide tolerance to failures and stragglers for synchronous gradient descent.

583

•Posted Content

Near-Optimal Straggler Mitigation for Distributed Gradient Methods

Songze Li, +3 more

- 27 Oct 2017

- arXiv: Information Theory

TL;DR: This work proves that the proposed Batched Coupon's Collector (BCC) scheme is robust to a near-optimal number of random stragglers, and reduces the run-time by up to 85.4% over Amazon EC2 clusters when compared with other straggler mitigation strategies.

60

•Posted Content

Slow and Stale Gradients Can Win the Race

Sanghamitra Dutta, +2 more

- 23 Mar 2020

- arXiv: Machine Learning

TL;DR: This work presents a novel theoretical characterization of the speed-up offered by asynchronous SGD methods by analyzing the trade-off between the error in the trained model and the actual training runtime (wallclock time).

•Posted Content

$C^{3}LES$: Codes for Coded Computation that Leverage Stragglers

Anindya Bijoy Das, +2 more

- 17 Sep 2018

- arXiv: Information Theory

TL;DR: A fine-grained model is proposed that quantifies the level of non-trivial coding needed to obtain the benefits of coding in matrix-vector computation and allows us to leverage partial computations performed by the straggler nodes.

Related Papers (5)

Gradient Coding with Clustering and Multi-Message Communication

Computation Efficient Coded Linear Transform

Age-Based Coded Computation for Bias Reduction in Distributed Learning

Numerically Stable Binary Gradient Coding

Cross-Iteration Coded Computing