Numerically Stable Binary Gradient Coding
Neophytos Charalambides,Hessam Mahdavifar,Alfred O. Hero +2 more
- 01 Jun 2020
- pp 2622-2627
19
TL;DR: In this paper, the authors present a binary gradient coding scheme for distributed computation of the gradient of an objective function in a distributed network, which avoids operations over real or complex numbers and is inherently numerically unstable.
read more
Abstract: A major hurdle in machine learning is scalability to massive datasets. One approach to overcoming this is to distribute the computational tasks among several workers. Gradient coding has been recently proposed in distributed optimization to compute the gradient of an objective function using multiple, possibly unreliable, worker nodes. By designing distributed coded schemes, gradient coded computations can be made resilient to stragglers, nodes with longer response time compared to other nodes in a distributed network. Most such schemes rely on operations over the real or complex numbers and are inherently numerically unstable. We present a binary scheme which avoids such operations, thereby enabling numerically stable distributed computation of the gradient. Also, some restricting assumptions in prior work are dropped, and a more efficient decoding is given.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Analog Lagrange Coded Computing
Mahdi Soleymani,Hessam Mahdavifar,A. Salman Avestimehr +2 more
- 02 Feb 2021
TL;DR: In this paper, the authors proposed a novel extension of LCC to the analog domain, referred to as analog LCC (ALCC), where all the operations in the proposed ALCC protocol are done over the infinite fields of ${ √ R}/ { √ C}$ but for practical implementations floating-point numbers are used.
48
Weighted Gradient Coding with Leverage Score Sampling
Neophytos Charalambides,Mert Pilanci,Alfred O. Hero +2 more
- 01 May 2020
TL;DR: A novel weighted leverage score approach is presented, that achieves improved performance for distributed gradient coding by utilizing an importance sampling and provides a compressed approximation of a data matrix using an importance weighted subset.
15
List-Decodable Coded Computing: Breaking the Adversarial Toleration Barrier
Mahdi Soleymani,Ramy E. Ali,Hessam Mahdavifar,A. Salman Avestimehr +3 more
- 06 Aug 2021
TL;DR: In this article, the authors leverage list-decoding techniques for folded Reed-Solomon codes and propose novel algorithms to recover the correct codeword using side information, where the master node can perform carefully designed extra computations to obtain the side information.
14
Approximate Weighted C R Coded Matrix Multiplication
Neophytos Charalambides,Mert Pilanci,Alfred O. Hero +2 more
- 06 Jun 2021
TL;DR: In this paper, a weighted coded matrix multiplication method was proposed to solve the problem of matrix multiplication with high dimensionality, which can occur for large data size or feature dimension. But, it presents a major computational bottleneck when the matrix dimension is high, and two different approaches to overcome this bottleneck are: 1) low rank approximation of the matrix product; and 2) distributed computation.
13
ϵ -Approximate Coded Matrix Multiplication Is Nearly Twice as Efficient as Exact Multiplication
Haewon Jeong,Ateet Devulapalli,Viveck R. Cadambe,Flavio P. Calmon +3 more
- 09 Aug 2021
TL;DR: In this paper, it was shown that the matrix product can be recovered with a relatively small relative error from any node in a distributed matrix multiplication system, for any ϵ > 0.
11
References
Speeding Up Distributed Machine Learning Using Codes
TL;DR: In this paper, the authors provide theoretical insights on how coded solutions can achieve significant gains compared to uncoded ones for matrix multiplication and data shuffling in large-scale distributed systems.
923
Speeding Up Distributed Machine Learning Using Codes
TL;DR: In this paper, the authors provide theoretical insights on how coded solutions can achieve significant gains compared with uncoded ones for matrix multiplication and data shuffling in large-scale distributed systems.
798
•Proceedings Article
Gradient Coding: Avoiding Stragglers in Distributed Learning
Rashish Tandon,Qi Lei,Alexandros G. Dimakis,Nikos Karampatziakis +3 more
- 17 Jul 2017
TL;DR: This work proposes a novel coding theoretic framework for mitigating stragglers in distributed learning and shows how carefully replicating data blocks and coding across gradients can provide tolerance to failures andstragglers for synchronous Gradient Descent.
High-dimensional coded matrix multiplication
Kangwook Lee,Changho Suh,Kannan Ramchandran +2 more
- 01 Jun 2017
TL;DR: This work studies coded computation involving large matrix multiplication where both matrices are large, and proposes a new coded computation scheme, which is called product-coded matrix multiplication, which reveals interesting insights into which schemes perform best in which regimes.
330
•Proceedings Article
Short-Dot: Computing Large Linear Transforms Distributedly Using Coded Short Dot Products
Sanghamitra Dutta,Viveck R. Cadambe,Pulkit Grover +2 more
- 01 Jan 2016
TL;DR: In this article, the authors propose a technique called Short-Dot to reduce the number of redundant computations in a coding theory-inspired fashion for computing linear transforms of long vectors.
283