Book Chapter10.1007/978-3-662-48096-0_50
Iterative Sparse Triangular Solves for Preconditioning
Hartwig Anzt,Edmond Chow,Jack Dongarra +2 more
- 24 Aug 2015
- pp 650-661
TL;DR: This work proposes using an iterative approach for solving sparse triangular systems when an approximation is suitable, and demonstrates the performance gains that this approach can have on GPUs in the context of solving sparse linear systems with a preconditioned Krylov subspace method.
read more
Abstract: Sparse triangular solvers are typically parallelized using level-scheduling techniques, but parallel efficiency is poor on high-throughput architectures like GPUs. We propose using an iterative approach for solving sparse triangular systems when an approximation is suitable. This approach will not work for all problems, but can be successful for sparse triangular matrices arising from incomplete factorizations, where an approximate solution is acceptable. We demonstrate the performance gains that this approach can have on GPUs in the context of solving sparse linear systems with a preconditioned Krylov subspace method. We also illustrate the effect of using asynchronous iterations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves
Weifeng Liu,Ang Li,JD Hogg,Iain S. Duff,Brian Vinter +4 more
- 24 Aug 2016
TL;DR: This paper proposes a novel approach for SpTRSV in which the ordering between components is naturally enforced within the solution stage, and is an order of magnitude faster for the preprocessing stage than existing methods.
97
•Posted Content
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing
Hartwig Anzt,Terry Cojean,Goran Flegar,Fritz Göbel,Thomas Grützmacher,Pratik Nayak,Tobias Ribizel,Yuhsiang Mike Tsai,Enrique S. Quintana-Ortí +8 more
TL;DR: This sophisticated software architecture that separates core algorithms from architecture-specific backends and provide details on extensibility and sustainability measures is introduced and Ginkgo’s usability is demonstrated by providing examples on how to use its functionality inside the MFEM and deal.
74
Fast synchronization‐free algorithms for parallel sparse triangular solves with multiple right‐hand sides
TL;DR: Novel approaches for SpTRSV and SpTRSM in which the ordering between components is naturally enforced within the solution stage are proposed, so the cost for preprocessing can be greatly reduced, and the synchronizations between sets are completely eliminated.
54
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning
Edmond Chow,Hartwig Anzt,Hartwig Anzt,Jennifer A. Scott,Jack Dongarra,Jack Dongarra,Jack Dongarra +6 more
TL;DR: It is shown that by using block Jacobi iterations, the range of problems for which such an approach can be effective is extended, and it is essential for the blocking to be cognizant of the matrix structure.
45
References
•Book
Iterative Methods for Sparse Linear Systems
Yousef Saad
- 01 Apr 2003
TL;DR: This chapter discusses methods related to the normal equations of linear algebra, and some of the techniques used in this chapter were derived from previous chapters of this book.
The university of Florida sparse matrix collection
Timothy A. Davis,Yifan Hu +1 more
TL;DR: The University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications, is described and a new multilevel coarsening scheme is proposed to facilitate this task.
4.3K
A flexible inner-outer preconditioned GMRES algorithm
TL;DR: A variant of the GMRES algorithm is presented that allows changes in the preconditioning at every step, resulting in a result of the flexibility of the new variant that any iterative method can be used as a preconditionser.
1.5K
The effect of ordering on preconditioned conjugate gradients
Iain S. Duff,Gérard Meurant +1 more
TL;DR: It is shown empirically that there can be a significant difference in the number of iterations required by the conjugate gradient method and reasons for this marked difference in performance are suggested.
372
Fine-Grained Parallel Incomplete LU Factorization
Edmond Chow,Aftab Patel +1 more
TL;DR: Numerical tests show that very few sweeps are needed to construct a factorization that is an effective preconditioner, and the amount of parallelism is large irrespective of the ordering of the matrix, and matrix ordering can be used to enhance the accuracy of the factorization rather than to increase parallelism.
213