Journal Article10.1137/S0895479897317685
An Asynchronous Parallel Supernodal Algorithm for Sparse Gaussian Elimination
283
TL;DR: An efficient parallel algorithm that overcomes the difficulty of implementing Gaussian elimination with partial pivoting on parallel machines, using a graph reduction technique and a supernode-panel computational kernel for high single processor utilization and scheduling two types of parallel tasks for a high level of concurrency.
read more
Abstract: Although Gaussian elimination with partial pivoting is a robust algorithm to solve unsymmetric sparse linear systems of equations, it is difficult to implement efficiently on parallel machines because of its dynamic and somewhat unpredictable way of generating work and intermediate results at run time. In this paper, we present an efficient parallel algorithm that overcomes this difficulty. The high performance of our algorithm is achieved through (1) using a graph reduction technique and a supernode-panel computational kernel for high single processor utilization, and (2) scheduling two types of parallel tasks for a high level of concurrency. One such task is factoring the independent panels in the disjoint subtrees of the column elimination tree of $A$. Another task is updating a panel by previously computed supernodes. A scheduler assigns tasks to free processors dynamically and facilitates the smooth transition between the two types of parallel tasks. No global synchronization is used in the algorithm. The algorithm is well suited for shared memory machines (SMP) with a modest number of processors. We demonstrate 4- to 7-fold speedups on a range of 8 processor SMPs, and more on larger SMPs. One realistic problem arising from a 3-D flow calculation achieves factorization rates of 1.0, 2.5, 0.8, and 0.8 gigaflops on the 12 processor Power Challenge, 8 processor Cray C90, 16 processor Cray J90, and 8 processor AlphaServer 8400.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
The university of Florida sparse matrix collection
Timothy A. Davis,Yifan Hu +1 more
TL;DR: The University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications, is described and a new multilevel coarsening scheme is proposed to facilitate this task.
4.3K
SUNDIALS: Suite of nonlinear and differential/algebraic equation solvers
Alan C. Hindmarsh,Peter Brown,K. E. Grant,Steven L. Lee,Radu Serban,Dan E. Shumaker,Carol S. Woodward +6 more
TL;DR: The current capabilities of the codes, along with some of the algorithms and heuristics used to achieve efficiency and robustness, are described.
•Posted Content
Enabling New Flexibility in the SUNDIALS Suite of Nonlinear and Differential/Algebraic Equation Solvers.
TL;DR: The SUNDIALS suite of nonlinear and DIfferential/ALgebraic equation solvers (SUNDIALs) as mentioned in this paper has been redesigned to better enable the use of application-specific and third-party algebraic solvers and data structures.
1.9K
Solving unsymmetric sparse systems of linear equations with PARDISO
Olaf Schenk,Klaus Gärtner +1 more
TL;DR: Experiments demonstrate that a wide set of unsymmetric linear systems can be solved and high performance is consistently achieved for large sparse unsympetric matrices from real world applications.
1.5K
A Supernodal Approach to Sparse Partial Pivoting
TL;DR: A sparse LU code is developed that is significantly faster than earlier partial pivoting codes and compared with UMFPACK, which uses a multifrontal approach; the code is very competitive in time and storage requirements, especially for large problems.
References
•Journal Article
WHO Technical Report.
TL;DR: The Feather River Coordinated Resource Management Group (FR-CRM) has been restoring channel/ meadow/ floodplain systems in the Feather River watershed since 1985 and recognized the possibility of a significant change in carbon stocks in these restored meadows and valleys.
3.1K
A set of level 3 basic linear algebra subprograms
TL;DR: This paper describes an extension to the set of Basic Linear Algebra Subprograms targeted at matrix-vector operations that should provide for efficient and portable implementations of algorithms for high-performance computers.
A Supernodal Approach to Sparse Partial Pivoting
TL;DR: A sparse LU code is developed that is significantly faster than earlier partial pivoting codes and compared with UMFPACK, which uses a multifrontal approach; the code is very competitive in time and storage requirements, especially for large problems.
An extended set of Fortran Basic Linear Algebra Subprograms: model implementation and test programs
Jack Dongarra,J. Du Croz,Sven Hammarling,Richard J. Hanson +3 more
- 01 Jan 1987
TL;DR: In this article, a model implementation and test software for Level 2 Basic Linear Algebra Subprograms (Level 2 BLAS) is described, targeted at matrix-vector operations with the aim of providing more efficient, but portable, implementations of algorithms on high-performance computers.
942
Sparse matrices in matlab: design and implementation
TL;DR: The matrix computation language and environment MATLAB is extended to include sparse matrix storage and operations, and nearly all the operations of MATLAB now apply equally to full or sparse matrices, without any explicit action by the user.