About: Tridiagonal matrix algorithm is a research topic. Over the lifetime, 1070 publications have been published within this topic receiving 21084 citations.
TL;DR: A divide-and-conquer algorithm for the resolution of distributed linear tridiagonal systems of equations is implemented within a binary tree connection architecture, where computation time decreases linearly with the number of nodes, and the data communication required becomes proportional to the logarithm of thenumber of nodes.
Abstract: A divide-and-conquer algorithm for the resolution of distributed linear tridiagonal systems of equations is implemented within a binary tree connection architecture. A new scheme for the distribution of the data among the computing nodes allows a dilation-one implementation of a recursive substitution scheme for the solution of the global system. In this way, computation time decreases linearly with the number of nodes, and the data communication required becomes proportional to the logarithm of the number of nodes. This takes place within a network with a fixed connectivity degree of three.
TL;DR: A new shift in the QL algorithm for symmetric tridiagonal matrices is described, a combination of the Rayleigh quotient shift and Wilkinson's shift.
TL;DR: In this article, an accurate integral-based scheme for solving the advection-diffusion equation is proposed, which can be extended straightforwardly from one-dimensional to multi-dimensional problems without much difficulty and complication.
TL;DR: A polyalgorithm can be derived which takes into account much better the given architecture of the IBM 3090 VF than the ‘pure’ cyclic reduction algorithm, as described for instance by Kershaw.
Abstract: Cyclic reduction, originally proposed by Hockney and Golub, is the most popular algorithm for solving tridiagonal linear systems on SIMD-type computers like CRAY-1 or CDC CYBER 205. That algorithm seems to be the adequate one for the IBM 3090 VF (uni-processor), too, although the overall expected speedup over Gaussian elimination, specialized for tridiagonal systems, is not as high as for the CRAY-1 or the CYBER 205. That is because the excellent scalar speed of the IBM 3090 makes its vector-to-scalar speed ratio relatively moderate. The idea of the cyclic reduction algorithm can be generalized and modified in various directions. A polyalgorithm can be derived which takes into account much better the given architecture of the IBM 3090 VF than the ‘pure’ cyclic reduction algorithm as described for instance by Kershaw. This is mainly achieved by introducing more locality into the formulae. For large systems of equations the well-known cache problems are prevented.