Mapping dynamic programming algorithms on graphics processing units

doi:10.15480/882.1184

Open Access Dissertation

- 01 Jan 2014

TL;DR: The thesis studies the parallelism exhibited by compute-intensive tasks, devises a parallel GPU solution based on the matrix-matrix product method, and explores the use of the optimized BLAS library provided by CUDA.

Abstract: The Graphics Processing Unit (GPU) is a highly parallel, many-core streaming architecture that can execute hundreds of threads concurrently. Its data-parallel architecture is well suited to computation-intensive applications. In recent years, the use of GPUs for general-purpose computation has increased, and a large set of problems can be tackled by mapping them onto GPUs. The CUDA programming model allows C-like programs to be written with a few extensions, which helps programmers use the graphics hardware efficiently.

Alignment is the fundamental operation used to compare biological sequences and thereby identify regions of similarity that may be consequences of structural, functional, or evolutionary relationships. Multiple sequence alignment is an important tool for the simultaneous alignment of three or more sequences, and efficient heuristics exist to cope with this problem. In the thesis, progressive alignment methods and their parallel implementation on GPUs are studied. More specifically, the dynamic programming algorithms for profile-profile and profile-sequence alignment are mapped onto the GPU. Wavefront and matrix-matrix product techniques, which deal well with the data dependencies, are discussed, and the performance of these methods is analyzed. Simulations show that a speed-up of one order of magnitude over the serial version can be achieved.

ClustalW is the most widely used progressive sequence alignment method; it aligns the most closely related sequences first and then gradually adds more divergent sequences. It consists of three stages: distance matrix calculation, guide tree construction, and progressive alignment. In this work, the efficient mapping of the alignment stage onto the GPU using a combination of wavefront and matrix-matrix product techniques has been studied.

In hidden Markov models, the Viterbi algorithm is used to find the most probable sequence of hidden states that generated the observations. In the thesis, the parallelism exhibited by the compute-intensive tasks is studied, and a parallel solution based on the matrix-matrix product method is devised for the GPU. Moreover, the opportunity to use the optimized BLAS library provided by CUDA is explored. Finally, the performance is reported for a fixed number of states with a varying number of observations, and vice versa. At the end, general principles and guidelines for GPU programming of matrix-matrix product algorithms are discussed.
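To make the Viterbi part of the abstract more concrete, the sketch below shows one common way to express the Viterbi recursion as products in the max-plus (tropical) semiring, where "multiplication" is addition of log-probabilities and "addition" is taking the maximum. It is a plain serial Python illustration, not the thesis's GPU implementation; the function names `maxplus_matvec` and `viterbi`, and the two-state toy model, are assumptions introduced here for illustration only.

```python
import math

def maxplus_matvec(log_A, v):
    """Max-plus product of row vector v with matrix log_A.

    Returns out[j] = max_i (v[i] + log_A[i][j]) and the maximizing i per j.
    """
    out, arg = [], []
    for j in range(len(log_A[0])):
        best_i = max(range(len(v)), key=lambda i: v[i] + log_A[i][j])
        out.append(v[best_i] + log_A[best_i][j])
        arg.append(best_i)
    return out, arg

def viterbi(log_pi, log_A, log_B, obs):
    """Most probable state path and its log-probability (hypothetical helper)."""
    n = len(log_pi)
    # delta[j]: best log-probability of any path that ends in state j
    delta = [log_pi[j] + log_B[j][obs[0]] for j in range(n)]
    backptrs = []
    for o in obs[1:]:
        scores, arg = maxplus_matvec(log_A, delta)
        delta = [scores[j] + log_B[j][o] for j in range(n)]
        backptrs.append(arg)
    # Backtrack from the best final state
    state = max(range(n), key=lambda j: delta[j])
    best = delta[state]
    path = [state]
    for arg in reversed(backptrs):
        state = arg[state]
        path.append(state)
    path.reverse()
    return path, best

def log_matrix(m):
    """Element-wise natural log of a matrix of positive probabilities."""
    return [[math.log(x) for x in row] for row in m]

# Toy model (assumed values): states 0 and 1, observation symbols 0, 1, 2
log_pi = [math.log(0.6), math.log(0.4)]
log_A = log_matrix([[0.7, 0.3], [0.4, 0.6]])            # transitions
log_B = log_matrix([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])  # emissions

path, best = viterbi(log_pi, log_A, log_B, [0, 1, 2])
print(path, math.exp(best))
```

Because the max-plus product is associative, the per-step transition matrices can in principle also be combined pairwise as matrix-matrix products, which is the kind of regrouping that exposes parallelism. The sketch keeps to matrix-vector steps so that the backpointers needed for path recovery stay simple.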

Citations

Journal Article

GPU-ClustalW: Using graphics hardware to accelerate multiple-sequence alignment

Weiguo Liu, +3 more

- 01 Jan 2006

- Lecture Notes in Computer Science

TL;DR: In this article, the authors presented a new approach to reduce the computational complexity of ClustalW by using graphics processing units (GPUs) to accelerate the computationally expensive part of the algorithm.

53

Journal Article, doi:10.1007/S00607-017-0557-6

Accelerating Viterbi algorithm on graphics processing units

Muhammad Kashif Hanif, +1 more

- 01 Nov 2017

- Computing

TL;DR: A parallel solution to improve the performance of the Viterbi algorithm is presented by formulating a matrix-product-based algorithm that has been mapped to an NVIDIA graphics processing unit.

10

Posted Content

BTAS: A Library for Tropical Algebra

Ahsan Humayun, +2 more

- 17 Jan 2017

- arXiv: Distributed, Parallel, and Cluste...

TL;DR: A library based on tropical algebra, Basic Tropical Algebra Subroutines (BTAS), is proposed and designed to provide standard vector and matrix operations.

4

Journal Article, doi:10.1108/ERR.1997.1.10.112.94

Encyclopedia of Molecular Biology on CD‐ROM

Margaret Sylvia

- 01 Oct 1997

- Electronic Resources Review

4

Proceedings Article, doi:10.1109/PCCC.2016.7820646

Optimized GPU implementation for dynamic programming in image data processing

Jing Ke, +2 more

- 01 Dec 2016

TL;DR: Three parallelization strategies are proposed and implemented for a dynamic programming based cloud smoothing application, using both shared memory and non-shared memory approaches, and these strategies can be adopted in similar data analysis and processing applications.

4
