Journal Article · DOI: 10.1109/JCN.2014.000074
Performance optimization of parallel algorithms
Martin Hudik, Michal Hodon +1 more
TL;DR: Finite algorithms for solving systems of linear equations, namely matrix manipulation (Gauss elimination method, GEM), are designed for architectures with shared memory, distributed memory, and their combination.
Abstract: The high intensity of research and modeling in mathematics, physics, biology, and chemistry requires new computing resources. Because of the high computational complexity of such tasks, computing time is long and costly. The most efficient way to increase efficiency is to adopt parallel principles. The purpose of this paper is to present the issue of parallel computing, with emphasis on the analysis of parallel systems and the impact of communication delays on their efficiency and on overall execution time. The paper focuses on finite algorithms for solving systems of linear equations, namely matrix manipulation (Gauss elimination method, GEM). Algorithms are designed for architectures with shared memory (open multiprocessing, OpenMP), distributed memory (message passing interface, MPI), and their combination (MPI + OpenMP). The properties of the algorithms were determined analytically and verified experimentally. Conclusions are drawn for theory and practice.
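As a rough illustration of the kind of algorithm the abstract refers to, the following minimal sketch shows Gaussian elimination with partial pivoting in Python. It is not the authors' implementation: the function name `gauss_solve`, the `workers` parameter, and the use of a thread pool are assumptions made for this example. The row updates below each pivot are independent of one another, which is the loop a shared-memory (OpenMP-style) version would typically parallelize; in CPython the thread pool only marks that structure and is not expected to give a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def gauss_solve(a, b, workers=4):
    """Solve a x = b by Gaussian elimination with partial pivoting.

    Hypothetical helper for illustration only; `a` is a list of rows,
    `b` is the right-hand side.
    """
    n = len(a)
    # Build an augmented copy [a | b] so the caller's data is untouched.
    m = [[float(v) for v in row] + [float(rhs)] for row, rhs in zip(a, b)]

    def eliminate(task):
        # Subtract a multiple of pivot row k from row i (i > k).
        i, k = task
        factor = m[i][k] / m[k][k]
        for j in range(k, n + 1):
            m[i][j] -= factor * m[k][j]

    with ThreadPoolExecutor(max_workers=workers) as pool:
        for k in range(n):
            # Partial pivoting: pick the row with the largest entry in column k.
            p = max(range(k, n), key=lambda r: abs(m[r][k]))
            if m[p][k] == 0.0:
                raise ValueError("matrix is singular")
            m[k], m[p] = m[p], m[k]
            # Rows below the pivot can be updated independently of each other.
            # Consuming the map result acts as a barrier before the next pivot.
            list(pool.map(eliminate, [(i, k) for i in range(k + 1, n)]))

    # Back substitution on the upper-triangular system (sequential).
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


if __name__ == "__main__":
    print(gauss_solve([[2, 1, -1], [-3, -1, 2], [-2, 1, 2]], [8, -11, -3]))
```

In a distributed-memory (MPI-style) variant, the pivot row would additionally have to be communicated to every process before each elimination step, which is where the communication delays mentioned in the abstract would enter.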
Citations
Parallel Algorithm for Finding Inverse of a Matrix and its Application in Message Sharing (Coding Theory)
TL;DR: This article proposes a parallel algorithm for finding the inverse of a matrix using the Gauss-Jordan method in OpenMP; the method obtains the inverse directly and requires approximately 50% fewer operations than other methods.
An efficient process for enhancing genotype imputation in Genome-wide association studies using high performance computing
Kasikrit Damkliang, Pichaya Tandayya, Unitsa Sangket, Surakameth Mahasirimongkol, Ekawat Pasomsab +4 more
- 01 Nov 2015
TL;DR: This paper presents a practical and efficient process for enhancing genotype imputation analysis of Single Nucleotide Polymorphisms (SNPs) using High Performance Computing (HPC).
Scalable Multi Core Architectures Design Methodologies And Tools
Nicole Bauer
- 01 Jan 2016
Design of Image Matching Parallel Algorithm Based on Improved Minimum Distance Method
Guoliang ZOU, Shiming ZHANG, Changji CHEN, Zhenhua WANG +3 more
TL;DR: A parallel image matching algorithm is designed to improve real-time recognition using multi-core resources. It uses normalized Euclidean distance and MPI+OpenMP hybrid programming, achieving reduced matching errors and time, with improved acceleration and parallel efficiency.
References
Cloud Computing and Grid Computing 360-Degree Compared
Ian Foster, Yong Zhao, Ioan Raicu, Shiyong Lu +3 more
- 01 Nov 2008
TL;DR: In this article, the authors compare and contrast cloud computing with grid computing from various angles and give insights into the essential characteristics and advantages of both technologies.
•Book
Distributed Systems: Concepts and Design
George Coulouris, Jean Dollimore +1 more
- 01 Jan 1988
TL;DR: The fifth edition of this best-selling text continues to provide a comprehensive source of material on the principles and practice of distributed computer systems and the exciting new developments based on them, using a wealth of modern case studies to illustrate their design and development.
•Book
Algorithms and Parallel Computing
Fayez Gebali
- 19 Apr 2011
TL;DR: This book provides the techniques to explore the possible ways to program a parallel computer for a given application.