Journal Article10.1016/J.COMPELECENG.2013.11.015
Graphics Processing Units and Open Computing Language for parallel computing
5
TL;DR: The current state of the art in the use of GPUs and OpenCL for parallel computations is reviewed and an implementation of the n-body simulation is used to illustrate some important considerations in developing OpenCL programs.
read more
About: This article is published in Computers & Electrical Engineering. The article was published on 01 Jan 2014. The article focuses on the topics: General-purpose computing on graphics processing units & Supercomputer.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis.
TL;DR: An efficient tool based on SW26010 many-core processor, namely SWPepNovo, to process the large-scale peptide MS/MS spectra using a parallel peptide spectrum matches (PSMs) algorithm is introduced.
Parallel Self-organizing Map Using Shared Virtual Memory Buffers
Noor Elaiza Abd Khalid,Muhammad Firdaus Mustapha,Azlan Ismail,Mazani Manaf +3 more
- 03 Apr 2017
TL;DR: The experimental results show the parallel SOM running on heterogeneous platform has significant improvement in computation time, and the proposed parallel SOM architecture is suitable for both GPU and heterogeneous system with the aim of comparing the performance in term of computation time.
3
An optimized magnetostatic field solver on GPU using open computing language
Fiaz Gul Khan,Bartolomeo Montrucchio,Bilal Jan,Abdul Nasir Khan,Waqas Jadoon,Shahaboddin Shamshirband,Anthony T. Chronopoulos,Iftikhar Ahmed Khan +7 more
TL;DR: A multidimensional FFT‐based parallel implementation of a magnetostatic field computation on GPUs that shows a speedup of up to 95x and 8.6x for double precision floating point accuracy against equivalent serial implementation and OOMMF, respectively.
3
Enhancing Parallel Self-organizing Map on Heterogeneous System Architecture
Muhammad Firdaus Mustapha,Noor Elaiza Abd Khalid,Azlan Ismail,Mazani Manaf +3 more
- 27 Nov 2017
TL;DR: This study attempts to enhance the processing of SOM algorithm using multiple stimuli approach and is able to score a promising speed up for different parameter size compared to standard parallel SOM on HSA platform.
2
References
A fast algorithm for particle simulations
TL;DR: An algorithm is presented for the rapid evaluation of the potential and force fields in systems involving large numbers of particles whose interactions are Coulombic or gravitational in nature, making it considerably more practical for large-scale problems encountered in plasma physics, fluid dynamics, molecular dynamics, and celestial mechanics.
5.5K
A hierarchical O(N log N) force-calculation algorithm
Josh Barnes,Piet Hut +1 more
TL;DR: A novel method of directly calculating the force on N bodies that grows only as N log N is described, using a tree-structured hierarchical subdivision of space into cubic cells, each is recursively divided into eight subcells whenever more than one particle is found to occupy the same cell.
4.2K
•Book
Digital integrated circuits: a design perspective
Jan M. Rabaey
- 01 Jan 1996
TL;DR: In this paper, the authors present a survey of the state-of-the-art in the field of digital integrated circuits, focusing on the following: 1. A Historical Perspective. 2. A CIRCUIT PERSPECTIVE.
Implementing Molecular Dynamics on Hybrid High Performance Computers - Particle-Particle Particle-Mesh
TL;DR: This paper presents an efficient implementation of the particle–particle particle-mesh method based on the work by Harvey and De Fabritiis, and provides a performance comparison of the same kernels compiled with both CUDA and OpenCL.
499
Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL
TL;DR: Using a RV870 GPU and OpenCL, a CGH from a 3D object consisting of 1,024 points in 30 milli-seconds is calculated, a speed approximately two times faster than that of a GPU made by NVIDIA.
135
Related Papers (5)
David Kaeli,Perhaad Mistry,Dana Schaa,Dong Ping Zhang +3 more
- 01 Jun 2015
Slo-Li Chu,Chih-Chieh Hsiao +1 more
- 01 Sep 2010