Graphics Processing Units and Open Computing Language for parallel computing

doi:10.1016/J.COMPELECENG.2013.11.015

Journal Article10.1016/J.COMPELECENG.2013.11.015

Graphics Processing Units and Open Computing Language for parallel computing

Kyrylo Perelygin, +2 more

- 01 Jan 2014

- Computers & Electrical Engineering

- Vol. 40, Iss: 1, pp 241-251

5

TL;DR: The current state of the art in the use of GPUs and OpenCL for parallel computations is reviewed and an implementation of the n-body simulation is used to illustrate some important considerations in developing OpenCL programs.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.7150/IJBS.32142

SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis.

Chuang Li, +4 more

- 03 Jul 2019

- International Journal of Biological Scie...

TL;DR: An efficient tool based on SW26010 many-core processor, namely SWPepNovo, to process the large-scale peptide MS/MS spectra using a parallel peptide spectrum matches (PSMs) algorithm is introduced.

...read moreread less

13

Book Chapter•10.1007/978-3-319-56660-3_5

Parallel Self-organizing Map Using Shared Virtual Memory Buffers

Noor Elaiza Abd Khalid, +3 more

- 03 Apr 2017

TL;DR: The experimental results show the parallel SOM running on heterogeneous platform has significant improvement in computation time, and the proposed parallel SOM architecture is suitable for both GPU and heterogeneous system with the aim of comparing the performance in term of computation time.

...read moreread less

3

Journal Article•10.1002/CPE.3981

An optimized magnetostatic field solver on GPU using open computing language

Fiaz Gul Khan, +7 more

- 10 Mar 2017

- Concurrency and Computation: Practice an...

TL;DR: A multidimensional FFT‐based parallel implementation of a magnetostatic field computation on GPUs that shows a speedup of up to 95x and 8.6x for double precision floating point accuracy against equivalent serial implementation and OOMMF, respectively.

...read moreread less

3

•Journal Article•10.3923/JAS.2017.204.211

Evaluation of Parallel Self-organizing Map Using Heterogeneous System Platform

Muhammad Firdaus Mustapha, +2 more

- 15 Mar 2017

- Journal of Applied Sciences

3

Book Chapter•10.1007/978-981-10-7242-0_14

Enhancing Parallel Self-organizing Map on Heterogeneous System Architecture

Muhammad Firdaus Mustapha, +3 more

- 27 Nov 2017

TL;DR: This study attempts to enhance the processing of SOM algorithm using multiple stimuli approach and is able to score a promising speed up for different parameter size compared to standard parallel SOM on HSA platform.

...read moreread less

2

References

Journal Article•10.1016/0021-9991(87)90140-9

A fast algorithm for particle simulations

Leslie Greengard, +1 more

- 01 Dec 1987

- Journal of Computational Physics

TL;DR: An algorithm is presented for the rapid evaluation of the potential and force fields in systems involving large numbers of particles whose interactions are Coulombic or gravitational in nature, making it considerably more practical for large-scale problems encountered in plasma physics, fluid dynamics, molecular dynamics, and celestial mechanics.

...read moreread less

5.5K

Journal Article•10.1038/324446A0

A hierarchical O(N log N) force-calculation algorithm

Josh Barnes, +1 more

- 14 Apr 1986

- Nature

TL;DR: A novel method of directly calculating the force on N bodies that grows only as N log N is described, using a tree-structured hierarchical subdivision of space into cubic cells, each is recursively divided into eight subcells whenever more than one particle is found to occupy the same cell.

...read moreread less

4.2K

•Book

Digital integrated circuits: a design perspective

Jan M. Rabaey

- 01 Jan 1996

TL;DR: In this paper, the authors present a survey of the state-of-the-art in the field of digital integrated circuits, focusing on the following: 1. A Historical Perspective. 2. A CIRCUIT PERSPECTIVE.

...read moreread less

3K

Journal Article•10.1016/J.CPC.2011.10.012

Implementing Molecular Dynamics on Hybrid High Performance Computers - Particle-Particle Particle-Mesh

W. Michael Brown, +3 more

- 01 Mar 2012

- Computer Physics Communications

TL;DR: This paper presents an efficient implementation of the particle–particle particle-mesh method based on the work by Harvey and De Fabritiis, and provides a performance comparison of the same kernels compiled with both CUDA and OpenCL.

...read moreread less

499

•Journal Article•10.1364/OE.18.009955

Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL

Tomoyoshi Shimobaba, +4 more

- 10 May 2010

- Optics Express

TL;DR: Using a RV870 GPU and OpenCL, a CGH from a 3D object consisting of 1,024 points in 30 milli-seconds is calculated, a speed approximately two times faster than that of a GPU made by NVIDIA.

...read moreread less

135

...

Expand

Graphics Processing Units and Open Computing Language for parallel computing

Chat with Paper

AI Agents for this Paper

Citations

SWPepNovo: An Efficient De Novo Peptide Sequencing Tool for Large-scale MS/MS Spectra Analysis.

Parallel Self-organizing Map Using Shared Virtual Memory Buffers

An optimized magnetostatic field solver on GPU using open computing language

Evaluation of Parallel Self-organizing Map Using Heterogeneous System Platform

Enhancing Parallel Self-organizing Map on Heterogeneous System Architecture

References

A fast algorithm for particle simulations

A hierarchical O(N log N) force-calculation algorithm

Digital integrated circuits: a design perspective

Implementing Molecular Dynamics on Hybrid High Performance Computers - Particle-Particle Particle-Mesh

Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL

Related Papers (5)

Heterogeneous Computing with OpenCL 2.0

OpenCL: Make Ubiquitous Supercomputing Possible

Designing APU Oriented Scientific Computing Applications in OpenCL

GPUBlocks: GUI Programming Tool for CUDA and OpenCL

A 3D graphics rendering pipeline implementation based on the openCL massively parallel processing