Open AccessBook
CUDA Application Design and Development
Robert M. Farber
- 08 Oct 2011
341
TL;DR: CUDA Application Design and Development starts with an introduction to parallel computing concepts for readers with no previous parallel experience, and focuses on issues of immediate importance to working software developers: achieving high performance, maintaining competitiveness, analyzing CUDA benefits versus costs, and determining application lifespan.
read more
Abstract: As the computer industry retools to leverage massively parallel graphics processing units (GPUs), this book is designed to meet the needs of working software developers who need to understand GPU programming with CUDA and increase efficiency in their projects. CUDA Application Design and Development starts with an introduction to parallel computing concepts for readers with no previous parallel experience, and focuses on issues of immediate importance to working software developers: achieving high performance, maintaining competitiveness, analyzing CUDA benefits versus costs, and determining application lifespan.
The book then details the thought behind CUDA and teaches how to create, analyze, and debug CUDA applications. Throughout, the focus is on software engineering issues: how to use CUDA in the context of existing application code, with existing compilers, languages, software tools, and industry-standard API libraries.
Using an approach refined in a series of well-received articles at Dr Dobb's Journal, author Rob Farber takes the reader step-by-step from fundamentals to implementation, moving from language theory to practical coding.
Includes multiple examples building from simple to more complex applications in four key areas: machine learning, visualization, vision recognition, and mobile computing
Addresses the foundational issues for CUDA development: multi-threaded programming and the different memory hierarchy
Includes teaching chapters designed to give a full understanding of CUDA tools, techniques and structure.
Presents CUDA techniques in the context of the hardware they are implemented on as well as other styles of programming that will help readers bridge into the new material
Table of Contents
1. First Programs and How to Think in CUDA
2. CUDA for Machine Learning and Optimization
3. The CUDA Tool Suite: Profiling a PCA/NLPCA Functor
4. The CUDA Execution Model
5. CUDA Memory
6. Efficiently Using GPU Memory
7. Techniques to Increase Parallelism
8. CUDA for All GPU and CPU Applications
9. Mixing CUDA and Rendering
10. CUDA in a Cloud and Cluster Environments
11. CUDA for Real Problems: Monte Carlo, Modeling, and More
12. Application Focus on Live Streaming Video
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging
TL;DR: The method is based on registering the individual volumes to a model free prediction of what each volume should look like, thereby enabling its use on high b-value data where the contrast is vastly different in different volumes.
3.2K
Medical Image Processing on the GPU : Past, Present and Future
TL;DR: This review presents the past and present work on GPU accelerated medical image processing, and is meant to serve as an overview and introduction to existing GPU implementations.
426
Towards a comprehensive framework for movement and distortion correction of diffusion MR images: Within volume movement
Jesper L. R. Andersson,Mark S. Graham,Ivana Drobnjak,Hui Zhang,Nicola Filippini,Matteo Bastiani +5 more
TL;DR: A method to correct for intra‐volume movement into an existing framework for movement and distortion correction is introduced and it is demonstrated that the true movement can be estimated with high accuracy, and scalar parameters derived from the data are estimated with greater fidelity.
362
Susceptibility-induced distortion that varies due to motion: Correction in diffusion MR without acquiring additional data.
TL;DR: A post‐processing method for estimating the field as it changes with motion during the course of an experiment that only requires a single measured field and knowledge of the orientation of the subject when that field was acquired and is validated on both simulations and experimental data.
130
A coarse-grained parallel approach for seismic damage simulations of urban areas based on refined models and GPU/CPU cooperative computing
TL;DR: A coarse-grained parallel approach for seismic damage simulations of urban areas based on refined models and GPU/CPU cooperative computing is proposed and can be 39 times as great as that of a traditional CPU approach.
126
Related Papers (5)
Jason Sanders,Edward Kandrot +1 more
- 19 Jul 2010
David B. Kirk,Wen-mei W. Hwu +1 more
- 31 Dec 2012
Nicholas Wilt
- 11 Jun 2013
Barbara Chapman,Gabriele Jost,Ruud van der Pas +2 more
- 12 Oct 2007