Journal Article10.1016/J.ADVENGSOFT.2015.03.006
A multiple-GPU based parallel independent coefficient reanalysis method and applications for vehicle design
19
TL;DR: The multi-GPU based IC reanalysis method has potential capability for handling the real large scale problem and reducing the design cycle and the suggested non-blocking communication strategy achieves higher speedups compared with blocking one.
read more
About: This article is published in Advances in Engineering Software. The article was published on 01 Jul 2015. The article focuses on the topics: CUDA & Bottleneck.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Sheet metal forming optimization by using surrogate modeling techniques
TL;DR: In this article, the authors provide insightful observations into the performance and potential development of these methods in sheet metal forming design, including the description of the sheet metal form design, off-line and online sampling strategies, space mapping algorithm, high dimensional problems, robust design, some challenges and potential feasible methods.
43
Multi-GPU acceleration of large-scale density-based topology optimization
TL;DR: In this article, a parallel implementation of density-based topology optimization using distributed GPU computing systems is presented, where a distributed conjugate gradient solver preconditioned by a smooth aggregation-based algebraic multigrid (AMG) is used to solve the linear system of equations obtained from finite element analysis.
23
Reanalysis assisted metaheuristic optimization for free vibration problems of composite laminates
TL;DR: Reanalysis assisted particle swarm optimization (RPSO) is much more efficient for free vibration problems of composite laminates than the PSO and can be extended to other metaheuristic algorithms easily.
22
The spectral cell method for wave propagation in heterogeneous materials simulated on multiple GPUs and CPUs
TL;DR: Details of parallel implementation of the spectral cell method using multi-core CPUs as well as GPUs and the OpenCL framework are reported, showing that speedups of up to 128 relative to serial CPU code can be achieved usingMulti-GPU code.
20
An exact and efficient X-FEM-based reanalysis algorithm for quasi-static crack propagation
Zhenxing Cheng,Hu Wang +1 more
TL;DR: In this paper, a decomposed updating reanalysis (DUR) algorithm is proposed to accelerate the X-FEM solving process by only calculating the changed part of the equilibrium equations.
13
References
A high-performance, portable implementation of the MPI message passing interface standard
William Gropp,Ewing Lusk,Nathan E. Doss,Anthony Skjellum +3 more
- 01 Sep 1996
TL;DR: The MPI Message Passing Interface (MPI) as mentioned in this paper is a standard library for message passing that was defined by the MPI Forum, a broadly based group of parallel computer vendors, library writers, and applications specialists.
2.4K
Portable implementation of the mpi message passing interface standard
William Gropp,Ewing Lusk,Nathan E. Doss,A. Skjeltum. A Highperformance +3 more
- 01 Jan 1996
TL;DR: The MPI Message Passing Interface (MPI) as discussed by the authors is a standard library for message passing that was defined by the MPI Forum, a broadly based group of parallel computer vendors, library writers, and applications specialists.
2K
GPU Computing
John D. Owens,Mike Houston,David Luebke,Simon Green,John E. Stone,James C. Phillips +5 more
- 01 May 2008
TL;DR: The background, hardware, and programming model for GPU computing is described, the state of the art in tools and techniques are summarized, and four GPU computing successes in game physics and computational biophysics that deliver order-of-magnitude performance gains over optimized CPU applications are presented.
1.7K
A method of finite element tearing and interconnecting and its parallel solution algorithm
TL;DR: A novel domain decomposition approach for the parallel finite element solution of equilibrium equations is presented, which exhibits a degree of parallelism that is not limited by the bandwidth of the finite element system of equations.
1.4K
Brook for GPUs: stream computing on graphics hardware
Ian Buck,Tim Foley,Daniel Reiter Horn,Jeremy Sugerman,Kayvon Fatahalian,Mike Houston,Pat Hanrahan +6 more
- 01 Aug 2004
TL;DR: This paper presents Brook for GPUs, a system for general-purpose computation on programmable graphics hardware that abstracts and virtualizes many aspects of graphics hardware, and presents an analysis of the effectiveness of the GPU as a compute engine compared to the CPU.