A parallel ghosting algorithm for the flexible distributed mesh database
TL;DR: A parallel algorithm of creating and deleting data copies, referred to as ghost copies, which localize neighborhood data for computation purposes while minimizing inter-process communication is presented, which leads to scalable results when used in a parallel super-convergent patch recovery error estimator.
read more
Abstract: Critical to the scalability of parallel adaptive simulations are parallel control functions including load balancing, reduced inter-process communication and optimal data decomposition. In distributed meshes, many mesh-based applications frequently access neighborhood information for computational purposes which must be transmitted efficiently to avoid parallel performance degradation when the neighbors are on different processors. This article presents a parallel algorithm of creating and deleting data copies, referred to as ghost copies, which localize neighborhood data for computation purposes while minimizing inter-process communication. The key characteristics of the algorithm are: 1 It can create ghost copies of any permissible topological order in a 1D, 2D or 3D mesh based on selected adjacencies. 2 It exploits neighborhood communication patterns during the ghost creation process thus eliminating all-to-all communication. 3 For applications that need neighbors of neighbors, the algorithm can create n number of ghost layers up to a point where the whole partitioned mesh can be ghosted. Strong and weak scaling results are presented for the IBM BG/P and Cray XE6 architectures up to a core count of 32,768 processors. The algorithm also leads to scalable results when used in a parallel super-convergent patch recovery error estimator, an application that frequently accesses neighborhood data to carry out computation.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
Application of adaptive mesh refinement to particle-in-cell simulations of plasmas and beams
Jean-Luc Vay,Phillip Colella,J.W. Kwan,P. McCorquodale,D. B. Serafini,Alex Friedman,D.P. Grote,Glen Westenskow,J.-C. Adam,A. Heron,Irving Haber +10 more
TL;DR: In this paper, the authors discuss the challenges posed by coupling this technique with plasma Particle-In-Cell simulations, and present examples of application in Heavy Ion Fusion and related fields which illustrate the effectiveness of the approach.
Increasing ising machine capacity with multi-chip architectures
Anshujit Sharma,Richard Afoakwa,Zeljko Ignjatovic,Michael C. Huang +3 more
- 11 Jun 2022
TL;DR: The proposed architectures allow an Ising machine to scale in capacity and maintain its significant performance advantage (about 2200x speedup over a state-of-the-art computational substrate) and proposed optimizations in supporting batch mode operation can cut down communication demand by about 4--5x without a significant impact on solution quality.
25
PUMIPic: A mesh-based approach to unstructured mesh Particle-In-Cell on GPUs
TL;DR: The studies show that mesh-based PIC can utilize a partitioned mesh and maintain scaling up to system limitations, and a pseudo physics simulation based on a five-dimensional gyro-kinetic code for modeling plasma physics is used to examine the performance of PUMIPic.
12
IBHM: index-based data structures for 2D and 3D hybrid meshes
TL;DR: The proposed data structures are designed to achieve a balance between their memory requirements and the time complexity necessary to answer topological queries while accepting cells (elements) of different types and shows that the overhead required to accept arbitrary cell types is small.
7
Parallel Curved Mesh Adaptation for Large Scale High-Order Finite Element Simulations
Qiukai Lu,Mark S. Shephard,Saurabh Tendulkar,Mark W. Beall +3 more
- 01 Jan 2013
TL;DR: A parallel adaptive mesh control procedure designed to operate with high-order finite element analysis packages to enable large scale automated simulations on massively parallel computers is presented.
6
References
•Book
Design Patterns: Elements of Reusable Object-Oriented Software
Erich Gamma,Richard Helm,Ralph E. Johnson,John Vlissides +3 more
- 01 Jan 1994
TL;DR: The book is an introduction to the idea of design patterns in software engineering, and a catalog of twenty-three common patterns, which most experienced OOP designers will find out they've known about patterns all along.
24.8K
Triangle: Engineering a 2D Quality Mesh Generator and Delaunay Triangulator
Jonathan Richard Shewchuk
- 27 May 1996
TL;DR: Triangle as discussed by the authors is a robust implementation of two-dimensional constrained Delaunay triangulation and Ruppert's Delaunayer refinement algorithm for quality mesh generation, and it is shown that the problem of triangulating a planar straight line graph (PSLG) without introducing new small angles is impossible for some PSLGs.
The superconvergent patch recovery and a posteriori error estimates. Part 1: The recovery technique
O. C. Zienkiewicz,J. Z. Zhu +1 more
TL;DR: In this article, a general recovery technique is developed for determining the derivatives (stresses) of the finite element solutions at nodes, which has been tested for a group of widely used linear, quadratic and cubic elements for both one and two dimensional problems.
2.1K
•Book
Numerical Solution of Partial Differential Equations by the Finite Element Method
Claes Johnson
- 29 Jan 1988
TL;DR: In this article, the authors present an easily accessible introduction to one of the most important methods used to solve partial differential equations, which they call finite element methods for integral equations (FEME).
2.1K
GloMoSim: a library for parallel simulation of large-scale wireless networks
Xiang Zeng,Rajive Bagrodia,Mario Gerla +2 more
- 01 Jul 1998
TL;DR: The paper describes the GloMoSim library, addresses a number of issues relevant to its parallelization, and presents a set of experimental results on the IBM 9076 SP, a distributed memory multicomputer.
1.5K
Related Papers (5)
Antal Hiba,Zoltán Nagy,Miklos Ruszinko +2 more
- 01 Aug 2012
Michael Cox,Pat Hanrahan +1 more
- 01 Nov 1993