A Parallel Ghosting Algorithm for the Flexible Distributed Mesh Database
- Scientific Computation Research Center, Rensselaer Polytechnic Institute, 110 8th St, Troy, NY 12180, USA
Critical to the scalability of parallel adaptive simulations are parallel control functions, including load balancing, reduced inter-process communication, and optimal data decomposition. On distributed meshes, many mesh-based applications frequently access neighborhood information for computation; when the neighbors reside on different processors, that information must be transmitted efficiently to avoid degrading parallel performance. This article presents a parallel algorithm for creating and deleting data copies, referred to as ghost copies, which localize neighborhood data for computation while minimizing inter-process communication. The key characteristics of the algorithm are: (1) It can create ghost copies of any permissible topological order in a 1D, 2D, or 3D mesh based on selected adjacencies. (2) It exploits neighborhood communication patterns during ghost creation, thus eliminating all-to-all communication. (3) For applications that need neighbors of neighbors, the algorithm can create n ghost layers, up to the point where the whole partitioned mesh is ghosted. Strong and weak scaling results are presented for the IBM BG/P and Cray XE6 architectures on up to 32,768 cores. The algorithm also scales well when used in a parallel super-convergent patch recovery error estimator, an application that frequently accesses neighborhood data to carry out computation.
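The idea of ghost layers built from a selected adjacency can be illustrated with a minimal serial sketch. It is not the article's parallel algorithm and does not use the mesh database's API. It assumes a toy mesh given as a dictionary of elements to vertex ids, uses vertices as the bridge entity, and the names `ghost_layers`, `elem_verts`, and `owner` are hypothetical. In the parallel setting described above, each part would learn about off-part neighbors by exchanging messages with neighboring parts only. Here the whole mesh is visible in one process for simplicity.

```python
from collections import defaultdict


def ghost_layers(elem_verts, owner, part, num_layers):
    """Return the off-part elements that `part` would hold as ghosts.

    elem_verts: dict element id -> tuple of vertex ids (hypothetical toy mesh)
    owner:      dict element id -> owning part id
    Vertices act as the bridge entity: two elements are neighbors
    if they share at least one vertex.
    """
    # Build the bridge adjacency: vertex -> elements touching it.
    vert_elems = defaultdict(set)
    for elem, verts in elem_verts.items():
        for v in verts:
            vert_elems[v].add(elem)

    local = {e for e, p in owner.items() if p == part}
    ghosts = set()
    frontier = set(local)  # layer 1 grows from the owned elements

    for _ in range(num_layers):
        new_layer = set()
        for elem in frontier:
            for v in elem_verts[elem]:
                for neighbor in vert_elems[v]:
                    if neighbor not in local and neighbor not in ghosts:
                        new_layer.add(neighbor)
        if not new_layer:
            break  # every reachable element is already owned or ghosted
        ghosts |= new_layer
        frontier = new_layer  # the next layer grows from the newest ghosts

    return ghosts


# Toy 1D mesh: six edge elements in a row, element i joins vertices i and i+1.
elem_verts = {i: (i, i + 1) for i in range(6)}
# Two parts: part 0 owns elements 0-2, part 1 owns elements 3-5.
owner = {i: 0 if i < 3 else 1 for i in range(6)}

one_layer = ghost_layers(elem_verts, owner, part=0, num_layers=1)
two_layers = ghost_layers(elem_verts, owner, part=0, num_layers=2)
many_layers = ghost_layers(elem_verts, owner, part=0, num_layers=10)
```

With this toy partition, one layer gives part 0 a ghost copy of element 3, two layers add element 4, and asking for more layers than the mesh can supply stops once every off-part element is ghosted.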
- Sponsoring Organization:
- USDOE
- Grant/Contract Number:
- FC02-06ER25769
- OSTI ID:
- 1197889
- Journal Information:
- Journal Name: Scientific Programming; Journal Volume: 21; Journal Issue: 1-2; ISSN 1058-9244
- Publisher:
- Hindawi Publishing Corporation
- Country of Publication:
- Egypt
- Language:
- English
Similar Records
An assumed partition algorithm for determining processor inter-communication
Investigation of Realistic Performance Limits for Tera-Scale Computations