U.S. Department of Energy
Office of Scientific and Technical Information

Runtime Techniques to Enable a Highly-Scalable Global Address Space Model for Petascale Computing

Journal Article · International Journal of Parallel Programming

Over the past decade, the trajectory to the petascale has been built on increased complexity and scale of the underlying parallel architectures. Meanwhile, software developers have struggled to provide tools that maintain the productivity of computational science teams using these new systems. In this regard, Global Address Space (GAS) programming models provide a straightforward, easy-to-use addressing model, which can lead to improved productivity. However, the scalability of GAS depends directly on the design and implementation of the runtime system on the target petascale distributed-memory architecture. In this paper, we describe the design, implementation, and optimization of the Aggregate Remote Memory Copy Interface (ARMCI) runtime library on the Cray XT5 2.3 PetaFLOPs computer at Oak Ridge National Laboratory. We optimized our implementation with the flow intimation technique, which we introduce in this paper. Our optimized ARMCI implementation improves the scalability of both the Global Arrays programming model and a real-world chemistry application, NWChem, from small jobs up through 180,000 cores.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); UT-Battelle LLC/ORNL, Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1564954
Journal Information:
International Journal of Parallel Programming, Vol. 40, Issue 6; ISSN 0885-7458
Publisher:
Springer
Country of Publication:
United States
Language:
English

