skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: PKDGRAV3: beyond trillion particle cosmological simulations for the next era of galaxy surveys

Journal Article · · Computational Astrophysics and Cosmology

We report on the successful completion of a 2 trillion particle cosmological simulation to z=0 run on the Piz Daint supercomputer (CSCS, Switzerland), using 4000+ GPU nodes for a little less than 80 h of wall-clock time or 350,000 node hours. Using multiple benchmarks and performance measurements on the US Oak Ridge National Laboratory Titan supercomputer, we demonstrate that our code PKDGRAV3, delivers, to our knowledge, the fastest time-to-solution for large-scale cosmological N-body simulations. This was made possible by using the Fast Multipole Method in conjunction with individual and adaptive particle time steps, both deployed efficiently (and for the first time) on supercomputers with GPU-accelerated nodes. The very low memory footprint of PKDGRAV3 allowed us to run the first ever benchmark with 8 trillion particles on Titan, and to achieve perfect scaling up to 18,000 nodes and a peak performance of 10 Pflops.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1357708
Alternate ID(s):
OSTI ID: 1567656
Journal Information:
Computational Astrophysics and Cosmology, Journal Name: Computational Astrophysics and Cosmology Vol. 4 Journal Issue: 1; ISSN 2197-7909
Publisher:
SpringerCopyright Statement
Country of Publication:
Germany
Language:
English

References (25)

N-body simulations of gravitational dynamics journal May 2011
Structure of the Coma Cluster of Galaxies journal February 1970
HACC: extreme scaling and performance across diverse architectures
  • Habib, Salman; Morozov, Vitali; Frontiere, Nicholas
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '13 https://doi.org/10.1145/2503210.2504566
conference January 2013
GreeM: Massively Parallel TreePM Code for Large Cosmological N -body Simulations journal December 2009
First‐Year Wilkinson Microwave Anisotropy Probe ( WMAP ) Observations: Determination of Cosmological Parameters journal September 2003
Hydra: an Adaptive-Mesh Implementation of P 3M-SPH journal October 1995
A parallel hashed Oct-Tree N-body algorithm conference January 1993
Cosmological hydrodynamics with adaptive mesh refinement: A new high resolution code called RAMSES journal April 2002
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs
  • Bedorf, Jeroen; Gaburov, Evghenii; Fujii, Michiko S.
  • SC14: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2014.10
conference November 2014
Planck 2013 results. XVI. Cosmological parameters journal October 2014
A Hierarchical (N) Force Calculation Algorithm journal June 2002
First-ever full observable universe simulation
  • Alimi, Jean-Michel; Bouillot, Vincent; Rasera, Yann
  • 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2012.58
conference November 2012
A hierarchical O(N log N) force-calculation algorithm journal December 1986
HACC: Simulating sky surveys on state-of-the-art supercomputing architectures journal January 2016
2HOT: an improved parallel hashed oct-tree n-body algorithm for cosmological simulation conference January 2013
The cosmological simulation code gadget-2 journal December 2005
Matter power spectrum and the challenge of percent accuracy journal April 2016
Towards an accurate mass function for precision cosmology journal March 2013
4.45 Pflops astrophysical N-body simulation on K computer -- The gravitational trillion-body problem
  • Ishiyama, Tomoaki; Nitadori, Keigo; Makino, Junichiro
  • 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2012.3
conference November 2012
On the Clustering Tendencies among the Nebulae. II. a Study of Encounters Between Laboratory Models of Stellar Systems by a New Integration Procedure. journal September 1941
Application of the Ewald method to cosmological N-body simulations journal February 1991
Scaling relations for galaxy clusters in the Millennium-XXL simulation: Scaling relations for clusters in the MXXL journal October 2012
A fast algorithm for particle simulations journal December 1987
The MICE Grand Challenge light-cone simulation – III. Galaxy lensing mocks from all-sky lensing maps journal December 2014
The q Continuum Simulation: Harnessing the Power of gpu Accelerated Supercomputers journal August 2015