DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: In situ and in-transit analysis of cosmological simulations

Journal Article · · Computational Astrophysics and Cosmology

Abstract Modern cosmological simulations have reached the trillion-element scale, rendering data storage and subsequent analysis formidable tasks. To address this circumstance, we present a new MPI-parallel approach for analysis of simulation data while the simulation runs, as an alternative to the traditional workflow consisting of periodically saving large data sets to disk for subsequent ‘offline’ analysis. We demonstrate this approach in the compressible gasdynamics/ N -body code Nyx, a hybrid $$\mbox{MPI}+\mbox{OpenMP}$$ MPI + OpenMP code based on the BoxLib framework, used for large-scale cosmological simulations. We have enabled on-the-fly workflows in two different ways: one is a straightforward approach consisting of all MPI processes periodically halting the main simulation and analyzing each component of data that they own (‘ in situ ’). The other consists of partitioning processes into disjoint MPI groups, with one performing the simulation and periodically sending data to the other ‘sidecar’ group, which post-processes it while the simulation continues (‘in-transit’). The two groups execute their tasks asynchronously, stopping only to synchronize when a new set of simulation data needs to be analyzed. For both the in situ and in-transit approaches, we experiment with two different analysis suites with distinct performance behavior: one which finds dark matter halos in the simulation using merge trees to calculate the mass contained within iso-density contours, and another which calculates probability distribution functions and power spectra of various fields in the simulation. Both are common analysis tasks for cosmology, and both result in summary statistics significantly smaller than the original data set. We study the behavior of each type of analysis in each workflow in order to determine the optimal configuration for the different data analysis algorithms.

Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR); USDOE Office of Science (SC), High Energy Physics (HEP)
OSTI ID:
1328630
Journal Information:
Computational Astrophysics and Cosmology, Journal Name: Computational Astrophysics and Cosmology Journal Issue: 1 Vol. 3; ISSN 2197-7909
Publisher:
Springer Science + Business MediaCopyright Statement
Country of Publication:
Germany
Language:
English

References (33)

A piecewise parabolic method for cosmological hydrodynamics journal August 1995
Efficient solution algorithms for the Riemann problem for real gases journal June 1985
Multidimensional upwind methods for hyperbolic conservation laws journal March 1990
HACC: Simulating sky surveys on state-of-the-art supercomputing architectures journal January 2016
Galaxy Formation and Evolution book January 2010
The one-dimensional Ly α forest power spectrum from BOSS journal November 2013
Search for TeV Gamma-ray Emission from GRB 100621A, an extremely bright GRB in X-rays, with H.E.S.S. journal April 2014
Nucleosynthesis in multi-dimensional SN Ia explosions journal September 2004
Type Ia supernova diversity in three-dimensional models journal June 2006
Formation of Galaxies and Clusters of Galaxies by Self-Similar Gravitational Condensation journal February 1974
The evolution of large-scale structure in a universe dominated by cold dark matter journal May 1985
The Santa Barbara Cluster Comparison Project: A Comparison of Cosmological Hydrodynamics Solutions journal November 1999
The Structure of Halos: Implications for Group and Cluster Cosmology journal February 2009
Castro: a new Compressible Astrophysical Solver. i. Hydrodynamics and Self-Gravity journal May 2010
Radiative Transfer in a Clumpy Universe. iv. new Synthesis Models of the Cosmic Uv/X-Ray Background journal February 2012
Nyx: A MASSIVELY PARALLEL AMR CODE FOR COMPUTATIONAL COSMOLOGY journal February 2013
The q Continuum Simulation: Harnessing the Power of gpu Accelerated Supercomputers journal August 2015
Visualization and parallel I/O at extreme scale journal July 2008
Structure finding in cosmological simulations: the state of affairs journal August 2013
Sussing Merger Trees: The Merger Trees Comparison Project journal October 2013
The Lyman α forest in optically thin hydrodynamical simulations journal December 2014
The clustering of galaxies in the SDSS-III Baryon Oscillation Spectroscopic Survey: baryon acoustic oscillations in the Data Releases 10 and 11 Galaxy samples journal April 2014
Warm dark matter as a solution to the small scale crisis: New constraints from high redshift Lyman- α forest data journal August 2013
The Design and Implementation of FFTW3 journal February 2005
ADR visualization: A generalized framework for ranking large-scale scientific data using Analysis-Driven Refinement conference November 2014
Improved post hoc flow analysis via Lagrangian representations conference November 2014
Matplotlib: A 2D Graphics Environment journal January 2007
Large-Scale Simulations of Sky Surveys journal September 2014
The Universe at extreme scale: Multi-petaflop sky simulation on the BG/Q
  • Habib, Salman; Morozov, Vitali; Finkel, Hal
  • 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2012.106
conference November 2012
Roofline: an insightful visual performance model for multicore architectures journal April 2009
Distributed merge trees conference January 2013
Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach
  • Sewell, Christopher; Messer, Bronson; Habib, Salman
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '15 https://doi.org/10.1145/2807591.2807663
conference January 2015
PHEW: a parallel segmentation algorithm for three-dimensional AMR datasets: Application to structure detection in self-gravitating flows journal June 2015

Related Subjects