U.S. Department of Energy
Office of Scientific and Technical Information

Discrete Event Execution with One-Sided and Two-Sided GVT Algorithms on 216,000 Processor Cores

Journal Article · ACM Transactions on Modeling and Computer Simulation
DOI: https://doi.org/10.1145/2611561 · OSTI ID: 1132340

Global virtual time (GVT) computation is a key determinant of the efficiency and runtime dynamics of parallel discrete event simulations (PDES), especially on large-scale parallel platforms. Here, three execution modes of a generalized GVT computation algorithm are studied on high-performance parallel computing systems: (1) a synchronous GVT algorithm that is easy to implement, (2) an asynchronous GVT algorithm that is more complex to implement but can relieve blocking latencies, and (3) a variant of the asynchronous GVT algorithm that exploits one-sided communication on extant supercomputing platforms. Performance results are presented for implementations of these algorithms on up to 216,000 cores of a Cray XT5 system, exercised over a range of parameters: optimistic and conservative synchronization, fine- to medium-grained event computation, synthetic and non-synthetic applications, and different lookahead values. Rates of up to 54 billion events executed per second are recorded. Detailed PDES-specific runtime metrics are presented to further the understanding of tightly coupled discrete event dynamics on massively parallel platforms.
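
For readers unfamiliar with GVT, the following is a minimal, generic sketch of the synchronous idea in Python. It is not the authors' implementation, and this record does not describe their algorithm in detail. It assumes the common textbook formulation: GVT is the minimum timestamp over all unprocessed events and all messages still in transit, and the synchronous variant blocks until no messages are in flight before taking a global minimum. The `Rank` class, its fields, and `synchronous_gvt` are hypothetical names, and the two "reductions" are ordinary Python loops standing in for blocking collective operations.

```python
from dataclasses import dataclass, field


@dataclass
class Rank:
    """One simulated processor (hypothetical structure, for illustration only)."""
    pending: list = field(default_factory=list)  # timestamps of unprocessed events
    inbox: list = field(default_factory=list)    # timestamps sent here, not yet received
    sent: int = 0                                # messages this rank has sent
    received: int = 0                            # messages this rank has received

    def send(self, dest, timestamp):
        # Deliver an event message to another rank and count it as sent.
        dest.inbox.append(timestamp)
        self.sent += 1

    def drain(self):
        # Receive every waiting message into the local pending-event list.
        while self.inbox:
            self.pending.append(self.inbox.pop())
            self.received += 1

    def local_min(self):
        # Smallest unprocessed timestamp, or infinity if this rank is idle.
        return min(self.pending, default=float("inf"))


def synchronous_gvt(ranks):
    # Stand-in for a blocking sum-reduction: repeat until the global
    # sent-minus-received count is zero, i.e. no message is in transit.
    while True:
        for r in ranks:
            r.drain()
        if sum(r.sent - r.received for r in ranks) == 0:
            break
    # Stand-in for a blocking min-reduction over the local minima.
    return min(r.local_min() for r in ranks)


ranks = [Rank(pending=[5.0, 9.0]), Rank(pending=[7.0]), Rank()]
ranks[1].send(ranks[2], 3.5)   # a message still sitting in rank 2's inbox
print(synchronous_gvt(ranks))  # 3.5
```

Without the drain step, the minimum over the pending lists alone would be 5.0 and the in-transit message at 3.5 would be missed. Accounting for such messages is what makes the synchronous mode block, and that blocking is the latency the asynchronous and one-sided modes named in the abstract aim to relieve.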

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
DOE Office of Science; USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1132340
Journal Information:
ACM Transactions on Modeling and Computer Simulation, Vol. 24, Issue 3; ISSN 1049-3301
Publisher:
Association for Computing Machinery
Country of Publication:
United States
Language:
English

