OSTI.GOV · U.S. Department of Energy, Office of Scientific and Technical Information

Title: Efficient Parallel Discrete Event Simulation on Cloud/Virtual Machine Platforms

Journal Article · ACM Transactions on Modeling and Computer Simulation
DOI: https://doi.org/10.1145/2746232 · OSTI ID: 1393818

Cloud and virtual machine (VM) technologies present new challenges with respect to performance and monetary cost in executing parallel discrete event simulation (PDES) applications. Due to the introduction of overall cost as a metric, the traditional use of the highest-end computing configuration is no longer the most obvious choice. Moreover, the unique runtime dynamics and configuration choices of Cloud and VM platforms introduce new design considerations and runtime characteristics specific to PDES over Cloud/VMs. Here, an empirical study is presented to guide an understanding of the dynamics, trends, and trade-offs in executing PDES on Cloud/VM platforms. Performance and cost measures obtained from multiple PDES applications executed on the Amazon EC2 Cloud and on a high-end VM host machine reveal new, counterintuitive VM-PDES dynamics and guidelines. One of the critical aspects uncovered is the fundamental mismatch in hypervisor scheduler policies designed for general cloud workloads versus the virtual time ordering needed for PDES workloads. This insight is supported by experimental data revealing the gross deterioration in PDES performance traceable to VM scheduling policy. To overcome this fundamental problem, the design and implementation of a new deadlock-free scheduler algorithm are presented, optimized specifically for PDES applications on VMs. The scalability of our scheduler has been tested up to 128 VMs multiplexed on 32 cores, showing significant improvement in the runtime relative to the default Cloud/VM scheduler. The observations, algorithmic design, and results are timely for emerging cloud/VM-based installations, highlighting the need for PDES-specific support in high performance discrete event simulations on Cloud/VM platforms.

Research Organization: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization: USDOE
DOE Contract Number: AC05-00OR22725
OSTI ID: 1393818
Journal Information: ACM Transactions on Modeling and Computer Simulation, Vol. 26, Issue 1; ISSN 1049-3301
Publisher: Association for Computing Machinery
Country of Publication: United States
Language: English
