DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Lowering entry barriers to developing custom simulators of distributed applications and platforms with SimGrid

Journal Article · · Parallel Computing

Researchers in parallel and distributed computing (PDC) often resort to simulation because experiments conducted using a simulator can be for arbitrary experimental scenarios, are less resource-, labor-, and time-consuming than their real-world counterparts, and are perfectly repeatable and observable. Many frameworks have been developed to ease the development of PDC simulators, and these frameworks provide different levels of accuracy, scalability, versatility, extensibility, and usability. Further, the SimGrid framework has been used by many PDC researchers to produce a wide range of simulators for over two decades. Its popularity is due to a large emphasis placed on accuracy, scalability, and versatility, and is in spite of shortcomings in terms of extensibility and usability. Although SimGrid provides sensible simulation models for the common case, it was difficult for users to extend these models to meet domain-specific needs. Furthermore, SimGrid only provided relatively low-level simulation abstractions, making the implementation of a simulator of a complex system a labor-intensive undertaking. In this work we describe developments in the last decade that have contributed to vastly improving extensibility and usability, thus lowering or removing entry barriers for users to develop custom SimGrid simulators.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
National Science Foundation (NSF); USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
2538150
Journal Information:
Parallel Computing, Journal Name: Parallel Computing Vol. 123; ISSN 0167-8191
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (19)

StarPU: a unified platform for task scheduling on heterogeneous multicore architectures journal November 2010
Faithful performance prediction of a dynamic task‐based runtime system for heterogeneous multi‐core architectures journal July 2015
Developing accurate and scalable simulators of production workflow management systems with WRENCH journal November 2020
Versatile, scalable, and accurate simulation of distributed applications and platforms journal October 2014
Cloudsim: simulator for cloud computing infrastructure and modeling journal January 2012
DISSECT-CF: A simulator to foster energy-aware scheduling in infrastructure clouds journal November 2015
Flexibilities of wavelets as a computational basis set for large-scale electronic structure calculations journal May 2020
SST: A Scalable Parallel Framework for Architecture-Level Performance, Power, Area and Thermal Simulation journal July 2011
Parallel Simulation of Peer-to-Peer Systems conference May 2012
Adding Storage Simulation Capacities to the SimGrid Toolkit: Concepts, Models, and API conference May 2015
Automated Calibration of Parallel and Distributed Computing Simulators: A Case Study conference May 2024
Simulating MPI Applications: The SMPI Approach journal August 2017
Software model checking journal October 2009
The gem5 simulator journal August 2011
Sniper
  • Carlson, Trevor E.; Heirman, Wim; Eeckhout, Lieven
  • Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/2063384.2063454
conference November 2011
On the validity of flow-level tcp network models for grid and cloud simulations
  • Velho, Pedro; Schnorr, Lucas Mello; Casanova, Henri
  • ACM Transactions on Modeling and Computer Simulation, Vol. 23, Issue 4 https://doi.org/10.1145/2517448
journal October 2013
Parallel discrete event simulation journal October 1990
Sthread: In-Vivo Model Checking of Multithreaded Programs journal February 2020
Simulation of the performance and scalability of message passing interface (MPI) communications of atmospheric models running on exascale supercomputers journal August 2018