Monitoring and Steering of Large-Scale Distributed Simulations
Oak Ridge National Laboratory is developing CUMULVS, a parallel application development system that lets scientists incorporate interactive visualization, computational steering, and fault tolerance into distributed software applications. The system is useful for large scientific applications because it enables a scientist to visually monitor large data fields and remotely control parameters inside a running application. Collaborative monitoring is supported: multiple researchers can attach to a simulation simultaneously, each controlling their own view of the same or different data fields within the simulation. By supporting steering of a simulation while it is running, CUMULVS offers an opportunity to accelerate the process of scientific discovery. CUMULVS also provides a simple mechanism for incorporating automatic checkpointing and heterogeneous task migration into large applications, so that simulations can run unattended for weeks. This paper gives an overview of the CUMULVS system and its capabilities, including several case histories, and describes the status of the project along with instructions for obtaining the software.
- Research Organization: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization: USDOE Office of Science (SC)
- DOE Contract Number: AC05-96OR22464
- OSTI ID: 9401
- Report Number(s): ORNL/CP-104213; KJ 01 01 03 0; ON: DE00009401
- Resource Relation: Conference: International Conference on Applied Modeling and Simulation, Cairns, Australia, September 1-3, 1999
- Country of Publication: United States
- Language: English
Similar Records
Interfacing parallel scientific applications with multiple visualization systems: The CUMULVS approach
Efficient and flexible fault tolerance and migration of scientific simulation using CUMULVS