Simulating performance sensitivity of supercomputer job parameters.
- Woodside, CA
We report on the use of a supercomputer simulation to study the performance sensitivity to systematic changes in the job parameters of run time, number of CPUs, and interarrival time. We also examine the effect of changes in share allocation and service ratio for job prioritization under a Fair Share queuing Algorithm to see the effect on facility figures of merit. We used log data from the ASCI supercomputer Blue Mountain and the ASCI simulator BIRMinator to perform this study. The key finding is that the performance of the supercomputer is quite sensitive to all the job parameters with the interarrival rate of the jobs being most sensitive at the highest rates and increasing run times the least sensitive job parameter with respect to utilization and rapid turnaround. We also find that this facility is running near its maximum practical utilization. Finally, we show the importance of the use of simulation in understanding the performance sensitivity of a supercomputer.
- Research Organization:
- Sandia National Laboratories (SNL), Albuquerque, NM, and Livermore, CA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 917462
- Report Number(s):
- SAND2003-1025C; TRN: US200816%%391
- Resource Relation:
- Conference: Proposed for presentation at the ASTC 2003 High Performance Computer Symposium 2003 held March 30-April 3, 2003 in Orlando, FL.
- Country of Publication:
- United States
- Language:
- English
Similar Records
Fair share on high performance computing systems : what does fair really mean?
SMC 2021 : Analyzing Resource Utilization and User Behavior on Titan Supercomputer