Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Fair share on high performance computing systems : what does fair really mean?

Conference ·
OSTI ID:917463

We report on a performance evaluation of a Fair Share system at the ASCI Blue Mountain supercomputer cluster. We study the impacts of share allocation under Fair Share on wait times and expansion factor. We also measure the Service Ratio, a typical figure of merit for Fair Share systems, with respect to a number of job parameters. We conclude that Fair Share does little to alter important performance metrics such as expansion factor. This leads to the question of what Fair Share means on cluster machines. The essential difference between Fair Share on a uni-processor and a cluster is that the workload on a cluster is not fungible in space or time. We find that cluster machines must be highly utilized and support checkpointing in order for Fair Share to function more closely to the spirit in which it was originally developed.

Research Organization:
Sandia National Laboratories
Sponsoring Organization:
USDOE
DOE Contract Number:
AC04-94AL85000
OSTI ID:
917463
Report Number(s):
SAND2003-1026C
Country of Publication:
United States
Language:
English

Similar Records

Simulating performance sensitivity of supercomputer job parameters.
Conference · Fri Feb 28 23:00:00 EST 2003 · OSTI ID:917462

Parallel job scheduling policies to improve fairness : a case study.
Technical Report · Thu Jan 31 23:00:00 EST 2008 · OSTI ID:929521

Characteristics of workload on ASCI blue-pacific at lawrence livermore national laboratory
Conference · Mon Aug 14 00:00:00 EDT 2000 · OSTI ID:15006159