Evaluating HPC Scheduling Strategies for Urgent Workloads
- ORNL
- Colorado State University, Fort Collins
Scientific computing centers increasingly face workloads with diverse urgency requirements, driven by applications that demand rapid or even immediate execution. Appropriately configured scheduling policies can significantly improve both user satisfaction and overall cluster utilization. In this work, we present a systematic analysis of scheduler configurations under scenarios where a fraction of jobs have urgent computing needs. We evaluate multiple job scheduling simulators, develop a lightweight job-submission emulation framework, and create tools to analyze and visualize the resulting scheduling data. Our study identifies key trade-offs between responsiveness, fairness, and efficiency, and offers a set of practical scheduling configurations (particularly for Slurm) that can be tailored to HPC environments supporting mixed-urgency workloads.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC); USDOE
- DOE Contract Number:
- AC05-00OR22725;
- OSTI ID:
- 3019946
- Resource Type:
- Conference paper/presentation
- Conference Information:
- International Conference for High Performance Computing, Networking, Storage and Analysis (SC'26), Sixth Combined Workshop on Interactive and Urgent High-Performance Computing (WIUHPC) - St. Louis, Missouri, United States of America - 11/16/2025-11/21/2025
- Country of Publication:
- United States
- Language:
- English
Similar Records
Integrating and Characterizing HPC Task Runtime Systems for hybrid AI-HPC workloads
Quantum/AI Topology-Aware Latency-Adaptive HPC Workflow Scheduling Optimization
Power Profile Monitoring and Tracking Evolution of System-Wide HPC Workloads
Conference
·
Fri Nov 14 19:00:00 EST 2025
· Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis
·
OSTI ID:3008784
Quantum/AI Topology-Aware Latency-Adaptive HPC Workflow Scheduling Optimization
Conference
·
Sat Nov 30 19:00:00 EST 2024
·
OSTI ID:2538094
Power Profile Monitoring and Tracking Evolution of System-Wide HPC Workloads
Conference
·
Sun Jun 30 20:00:00 EDT 2024
·
OSTI ID:2439873