HPC Colony II: FAST_OS II: Operating Systems and Runtime Systems at Extreme Scale
- IBM, Armonk, NY (United States)
HPC Colony II has been a 36-month project focused on providing portable performance for leadership-class machines, a task made difficult by the emerging variety of increasingly complex computer architectures. The project attempts to move the burden of portable performance to adaptive system software, thereby allowing domain scientists to concentrate on their field rather than on the fine details of a new leadership-class machine. To accomplish these goals, we focused on adding intelligence to the system software stack. Our revised components include new techniques to: address OS jitter; dynamically address load imbalances; map resources according to architectural subtleties and dynamic application behavior; dramatically improve the performance of checkpoint-restart; and address membership service issues at scale.
- Research Organization:
- IBM, Armonk, NY (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- SC0002107
- OSTI ID:
- 1214793
- Country of Publication:
- United States
- Language:
- English