Argo NodeOS: Toward Unified Resource Management for Exascale
- Argonne National Laboratory
- Lawrence Livermore National Laboratory
- BATTELLE (PACIFIC NW LAB)
- Argonne National Lab
Exascale systems are expected to feature hundreds of thousands of compute nodes with hundreds of hardware threads and complex memory hierarchies with a mix of on-package and persistent memory modules. In this context, the Argo project is developing a new operating system for exascale machines. Targeting production workloads using workflows or coupled codes, we improve the Linux kernel on several fronts. We extend the memory management of Linux to be able to subdivide NUMA memory nodes, allowing better resource partitioning among processes running on the same node. We also add support for memory-mapped access to node- local, PCIe-attached NVRAM devices and introduce a new scheduling class targeted at parallel runtimes supporting user- level load balancing.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1862913
- Report Number(s):
- PNNL-SA-123277
- Resource Relation:
- Conference: IEEE 31st International Parallel and Distributed Processing Symposium (IPDPS 2017), May 29-June 2, 2017 Orlando, FL
- Country of Publication:
- United States
- Language:
- English
Similar Records
LDRD final report : managing shared memory data distribution in hybrid HPC applications.
HPC-Colony: Services and Interfaces to Aupport Systems With Very Large Numbers of Processors