MAPA: Multi-Accelerator Pattern Allocation Policy for Multi-Tenant GPU Servers
- University of California, Riverside
- BATTELLE (PACIFIC NW LAB)
- University of Sydney
Multi-accelerator servers are increasingly deployed in shared multi-tenant environments (such as cloud data centers) to meet the demands of large-scale, compute-intensive workloads. These accelerators are also increasingly interconnected in complex topologies, and workloads exhibit a widening variety of inter-accelerator communication patterns. However, existing allocation policies are ill-suited to these emerging use cases. Specifically, this work identifies that multi-accelerator workloads are commonly fragmented, leading to reduced bandwidth and increased latency for inter-accelerator communication. We propose Multi-Accelerator Pattern Allocation (MAPA), a graph pattern mining approach that provides generalized support for allocating multi-accelerator workloads on multi-accelerator servers. We demonstrate that MAPA improves the execution time of multi-accelerator workloads and that its benefits generalize across various accelerator topologies. Finally, using MAPA, we demonstrate a speedup of 12.4% for the 75th percentile of jobs, with the worst-case execution time reduced by up to 35% relative to the baseline policy.
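To make the idea of topology-aware, pattern-based allocation concrete, the following is a minimal sketch and not MAPA's actual algorithm. It assumes the server topology is given as a table of pairwise link bandwidths and the job as a list of edges between logical ranks, and it swaps in a plain exhaustive search in place of graph pattern mining. The function name `best_allocation`, the `default_bw` fallback, and all bandwidth values are hypothetical and chosen only for illustration.

```python
from itertools import combinations, permutations


def best_allocation(link_bw, free_gpus, pattern_edges, num_gpus, default_bw=10):
    """Exhaustively pick the free GPUs that maximize total pattern bandwidth.

    link_bw:       dict mapping frozenset({gpu_a, gpu_b}) -> bandwidth (hypothetical units)
    free_gpus:     iterable of currently unallocated physical GPU ids
    pattern_edges: list of (rank_a, rank_b) pairs that communicate in the job
    num_gpus:      number of GPUs the job requests
    default_bw:    assumed bandwidth for GPU pairs without a direct link
    Returns (score, mapping), where mapping[i] is the physical GPU for rank i,
    or None if too few GPUs are free.
    """
    best = None
    for subset in combinations(sorted(free_gpus), num_gpus):
        for mapping in permutations(subset):
            # Sum the bandwidth of every physical link the job's pattern would use.
            score = sum(
                link_bw.get(frozenset((mapping[a], mapping[b])), default_bw)
                for a, b in pattern_edges
            )
            if best is None or score > best[0]:
                best = (score, mapping)
    return best


# Hypothetical 4-GPU server: fast links 0-1 and 2-3, a slower link 1-2.
topology = {
    frozenset((0, 1)): 50,
    frozenset((2, 3)): 50,
    frozenset((1, 2)): 25,
}

# GPU 1 is held by another tenant; a 2-GPU job wants one communicating pair.
pair_result = best_allocation(topology, {0, 2, 3}, [(0, 1)], 2)

# A 3-GPU ring job on an otherwise idle server.
ring_result = best_allocation(topology, {0, 1, 2, 3}, [(0, 1), (1, 2), (2, 0)], 3)
```

In this toy setup, a first-fit policy would give the 2-GPU job GPUs 0 and 2, which share no direct link, whereas the search selects the directly linked pair 2 and 3. Exhaustive search scales factorially with job size, so it serves only to illustrate the objective, not as a practical policy.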
- Research Organization: Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization: USDOE
- DOE Contract Number: AC05-76RL01830
- OSTI ID: 1836016
- Report Number(s): PNNL-SA-165192
- Country of Publication: United States
- Language: English
Similar Records
Evaluating Burst Buffer Placement in HPC Systems
Conference · Sun Sep 01 00:00:00 EDT 2019 · OSTI ID:1566958
Efficient Subtorus Processor Allocation in a Multi-Dimensional Torus
Conference · Tue Nov 29 23:00:00 EST 2005 · OSTI ID:887122
Parallel Agent-Based Simulations on Clusters of GPUs and Multi-Core Processors
Conference · Thu Dec 31 23:00:00 EST 2009 · OSTI ID:974630