Quality Assessment of GPU Power Profiling Mechanisms
Abstract
Accurate component-level power measurements are nowadays essential for the design and optimization of high-performance computing (HPC) systems and applications. Particularly, as more and more heterogeneous HPC systems are developed, the characterizations of GPU power profiles have become extremely crucial because, although GPUs provide exceptional performance, they do consume substantial amounts of power. Currently, there are various GPU power profiling mechanisms available; however, there is no standard way to assess the quality of such profiling schemes. To address this issue, in this paper, we develop an assessment methodology to rate the quality and performance of the profiling mechanism itself. Specifically, we present the assessments of four different GPU power profiling techniques: (i) Nvidia's NVML via Allinea MAP, (ii) Nvidia's NVML via direct reads, and (iii) Penguin Computing's PowerInsight (PI) via two vendor-provided drivers, and (iv) PowerInsight via Allinea MAP. In addition, we discuss the effects of moving-average filters to explain the slow variations of some of the measured power profiles. Based on our assessment, the GPU power profiling mechanism using PI device outperforms the other schemes by reliably measuring the ground-truth power profile generated by a GPU stress-test benchmark.
- Authors:
-
- ORNL
- Publication Date:
- Research Org.:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1474562
- DOE Contract Number:
- AC05-00OR22725
- Resource Type:
- Conference
- Resource Relation:
- Conference: IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) - Vancouver, , Canada - 5/21/2018 4:00:00 AM-5/25/2018 4:00:00 AM
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Sen, Satyabrata, Imam, Neena, and Hsu, Chung-Hsing. Quality Assessment of GPU Power Profiling Mechanisms. United States: N. p., 2018.
Web. doi:10.1109/IPDPSW.2018.00113.
Sen, Satyabrata, Imam, Neena, & Hsu, Chung-Hsing. Quality Assessment of GPU Power Profiling Mechanisms. United States. https://doi.org/10.1109/IPDPSW.2018.00113
Sen, Satyabrata, Imam, Neena, and Hsu, Chung-Hsing. Tue .
"Quality Assessment of GPU Power Profiling Mechanisms". United States. https://doi.org/10.1109/IPDPSW.2018.00113. https://www.osti.gov/servlets/purl/1474562.
@article{osti_1474562,
title = {Quality Assessment of GPU Power Profiling Mechanisms},
author = {Sen, Satyabrata and Imam, Neena and Hsu, Chung-Hsing},
abstractNote = {Accurate component-level power measurements are nowadays essential for the design and optimization of high-performance computing (HPC) systems and applications. Particularly, as more and more heterogeneous HPC systems are developed, the characterizations of GPU power profiles have become extremely crucial because, although GPUs provide exceptional performance, they do consume substantial amounts of power. Currently, there are various GPU power profiling mechanisms available; however, there is no standard way to assess the quality of such profiling schemes. To address this issue, in this paper, we develop an assessment methodology to rate the quality and performance of the profiling mechanism itself. Specifically, we present the assessments of four different GPU power profiling techniques: (i) Nvidia's NVML via Allinea MAP, (ii) Nvidia's NVML via direct reads, and (iii) Penguin Computing's PowerInsight (PI) via two vendor-provided drivers, and (iv) PowerInsight via Allinea MAP. In addition, we discuss the effects of moving-average filters to explain the slow variations of some of the measured power profiles. Based on our assessment, the GPU power profiling mechanism using PI device outperforms the other schemes by reliably measuring the ground-truth power profile generated by a GPU stress-test benchmark.},
doi = {10.1109/IPDPSW.2018.00113},
url = {https://www.osti.gov/biblio/1474562},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2018},
month = {5}
}