skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: LibPowerMon: A vendor-neutral system monitoring library

Abstract

As power becomes an important resource to provision for modern HPC systems it becomes crucial to obtain deeper insights into application's power and thermal characteristics and to correlate application context with important system-level measurements. Existing profiling tools to monitor power and thermal measurements either operate at a granularity that is not fine enough to correlate with application-level events that describe application context or are not equipped to sample application-level events transparently. We introduce LibPowerMon, a lightweight, vendor-neutral user-level profiling framework to simultaneously sample user-specified application events and system-level metrics at up to 1 kHz sampling intervals. At the application level, LibPowerMon provides a source-level phase markup interface to capture application context. At runtime level, LibPowerMon records MPI and OpenMP events and can be extended to include other runtimes. At the system level, LibPowerMon samples power, thermal and relevant metrics, and provides an extensible interface to system's control capabilities. In this release, LibPowerMon supports monitoring and control functionalities on the Intel, IBM Power9 and Nvidia Volta architectures.

Developers:
 [1];
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Release Date:
Project Type:
Open Source, Publicly Available Repository
Software Type:
Scientific
Version:
0.1.0
Licenses:
MIT License
Sponsoring Org.:
USDOE National Nuclear Security Administration (NNSA)

Primary Award/Contract Number:
AC52-07NA27344
Code ID:
36017
Site Accession Number:
LLNL-CODE-805458
Research Org.:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Country of Origin:
United States

Citation Formats

MARATHE, ANIRUDDHA P, ROUNTREE, BARRY, and USDOE National Nuclear Security Administration. LibPowerMon: A vendor-neutral system monitoring library. Computer software. https://www.osti.gov//servlets/purl/1618413. Vers. 0.1.0. USDOE National Nuclear Security Administration (NNSA). 1 Oct. 2019. Web. doi:10.11578/dc.20200513.6.
MARATHE, ANIRUDDHA P, ROUNTREE, BARRY, & USDOE National Nuclear Security Administration. (2019, October 1). LibPowerMon: A vendor-neutral system monitoring library (Version 0.1.0) [Computer software]. https://www.osti.gov//servlets/purl/1618413. doi:10.11578/dc.20200513.6.
MARATHE, ANIRUDDHA P, ROUNTREE, BARRY, and USDOE National Nuclear Security Administration. LibPowerMon: A vendor-neutral system monitoring library. Computer software. Version 0.1.0. October 1, 2019. https://www.osti.gov//servlets/purl/1618413. doi:10.11578/dc.20200513.6.
@misc{osti_1618413,
title = {LibPowerMon: A vendor-neutral system monitoring library, Version 0.1.0},
author = {MARATHE, ANIRUDDHA P and ROUNTREE, BARRY and USDOE National Nuclear Security Administration},
abstractNote = {As power becomes an important resource to provision for modern HPC systems it becomes crucial to obtain deeper insights into application's power and thermal characteristics and to correlate application context with important system-level measurements. Existing profiling tools to monitor power and thermal measurements either operate at a granularity that is not fine enough to correlate with application-level events that describe application context or are not equipped to sample application-level events transparently. We introduce LibPowerMon, a lightweight, vendor-neutral user-level profiling framework to simultaneously sample user-specified application events and system-level metrics at up to 1 kHz sampling intervals. At the application level, LibPowerMon provides a source-level phase markup interface to capture application context. At runtime level, LibPowerMon records MPI and OpenMP events and can be extended to include other runtimes. At the system level, LibPowerMon samples power, thermal and relevant metrics, and provides an extensible interface to system's control capabilities. In this release, LibPowerMon supports monitoring and control functionalities on the Intel, IBM Power9 and Nvidia Volta architectures.},
url = {https://www.osti.gov//servlets/purl/1618413},
doi = {10.11578/dc.20200513.6},
year = {2019},
month = {10},
note =
}

Software:
Publicly Accessible Repository
https://github.com/LLNL/LibPowerMon

Save / Share: