DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Utilizing ensemble learning for performance and power modeling and improvement of parallel cancer deep learning CANDLE benchmarks

Journal Article · · Concurrency and Computation. Practice and Experience
DOI: https://doi.org/10.1002/cpe.6516 · OSTI ID:1983428
ORCiD logo [1];  [1]
  1. Mathematics &, Computer Science Division, Argonne National Laboratory The University of Chicago Lemont Illinois USA

Abstract Machine learning (ML) continues to grow in importance across nearly all domains in modeling to learn from data. Often a tradeoff exists between a model's ability to minimize bias and variance. In this article, we utilize ensemble learning to combine linear, nonlinear, and tree‐/rule‐based ML methods to cope with the bias‐variance tradeoff and result in more accurate models. We use the datasets collected for two parallel cancer deep learning CANDLE benchmarks, NT3 and P1B2, to build performance and power models based on hardware performance counters using single‐object and multiple‐objects ensemble learning to identify the most important counters for improvement on the Cray XC40 Theta at Argonne National Laboratory. Based on the insights from these models, we improve the performance and energy of P1B2 and NT3 by optimizing the deep learning environments TensorFlow, Keras, Horovod, and Python under the huge page size of 8 MB. Experimental results show that ensemble learning not only produces more accurate models but also provides more robust performance counter ranking. We achieve up to 61.15% performance improvement and up to 62.58% energy saving for P1B2 and up to 55.81% performance improvement and up to 52.60% energy saving for NT3 on up to 24,576 cores.

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1983428
Journal Information:
Concurrency and Computation. Practice and Experience, Journal Name: Concurrency and Computation. Practice and Experience Journal Issue: 15 Vol. 35; ISSN 1532-0626
Publisher:
Wiley Blackwell (John Wiley & Sons)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (21)

Applied Predictive Modeling book January 2013
Power-aware predictive models of hybrid (MPI/OpenMP) scientific applications on multicore systems journal August 2011
E-AMOM: an energy-aware modeling and optimization methodology for scientific applications journal July 2013
LACross: Learning-Based Analytical Cross-Platform Performance and Power Prediction journal January 2017
Exploiting performance counters to predict and improve energy performance of HPC systems journal July 2014
Statistical power modeling of GPU kernels using performance counters conference August 2010
A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures
  • Song, Shuaiwen; Su, Chunyi; Rountree, Barry
  • 2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on Parallel and Distributed Processing https://doi.org/10.1109/IPDPS.2013.73
conference May 2013
Utilizing Hardware Performance Counters to Model and Optimize the Energy and Performance of Large Scale Scientific Applications on Power-Aware Supercomputers conference May 2016
Using Performance-Power Modeling to Improve Energy Efficiency of HPC Applications journal October 2016
Ensemble based systems in decision making journal January 2006
Complete System Power Estimation Using Processor Performance Events journal April 2012
A Study on the Use of Performance Counters to Estimate Power in Microprocessors journal December 2013
Power prediction for intel XScale® processors using performance monitoring unit events conference January 2005
Online power-performance adaptation of multithreaded programs using hardware event-based prediction conference June 2006
Real time power estimation and thread scheduling via performance counters journal May 2009
Performance and power modeling in a multi-programmed multi-core environment conference June 2010
SoftPower: fine-grain power estimations using performance counters
  • Lim, Min Yeol; Porterfield, Allan; Fowler, Robert
  • Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing - HPDC '10 https://doi.org/10.1145/1851476.1851517
conference January 2010
Performance, Energy, and Scalability Analysis and Improvement of Parallel Cancer Deep Learning CANDLE Benchmarks conference August 2019
machine. journal October 2001
kernlab - An S4 Package for Kernel Methods in R journal January 2004
Regularization Paths for Generalized Linear Models via Coordinate Descent journal January 2010