U.S. Department of Energy
Office of Scientific and Technical Information

A2Cloud‐RF: A random forest based statistical framework to guide resource selection for high‐performance scientific computing on the cloud

Journal Article · Concurrency and Computation. Practice and Experience
DOI: https://doi.org/10.1002/cpe.5942 · OSTI ID: 1641843
 [1];  [1];  [2];  [1];  [3];  [1];  [1];  [1]
  1. School of Engineering and Computer Science University of the Pacific Stockton California USA
  2. Lawrence Livermore National Laboratory Livermore California USA
  3. Department of Chemistry, College of the Pacific University of the Pacific Stockton California USA

Summary

This article proposes a random‐forest based A2Cloud framework that matches scientific applications with Cloud providers and their instances for high performance. The framework uses four engines for this task: the PERF engine, the Cloud trace engine, the A2Cloud‐ext engine, and the random forest classifier (RFC) engine. The PERF engine profiles the application to obtain its performance characteristics, including the number of single‐precision (SP) floating‐point operations (FLOPs), double‐precision (DP) FLOPs, x87 operations, memory accesses, and disk accesses. The Cloud trace engine obtains the corresponding performance characteristics of the selected Cloud instances: SP floating‐point operations per second (FLOPS), DP FLOPS, x87 operations per second, memory bandwidth, and disk bandwidth. The A2Cloud‐ext engine uses the application and Cloud instance characteristics to generate objective scores that represent the application‐to‐Cloud match. The RFC engine uses these objective scores to generate two types of random forests that assist users with rapid analysis: application‐specific random forests (ARF) and application‐class based random forests. An ARF considers only the input application's characteristics and provides numerical ratings for the selected Cloud instances. To generate an application‐class based random forest, the RFC engine downloads the application profiles and scores of previously tested applications that perform similarly to the input application, and uses these data to build a random forest for instance recommendation. We test this framework exhaustively using eight real‐world applications across 12 instances from different Cloud providers. Our tests show significant statistical agreement between the instance ratings given by the framework and the ratings obtained from actual Cloud executions.
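
As a rough illustration of the application‐to‐Cloud match described in the summary, the sketch below pairs each profiled application quantity with the corresponding instance rate and ranks instances by a summed time estimate. This is not the A2Cloud‐ext scoring method, which the summary does not specify: the work‐divided‐by‐rate estimate, the class and function names, and the use of bytes for memory and disk traffic are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AppProfile:
    """Hypothetical container for what a profiler might report for one run."""
    sp_flops: float    # single-precision floating-point operations (count)
    dp_flops: float    # double-precision floating-point operations (count)
    x87_ops: float     # x87 operations (count)
    mem_bytes: float   # memory traffic, assumed to be in bytes
    disk_bytes: float  # disk traffic, assumed to be in bytes


@dataclass(frozen=True)
class InstanceTrace:
    """Hypothetical container for measured rates of one Cloud instance."""
    name: str
    sp_flops_per_s: float
    dp_flops_per_s: float
    x87_ops_per_s: float
    mem_bytes_per_s: float
    disk_bytes_per_s: float


def estimated_seconds(app: AppProfile, inst: InstanceTrace) -> float:
    """Assumed score: sum of work/rate over the five profiled quantities."""
    pairs = [
        (app.sp_flops, inst.sp_flops_per_s),
        (app.dp_flops, inst.dp_flops_per_s),
        (app.x87_ops, inst.x87_ops_per_s),
        (app.mem_bytes, inst.mem_bytes_per_s),
        (app.disk_bytes, inst.disk_bytes_per_s),
    ]
    total = 0.0
    for work, rate in pairs:
        if work <= 0:
            continue  # the application does not exercise this resource
        if rate <= 0:
            raise ValueError(f"{inst.name}: non-positive rate for required resource")
        total += work / rate
    return total


def rank_instances(app: AppProfile, instances: list[InstanceTrace]) -> list[InstanceTrace]:
    """Return instances ordered from lowest to highest estimated time."""
    return sorted(instances, key=lambda inst: estimated_seconds(app, inst))
```

In this sketch a lower estimate means a better match. Per‐instance scores of this kind are the sort of input that a random forest classifier could then be trained on, as the summary describes for the RFC engine.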

Sponsoring Organization:
USDOE
Grant/Contract Number:
NONE; AC52-07NA27344
OSTI ID:
1641843
Journal Information:
Concurrency and Computation. Practice and Experience, Vol. 32, Issue 24; ISSN 1532-0626
Publisher:
Wiley Blackwell (John Wiley & Sons)
Country of Publication:
United Kingdom
Language:
English
