Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

The role of machine learning in scientific workflows

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [2];  [3];  [4]
  1. University of Southern California Information Sciences Institute, Marina Del Rey, CA, USA
  2. Renaissance Computing Institute, Chapel Hill, NC, USA
  3. Lawrence Livermore National Laboratory, Livermore, CA, USA
  4. The University of Manchester, Manchester, UK

Machine learning (ML) is being applied in a number of everyday contexts from image recognition, to natural language processing, to autonomous vehicles, to product recommendation. In the science realm, ML is being used for medical diagnosis, new materials development, smart agriculture, DNA classification, and many others. In this article, we describe the opportunities of using ML in the area of scientific workflow management. Scientific workflows are key to today’s computational science, enabling the definition and execution of complex applications in heterogeneous and often distributed environments. We describe the challenges of composing and executing scientific workflows and identify opportunities for applying ML techniques to meet these challenges by enhancing the current workflow management system capabilities. We foresee that as the ML field progresses, the automation provided by workflow management systems will greatly increase and result in significant improvements in scientific productivity.

Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); National Science Foundation (NSF)
Grant/Contract Number:
SC0012636
OSTI ID:
1523666
Journal Information:
International Journal of High Performance Computing Applications, Journal Name: International Journal of High Performance Computing Applications Journal Issue: 6 Vol. 33; ISSN 1094-3420
Publisher:
SAGECopyright Statement
Country of Publication:
United States
Language:
English

References (57)

What makes workflows work in an opportunistic environment? journal January 2006
Self-Organizing Maps book January 2001
A Survey of Data-Intensive Scientific Workflow Management journal March 2015
Local convergence of the fuzzy c-Means algorithms journal January 1986
Sequential Competitive Learning and the Fuzzy c-Means Clustering Algorithms journal July 1996
Discovering cluster-based local outliers journal June 2003
ROSS: A high-performance, low-memory, modular Time Warp system journal November 2002
Comparing machine learning classifiers in potential distribution modelling journal May 2011
Intelligent failure prediction models for scientific workflows journal February 2015
Pegasus, a workflow management system for science automation journal May 2015
Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds journal July 2015
A Pareto-based approach for CPU provisioning of scientific workflows on clouds journal May 2019
Workload-aware anomaly detection for Web applications journal March 2014
Resource-efficient workflow scheduling in clouds journal May 2015
The mutual information: Detecting and evaluating dependencies between variables journal October 2002
The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud journal May 2013
Principal component analysis: a review and recent developments
  • Jolliffe, Ian T.; Cadima, Jorge
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 374, Issue 2065 https://doi.org/10.1098/rsta.2015.0202
journal April 2016
Lambda architecture for cost-effective batch and speed big data processing conference October 2015
On the Use of Machine Learning to Predict the Time and Resources Consumed by Applications conference May 2010
TRIO: Burst Buffer Based I/O Orchestration conference September 2015
How to Track Your Data: The Case for Cloud Computing Provenance
  • Zhang, Olive Qing; Kirchberg, Markus; Ko, Ryan K. L.
  • 2011 IEEE 3rd International Conference on Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on Cloud Computing Technology and Science https://doi.org/10.1109/CloudCom.2011.66
conference November 2011
MOHEFT: A multi-objective list-based method for workflow scheduling
  • Durillo, Juan J.; Fard, Hamid Mohammadi; Prodan, Radu
  • 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom), 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings https://doi.org/10.1109/CloudCom.2012.6427573
conference December 2012
ASKALON: a Grid application development and computing environment conference January 2005
Online Fault and Anomaly Detection for Large-Scale Scientific Workflows
  • Samak, Taghrid; Gunter, Dan; Goode, Monte
  • Communication (HPCC), 2011 IEEE International Conference on High Performance Computing and Communications https://doi.org/10.1109/HPCC.2011.55
conference September 2011
Data Elevator: Low-Contention Data Movement in Hierarchical Storage System conference December 2016
Detecting Abnormal Machine Characteristics in Cloud Infrastructures
  • Bhaduri, Kanishka; Das, Kamalika; Matthews, Bryan L.
  • 2011 IEEE International Conference on Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on Data Mining Workshops https://doi.org/10.1109/ICDMW.2011.62
conference December 2011
Energy-Aware Workflow Scheduling Using Frequency Scaling
  • Pietri, Ilia; Sakellariou, Rizos
  • 2014 43nd International Conference on Parallel Processing Workshops (ICCPW), 2014 43rd International Conference on Parallel Processing Workshops https://doi.org/10.1109/ICPPW.2014.26
conference September 2014
Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows conference May 2016
In Situ Visualization at Extreme Scale: Challenges and Opportunities journal November 2009
Wings: Intelligent Workflow-Based Design of Computational Experiments journal January 2011
A Machine Learning Approach for Performance Prediction and Scheduling on Heterogeneous CPUs
  • Nemirovsky, Daniel; Arkose, Tugberk; Markovic, Nikola
  • 2017 29th International Symposium on Computer Architecture and High-Performance Computing (SBAC-PAD), 2017 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) https://doi.org/10.1109/SBAC-PAD.2017.23
conference October 2017
Nimrod/K: Towards massively parallel dynamic Grid workflows conference November 2008
Analysis of application heartbeats: Learning structural and temporal features in time series data for identification of performance problems conference November 2008
Aspen: A domain specific language for performance modeling
  • Spafford, Kyle L.; Vetter, Jeffrey S.
  • 2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2012.20
conference November 2012
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In-Situ Workflows conference November 2018
Kepler: an extensible system for design and execution of scientific workflows conference January 2004
A Job Sizing Strategy for High-Throughput Scientific Workflows journal February 2018
Harnessing Data Movement in Virtual Clusters for In-Situ Execution journal March 2019
A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud conference November 2014
A survey of data provenance in e-science journal September 2005
Anomaly detection: A survey journal July 2009
Failure prediction and localization in large scientific workflows conference January 2011
Ubl conference September 2012
Makeflow: a portable abstraction for data intensive computing on clusters, clouds, and grids
  • Albrecht, Michael; Donnelly, Patrick; Bui, Peter
  • Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies - SWEET '12 https://doi.org/10.1145/2443416.2443417
conference January 2012
Predicting application performance using supervised learning on communication features
  • Jain, Nikhil; Bhatele, Abhinav; Robson, Michael P.
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '13 https://doi.org/10.1145/2503210.2503263
conference January 2013
A Declarative Optimization Engine for Resource Provisioning of Scientific Workflows in IaaS Clouds
  • Zhou, Amelie Chi; He, Bingsheng; Cheng, Xuntao
  • Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing - HPDC '15 https://doi.org/10.1145/2749246.2749251
conference January 2015
Performance Anomaly Detection and Bottleneck Identification journal July 2015
Data Access for LIGO on the OSG
  • Weitzel, Derek; Bockelman, Brian; Brown, Duncan A.
  • Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact - PEARC17 https://doi.org/10.1145/3093338.3093363
conference January 2017
Optimizing Workflow Data Footprint journal January 2007
Scheduling Multilevel Deadline-Constrained Scientific Workflows on Clouds Based on Cost Optimization journal January 2015
PANORAMA: An approach to performance modeling and diagnosis of extreme-scale workflows journal July 2016
The future of scientific workflows journal April 2017
Performing statistical analyses on quantitative data in Taverna workflows: An example using R and maxdBrowse to identify differentially-expressed genes from microarray data journal August 2008
Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences journal January 2010
Classification And Regression Trees book January 1984
A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data journal April 2016
OSG-GEM: Gene Expression Matrix Construction Using the Open Science Grid journal January 2016

Similar Records

Using simple PID-inspired controllers for online resilient resource management of distributed scientific workflows
Journal Article · Sun Jan 27 23:00:00 EST 2019 · Future Generations Computer Systems · OSTI ID:1611913

Related Subjects