DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework

Abstract

Scientific applications are increasingly using cloud resources for their data analysis workflows. However, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance, cost trade-offs, complex application choices and complexity associated with elasticity, failure rates in these environments. The different data access patterns for data-intensive scientific applications require a more flexible and robust data management solution than the ones currently in existence. FRIEDA is a Flexible Robust Intelligent Elastic Data Management framework that employs a range of data management strategies in cloud environments. FRIEDA can manage storage and data lifecycle of applications in cloud environments. There are four different stages in the data management lifecycle of FRIEDA – (i) storage planning, (ii) provisioning and preparation, (iii) data placement, and (iv) execution. FRIEDA defines a data control plane and an execution plane. The data control plane defines the data partition and distribution strategy, whereas the execution plane manages the execution of the application using a master-worker paradigm. FRIEDA also provides different data management strategies, either to partition the data in real-time, or predetermine the data partitions prior to application execution.

Authors:
ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Publication Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
OSTI Identifier:
1435078
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Journal of Open Source Software
Additional Journal Information:
Journal Volume: 2; Journal Issue: 10; Journal ID: ISSN 2475-9066
Publisher:
Open Source Initiative - NumFOCUS
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Ghoshal, Devarshi, Hendrix, Valerie, Fox, William, Balasubhramanian, Sowmya, and Ramakrishnan, Lavanya. FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework. United States: N. p., 2017. Web. doi:10.21105/joss.00164.
Ghoshal, Devarshi, Hendrix, Valerie, Fox, William, Balasubhramanian, Sowmya, & Ramakrishnan, Lavanya. FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework. United States. https://doi.org/10.21105/joss.00164
Ghoshal, Devarshi, Hendrix, Valerie, Fox, William, Balasubhramanian, Sowmya, and Ramakrishnan, Lavanya. Wed . "FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework". United States. https://doi.org/10.21105/joss.00164. https://www.osti.gov/servlets/purl/1435078.
@article{osti_1435078,
title = {FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework},
author = {Ghoshal, Devarshi and Hendrix, Valerie and Fox, William and Balasubhramanian, Sowmya and Ramakrishnan, Lavanya},
abstractNote = {Scientific applications are increasingly using cloud resources for their data analysis workflows. However, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance, cost trade-offs, complex application choices and complexity associated with elasticity, failure rates in these environments. The different data access patterns for data-intensive scientific applications require a more flexible and robust data management solution than the ones currently in existence. FRIEDA is a Flexible Robust Intelligent Elastic Data Management framework that employs a range of data management strategies in cloud environments. FRIEDA can manage storage and data lifecycle of applications in cloud environments. There are four different stages in the data management lifecycle of FRIEDA – (i) storage planning, (ii) provisioning and preparation, (iii) data placement, and (iv) execution. FRIEDA defines a data control plane and an execution plane. The data control plane defines the data partition and distribution strategy, whereas the execution plane manages the execution of the application using a master-worker paradigm. FRIEDA also provides different data management strategies, either to partition the data in real-time, or predetermine the data partitions prior to application execution.},
doi = {10.21105/joss.00164},
journal = {Journal of Open Source Software},
number = 10,
volume = 2,
place = {United States},
year = {Wed Feb 01 00:00:00 EST 2017},
month = {Wed Feb 01 00:00:00 EST 2017}
}

Works referenced in this record:

Storage and Data Life Cycle Management in Cloud Environments with FRIEDA
book, January 2014

  • Ramakrishnan, Lavanya; Ghoshal, Devarshi; Hendrix, Valerie
  • Cloud Computing for Data-Intensive Applications
  • DOI: 10.1007/978-1-4939-1905-5_15

Provisioning, Placement and Pipelining Strategies for Data-Intensive Applications in Cloud Environments
conference, March 2014

  • Ghoshal, Devarshi; Ramakrishnan, Lavanya
  • 2014 IEEE International Conference on Cloud Engineering (IC2E)
  • DOI: 10.1109/IC2E.2014.66

FRIEDA: Flexible Robust Intelligent Elastic Data Management in Cloud Environments
conference, November 2012

  • Ghoshal, Devarshi; Ramakrishnan, Lavanya
  • 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion: High Performance Computing, Networking Storage and Analysis
  • DOI: 10.1109/SC.Companion.2012.132