Towards Composing Data Aware Systems Biology Workflows on Cloud Platforms: A MeDICi-based Approach
Cloud computing is being increasingly adopted for deploying systems biology scientific workflows. Scientists developing these workflows use a wide variety of fragmented and competing data sets and computational tools of all scales to support their research. To this end, the synergy of client-side workflow tools with cloud platforms is a promising approach to share and reuse data and workflows. In such systems, the location of data and computation is an essential quality-of-service consideration when composing a scientific workflow across remote cloud platforms. In this paper, we describe a cloud-based workflow for genome annotation processing that is underpinned by MeDICi, a middleware designed for data-intensive scientific applications. The workflow implementation incorporates an execution layer for exploiting data locality that routes workflow requests to the processing steps that are colocated with the data. We demonstrate our approach by composing two workflows with the MeDICi pipelines.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1030870
- Report Number(s):
- PNNL-SA-80025; TRN: US201124%%501
- Resource Relation:
- Conference: Proceedings. 2011 IEEE International World Congress on Services (SERVICES 2011), July 4-9, 2011, Washington DC, 184-191
- Country of Publication:
- United States
- Language:
- English
Similar Records
Build Less Code, Deliver More Science: An Experience Report on Composing Scientific Environments using Component-based and Commodity Software Platforms
Scientific Workflows Composition and Deployment on SOA Frameworks