skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: CaGrid Workflow Toolkit: A taverna based workflow tool for cancer grid

Journal Article · · BMC Bioinformatics
 [1];  [2];  [3];  [3];  [1];  [2];  [3]
  1. Univ. of Chicago, IL (United States). Argonne National Lab. (ANL). Computation Inst.
  2. Univ. of Chicago, IL (United States). Argonne National Lab. (ANL). Computation Inst.; Argonne National Lab. (ANL), Argonne, IL (United States). Mathematics and Computer Science Division
  3. Univ. of Manchester (United Kingdom). School of Computer Science

Background: In biological and medical domain, the use of web services made the data and computation functionality accessible in a unified manner, which helped automate the data pipeline that was previously performed manually. Workflow technology is widely used in the orchestration of multiple services to facilitate insilico research. Cancer Biomedical Informatics Grid (caBIG) is an information network enabling the sharing of cancer research related resources and caGrid is its underlying service-based computation infrastructure. CaBIG requires that services are composed and orchestrated in a given sequence to realize data pipelines, which are often called scientific workflows. Results: CaGrid selected Taverna as its workflow execution system of choice due to its integration with web service technology and support for a wide range of web services, plug-in architecture to cater for easy integration of third party extensions, etc. The caGrid Workflow Toolkit (or the toolkit for short), an extension to the Taverna workflow system, is designed and implemented to ease building and running caGrid workflows. It provides users with support for various phases in using workflows: service discovery, composition and orchestration, data access, and secure service invocation, which have been identified by the caGrid community as challenging in a multiinstitutional and cross-discipline domain. Conclusions: By extending the Taverna Workbench, caGrid Workflow Toolkit provided a comprehensive solution to compose and coordinate services in caGrid, which would otherwise remain isolated and disconnected from each other. Using it users can access more than 140 services and are offered with a rich set of features including discovery of data and analytical services, query and transfer of data, security protections for service invocations, state management in service interactions, and sharing of workflows, experiences and best practices. The proposed solution is general enough to be applicable and reusable within other service-computing infrastructures that leverage similar technology stack.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1626274
Journal Information:
BMC Bioinformatics, Vol. 11, Issue 1; ISSN 1471-2105
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English

References (23)

CDK-Taverna: an open workflow environment for cheminformatics journal January 2010
The design and realisation of the $^{my}$Experiment Virtual Research Environment for social sharing of workflows journal May 2009
Cancer Informatics Vision: caBIG journal January 2006
e-Science, caGrid, and Translational Biomedical Research journal November 2008
Swift: Fast, Reliable, Loosely Coupled Parallel Computation conference July 2007
Composition as a service [Web-Scale Workflow journal January 2010
Taverna: a tool for building and running workflows of services journal July 2006
Workflows for e-Science book January 2007
Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems journal January 2005
caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid journal June 2006
Scientific workflow management and the Kepler system
  • Ludäscher, Bertram; Altintas, Ilkay; Berkley, Chad
  • Concurrency and Computation: Practice and Experience, Vol. 18, Issue 10 https://doi.org/10.1002/cpe.994
journal January 2006
The Globus Toolkit 4 book January 2006
Taverna: lessons in creating a workflow environment for the life sciences journal January 2006
Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning journal January 2002
Performing statistical analyses on quantitative data in Taverna workflows: An example using R and maxdBrowse to identify differentially-expressed genes from microarray data journal August 2008
Service-Oriented Science journal May 2005
Globus Toolkit Version 4: Software for Service-Oriented Systems journal July 2006
Combining the Power of Taverna and caGrid: Scientific Workflows that Enable Web-Scale Collaboration journal November 2008
Modeling and Managing State in Distributed Systems: The Role of OGSI and WSRF journal March 2005
BioCatalogue: a universal catalogue of web services for the life sciences journal May 2010
Security for Grid services conference
Trident: Scientific Workflow Workbench for Oceanography conference July 2008
Sharing Data and Analytical Resources Securely in a Biomedical Research Grid Environment journal May 2008

Cited By (3)

PAV ontology: provenance, authoring and versioning journal January 2013
The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud journal May 2013
SIDECACHE: Information access, management and dissemination framework for web services journal June 2011

Similar Records

caGrid 1.0 : an enterprise Grid infrastructure for biomedical research.
Journal Article · Sat Mar 01 00:00:00 EST 2008 · J. Am. Inform. Assoc. · OSTI ID:1626274

Domain-Specific Languages for Composing Signature Discovery Workflows
Conference · Tue Oct 23 00:00:00 EDT 2012 · OSTI ID:1626274

Domain-Specific Languages For Developing and Deploying Signature Discovery Workflows
Journal Article · Mon Dec 02 00:00:00 EST 2013 · Computing in Science & Engineering, 16(1):52-64 · OSTI ID:1626274