Scientific Data Management Integrated Software Infrastructure Center (SDM/ISIC): Scientific Process Automation (SPA) (Final Report)
- Univ. of California, San Diego, CA (United States)
- San Diego Supercomputing Center, La Jolla, CA (United States)
The KEPLER project has made a significant impact and established a sustained leadership in the field of scientific workflows. The open source KEPLER project started in 2003 when members of the SciDAC-SDM Center/SPA Team, sponsored by this DOE-funded project (DE-FC02-01ER25486) and members of the SEEK project (NSF/ITR awards DBI-0225674 and DBI-0533368) decided to collaborate and jointly develop a scientific workflow system based on the open source PTOLEMY II system (Ludäscher was a co-PI on both projects). In the first project phase, between 2001 and 2003, the SDSC team (Altintas, Ludäscher) worked closely with a domain scientist (Matt Coleman, LLNL) and created early versions of a Promoter Identification Workflow (PIW). Towards the end of that period, the open source PTOLEMY II system was adopted by SciDAC-SDM and SEEK as the basis for a general scientific workflow system and problem-solving environment to design and execute scientific workflows, giving rise to Kepler. Once started through SciDAC-SDM and SEEK, the KEPLER leadership team was able to attract further support from funding agencies including from DOE (as part of SciDAC-2), NSF, NIH, and the Gordon and Betty Moore Foundation. Important advances to KEPLER were made by the SciDAC/SDM-SPA team during the report period (2001–2007), in particular by the SDSC team (and after Ludäscher's move) the UC Davis team, including (i) research and development of the workflow infrastructure, (ii) library (actor) development, and (iii) development of concrete scientific workflows, in collaboration with scientists. Years later, the KEPLER system was listed as one of the prominent “Big Data” research outcomes of DOE in the White House Big Data Factsheet, announced as part of the White House Big Data press release.
- Research Organization:
- Univ. of California, San Diego, CA (United States); San Diego Supercomputing Center, La Jolla, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC)
- DOE Contract Number:
- FC02-01ER25486
- OSTI ID:
- 1044813
- Report Number(s):
- DE-FC02--01ER25486-Final-Report
- Country of Publication:
- United States
- Language:
- English
Similar Records
SciDAC - The Scientific Data Management Center (http://sdmcenter.lbl.gov)
Scientific Process Automation Improves Data Interaction