A Semi-Automated Workflow Solution for Data Set Publication
Abstract
In order to address the need for published data, considerable effort has gone into formalizing the process of data publication. From funding agencies to publishers, data publication has rapidly become a requirement. Digital Object Identifiers (DOI) and data citations have enhanced the integration and availability of data. The challenge facing data publishers now is to deal with the increased number of publishable data products and most importantly the difficulties of publishing diverse data products into an online archive. The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC), a NASA-funded data center, faces these challenges as it deals with data products created by individual investigators. This paper summarizes the challenges of curating data and provides a summary of a workflow solution that ORNL DAAC researcher and technical staffs have created to deal with publication of the diverse data products. Finally, the workflow solution presented here is generic and can be applied to data from any scientific domain and data located at any data center.
- Authors:
-
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Environmental Sciences Division. Climate Change Science Inst. (CCSI)
- Publication Date:
- Research Org.:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE; National Aeronautics and Space Administration (NASA), Washington, DC (United States
- OSTI Identifier:
- 1261317
- Grant/Contract Number:
- AC05-00OR22725; NNG14HH39I
- Resource Type:
- Accepted Manuscript
- Journal Name:
- ISPRS international journal of geo-information
- Additional Journal Information:
- Journal Volume: 5; Journal Issue: 3; Journal ID: ISSN 2220-9964
- Publisher:
- MDPI
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 96 KNOWLEDGE MANAGEMENT AND PRESERVATION; data ingest; publication workflow; data deluge; terrestrial ecology; automation
Citation Formats
Vannan, Suresh, Beaty, Tammy W., Cook, Robert B., Wright, Daine M., Devarakonda, Ranjeet, Wei, Yaxing, Hook, Les A., and McMurry, Benjamin F. A Semi-Automated Workflow Solution for Data Set Publication. United States: N. p., 2016.
Web. doi:10.3390/ijgi5030030.
Vannan, Suresh, Beaty, Tammy W., Cook, Robert B., Wright, Daine M., Devarakonda, Ranjeet, Wei, Yaxing, Hook, Les A., & McMurry, Benjamin F. A Semi-Automated Workflow Solution for Data Set Publication. United States. https://doi.org/10.3390/ijgi5030030
Vannan, Suresh, Beaty, Tammy W., Cook, Robert B., Wright, Daine M., Devarakonda, Ranjeet, Wei, Yaxing, Hook, Les A., and McMurry, Benjamin F. Tue .
"A Semi-Automated Workflow Solution for Data Set Publication". United States. https://doi.org/10.3390/ijgi5030030. https://www.osti.gov/servlets/purl/1261317.
@article{osti_1261317,
title = {A Semi-Automated Workflow Solution for Data Set Publication},
author = {Vannan, Suresh and Beaty, Tammy W. and Cook, Robert B. and Wright, Daine M. and Devarakonda, Ranjeet and Wei, Yaxing and Hook, Les A. and McMurry, Benjamin F.},
abstractNote = {In order to address the need for published data, considerable effort has gone into formalizing the process of data publication. From funding agencies to publishers, data publication has rapidly become a requirement. Digital Object Identifiers (DOI) and data citations have enhanced the integration and availability of data. The challenge facing data publishers now is to deal with the increased number of publishable data products and most importantly the difficulties of publishing diverse data products into an online archive. The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC), a NASA-funded data center, faces these challenges as it deals with data products created by individual investigators. This paper summarizes the challenges of curating data and provides a summary of a workflow solution that ORNL DAAC researcher and technical staffs have created to deal with publication of the diverse data products. Finally, the workflow solution presented here is generic and can be applied to data from any scientific domain and data located at any data center.},
doi = {10.3390/ijgi5030030},
journal = {ISPRS international journal of geo-information},
number = 3,
volume = 5,
place = {United States},
year = {Tue Mar 08 00:00:00 EST 2016},
month = {Tue Mar 08 00:00:00 EST 2016}
}
Web of Science