skip to main content
DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Research applications of primary biodiversity databases in the digital age

Abstract

Our world is in the midst of unprecedented change-climate shifts and sustained, widespread habitat degradation have led to dramatic declines in biodiversity rivaling historical extinction events. At the same time, new approaches to publishing and integrating previously disconnected data resources promise to help provide the evidence needed for more efficient and effective conservation and management. Stakeholders have invested considerable resources to contribute to online databases of species occurrences. However, estimates suggest that only 10% of biocollections are available in digital form. The biocollections community must therefore continue to promote digitization efforts, which in part requires demonstrating compelling applications of the data. Our overarching goal is therefore to determine trends in use of mobilized species occurrence data since 2010, as online systems have grown and now provide over one billion records. To do this, we characterized 501 papers that use openly accessible biodiversity databases. Our standardized tagging protocol was based on key topics of interest, including: database(s) used, taxa addressed, general uses of data, other data types linked to species occurrence data, and data quality issues addressed. We found that the most common uses of online biodiversity databases have been to estimate species distribution and richness, to outline data compilation andmore » publication, and to assist in developing species checklists or describing new species. Only 69% of papers in our dataset addressed one or more aspects of data quality, which is low considering common errors and biases known to exist in opportunistic datasets. Globally, we find that biodiversity databases are still in the initial stages of data compilation. Novel and integrative applications are restricted to certain taxonomic groups and regions with higher numbers of quality records. Continued data digitization, publication, enhancement, and quality control efforts are necessary to make biodiversity science more efficient and relevant in our fast-changing environment.« less

Authors:
ORCiD logo [1];  [2]; ORCiD logo [2];  [2];  [1];  [1];  [2];  [3];  [2]
  1. Field Museum of Natural History, Chicago, IL (United States)
  2. Univ. of Florida, Gainesville, FL (United States)
  3. Univ. de Navarra, Pamplona (Spain)
Publication Date:
Research Org.:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1580870
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
PLoS ONE
Additional Journal Information:
Journal Volume: 14; Journal Issue: 9; Journal ID: ISSN 1932-6203
Publisher:
Public Library of Science
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Taxonomy; Biodiversity; Vertebrates; Invertebrates; Conservation science; Species interactions; Database searching; Plant taxonomy

Citation Formats

Ball-Damerow, Joan E., Brenskelle, Laura, Barve, Narayani, Soltis, Pamela S., Sierwald, Petra, Bieler, Rüdiger, LaFrance, Raphael, Ariño, Arturo H., and Guralnick, Robert P. Research applications of primary biodiversity databases in the digital age. United States: N. p., 2019. Web. doi:10.1371/journal.pone.0215794.
Ball-Damerow, Joan E., Brenskelle, Laura, Barve, Narayani, Soltis, Pamela S., Sierwald, Petra, Bieler, Rüdiger, LaFrance, Raphael, Ariño, Arturo H., & Guralnick, Robert P. Research applications of primary biodiversity databases in the digital age. United States. doi:10.1371/journal.pone.0215794.
Ball-Damerow, Joan E., Brenskelle, Laura, Barve, Narayani, Soltis, Pamela S., Sierwald, Petra, Bieler, Rüdiger, LaFrance, Raphael, Ariño, Arturo H., and Guralnick, Robert P. Wed . "Research applications of primary biodiversity databases in the digital age". United States. doi:10.1371/journal.pone.0215794. https://www.osti.gov/servlets/purl/1580870.
@article{osti_1580870,
title = {Research applications of primary biodiversity databases in the digital age},
author = {Ball-Damerow, Joan E. and Brenskelle, Laura and Barve, Narayani and Soltis, Pamela S. and Sierwald, Petra and Bieler, Rüdiger and LaFrance, Raphael and Ariño, Arturo H. and Guralnick, Robert P.},
abstractNote = {Our world is in the midst of unprecedented change-climate shifts and sustained, widespread habitat degradation have led to dramatic declines in biodiversity rivaling historical extinction events. At the same time, new approaches to publishing and integrating previously disconnected data resources promise to help provide the evidence needed for more efficient and effective conservation and management. Stakeholders have invested considerable resources to contribute to online databases of species occurrences. However, estimates suggest that only 10% of biocollections are available in digital form. The biocollections community must therefore continue to promote digitization efforts, which in part requires demonstrating compelling applications of the data. Our overarching goal is therefore to determine trends in use of mobilized species occurrence data since 2010, as online systems have grown and now provide over one billion records. To do this, we characterized 501 papers that use openly accessible biodiversity databases. Our standardized tagging protocol was based on key topics of interest, including: database(s) used, taxa addressed, general uses of data, other data types linked to species occurrence data, and data quality issues addressed. We found that the most common uses of online biodiversity databases have been to estimate species distribution and richness, to outline data compilation and publication, and to assist in developing species checklists or describing new species. Only 69% of papers in our dataset addressed one or more aspects of data quality, which is low considering common errors and biases known to exist in opportunistic datasets. Globally, we find that biodiversity databases are still in the initial stages of data compilation. Novel and integrative applications are restricted to certain taxonomic groups and regions with higher numbers of quality records. Continued data digitization, publication, enhancement, and quality control efforts are necessary to make biodiversity science more efficient and relevant in our fast-changing environment.},
doi = {10.1371/journal.pone.0215794},
journal = {PLoS ONE},
number = 9,
volume = 14,
place = {United States},
year = {2019},
month = {9}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Save / Share:

Works referenced in this record:

Global priorities for an effective information basis of biodiversity distributions
journal, September 2015

  • Meyer, Carsten; Kreft, Holger; Guralnick, Robert
  • Nature Communications, Vol. 6, Issue 1
  • DOI: 10.1038/ncomms9221

Bridging the biodiversity data gaps: Recommendations to meet users’ data needs
journal, July 2013

  • Faith, Dan; Collen, Ben; Ariño, Arturo
  • Biodiversity Informatics, Vol. 8, Issue 2
  • DOI: 10.17161/bi.v8i2.4126

Estimating species diversity and distribution in the era of Big Data: to what extent can we trust public databases?: Species diversity and distribution in the era of Big Data
journal, May 2015

  • Maldonado, Carla; Molina, Carlos I.; Zizka, Alexander
  • Global Ecology and Biogeography, Vol. 24, Issue 8
  • DOI: 10.1111/geb.12326

Emerging Technologies to Conserve Biodiversity
journal, November 2015


The Availability of Research Data Declines Rapidly with Article Age
journal, January 2014

  • Vines, Timothy H.; Albert, Arianne Y. K.; Andrew, Rose L.
  • Current Biology, Vol. 24, Issue 1
  • DOI: 10.1016/j.cub.2013.11.014

Inselect: Automating the Digitization of Natural History Collections
journal, November 2015


Are We Losing the Science of Taxonomy?
journal, December 2011


Kurator: A Kepler Package for Data Curation Workflows
journal, January 2012


Encouraging data citation and discovery with the Data Citation Index
journal, July 2014

  • Force, Megan M.; Robinson, Nigel J.
  • Journal of Computer-Aided Molecular Design, Vol. 28, Issue 10
  • DOI: 10.1007/s10822-014-9768-5

Assessing declines of North American bumble bees (Bombus spp.) using museum specimens
journal, October 2012

  • Colla, Sheila R.; Gadallah, Fawziah; Richardson, Leif
  • Biodiversity and Conservation, Vol. 21, Issue 14
  • DOI: 10.1007/s10531-012-0383-2

Range geometry and socio-economics dominate species-level biases in occurrence information: Species-level bias in occurrence records
journal, June 2016

  • Meyer, Carsten; Jetz, Walter; Guralnick, Robert P.
  • Global Ecology and Biogeography, Vol. 25, Issue 10
  • DOI: 10.1111/geb.12483

20 Years of Persistent Identifiers – Which Systems are Here to Stay?
journal, March 2017


taxize: taxonomic search and retrieval in R
journal, January 2013


The eBird enterprise: An integrated approach to development and application of citizen science
journal, January 2014


Motivating Online Publication of Data
journal, May 2009


BARCODING: bold: The Barcode of Life Data System (http://www.barcodinglife.org): BARCODING
journal, January 2007


Relative abundances of living and dead molluscs in two Californian lagoons
journal, April 1976


Assessment of user needs of primary biodiversity data: Analysis, concerns, and challenges
journal, July 2013

  • Ariño, Arturo H.; Chavan, Vishwas; Faith, Daniel P.
  • Biodiversity Informatics, Vol. 8, Issue 2
  • DOI: 10.17161/bi.v8i2.4094

Global Coordination and Standardisation in Marine Biodiversity through the World Register of Marine Species (WoRMS) and Related Databases
journal, January 2013


Mobilizing Mollusks: Status Update on Mollusk Collections in the U.S.A. and Canada
journal, December 2018

  • Sierwald, Petra; Bieler, Rüdiger; Shea, Elizabeth K.
  • American Malacological Bulletin, Vol. 36, Issue 2
  • DOI: 10.4003/006.036.0202

A Novel Automated Mass Digitisation Workflow for Natural History Microscope Slides
journal, March 2019

  • Allan, E. Louise; Livermore, Laurence; Price, Benjamin
  • Biodiversity Data Journal, Vol. 7
  • DOI: 10.3897/BDJ.7.e32342

Widespread mistaken identity in tropical plant collections
journal, November 2015


More than 75 percent decline over 27 years in total flying insect biomass in protected areas
journal, October 2017


Synthesis of phylogeny and taxonomy into a comprehensive tree of life
journal, September 2015

  • Hinchliff, Cody E.; Smith, Stephen A.; Allman, James F.
  • Proceedings of the National Academy of Sciences, Vol. 112, Issue 41
  • DOI: 10.1073/pnas.1423041112

The biodiversity of species and their rates of extinction, distribution, and protection
journal, May 2014


Before reproducibility must come preproducibility
journal, May 2018


Spatial and topical imbalances in biodiversity research
journal, July 2018


Mapping the biodiversity of tropical insects: species richness and inventory completeness of African sphingid moths: Mapping the biodiversity of tropical insects
journal, January 2013

  • Ballesteros-Mejia, Liliana; Kitching, Ian J.; Jetz, Walter
  • Global Ecology and Biogeography, Vol. 22, Issue 5
  • DOI: 10.1111/geb.12039

Tracking historic migrations of the Irish potato famine pathogen, Phytophthora infestans
journal, November 2002


The role of natural history collections in documenting species declines
journal, January 1998


A data citation roadmap for scientific publishers
journal, November 2018


Biodiversity data should be published, cited, and peer reviewed
journal, August 2013

  • Costello, Mark J.; Michener, William K.; Gahegan, Mark
  • Trends in Ecology & Evolution, Vol. 28, Issue 8
  • DOI: 10.1016/j.tree.2013.05.002

No specimen left behind: industrial scale digitization of natural history collections
journal, July 2012


Five task clusters that enable efficient and effective digitization of biological collections
journal, July 2012


A New Critical Estimate of Named Species-Level Diversity of the Recent Mollusca*
journal, September 2014


The taxonomic name resolution service: an online tool for automated standardization of plant names
journal, January 2013


Predicting Total Global Species Richness Using Rates of Species Description and Estimates of Taxonomic Effort
journal, August 2011

  • Costello, Mark J.; Wilson, Simon; Houlding, Brett
  • Systematic Biology, Vol. 61, Issue 5
  • DOI: 10.1093/sysbio/syr080

Communication gaps in knowledge of freshwater fish biodiversity: implications for the management and conservation of Mexican biosphere reserves
journal, August 2011


Bias in freshwater biodiversity sampling: the case of Iberian water beetles
journal, September 2008


Knowledge behind conservation status decisions: Data basis for “Data Deficient” Brazilian plant species
journal, May 2014

  • Sousa-Baena, Mariane S.; Garcia, Letícia Couto; Townsend Peterson, A.
  • Biological Conservation, Vol. 173
  • DOI: 10.1016/j.biocon.2013.06.034

Overcoming sampling bias in studies of terrestrial gastropods
journal, June 1982

  • Boag, D. A.
  • Canadian Journal of Zoology, Vol. 60, Issue 6
  • DOI: 10.1139/z82-173

Linking macroecology and community ecology: refining predictions of species distributions using biotic interaction networks
journal, April 2017

  • Staniczenko, Phillip P. A.; Sivasubramaniam, Prabu; Suttle, K. Blake
  • Ecology Letters, Vol. 20, Issue 6
  • DOI: 10.1111/ele.12770

Mass extinction in poorly known taxa
journal, June 2015

  • Régnier, Claire; Achaz, Guillaume; Lambert, Amaury
  • Proceedings of the National Academy of Sciences, Vol. 112, Issue 25
  • DOI: 10.1073/pnas.1502350112

The geospatial data quality REST API for primary biodiversity data
journal, February 2016


Online solutions and the ‘Wallacean shortfall’: what does GBIF contribute to our knowledge of species' ranges?
journal, March 2013

  • Beck, Jan; Ballesteros-Mejia, Liliana; Nagel, Peter
  • Diversity and Distributions, Vol. 19, Issue 8
  • DOI: 10.1111/ddi.12083

Just How Imperiled Are Aquatic Insects? A Case Study of Stoneflies (Plecoptera) in Illinois
journal, November 2005


Increasing the efficiency of digitization workflows for herbarium specimens
journal, July 2012


Strategies for the sustainability of online open-access biodiversity databases
journal, May 2014


Whole-drawer imaging for digital management and curation of a large entomological collection
journal, July 2012


Biodiversity data obsolescence and land uses changes
journal, January 2016

  • Escribano, Nora; Ariño, Arturo H.; Galicia, David
  • PeerJ, Vol. 4
  • DOI: 10.7717/peerj.2743

Predicting species distribution combining multi-scale drivers
journal, October 2017


Distorted Views of Biodiversity: Spatial and Temporal Bias in Species Occurrence Data
journal, June 2010


A Standardized Reference Data Set for Vertebrate Taxon Name Resolution
journal, January 2016


Detecting long-term occupancy changes in Californian odonates from natural history and citizen science records
journal, July 2017

  • Rapacciuolo, G.; Ball-Damerow, J. E.; Zeilinger, A. R.
  • Biodiversity and Conservation, Vol. 26, Issue 12
  • DOI: 10.1007/s10531-017-1399-4

New developments in museum-based informatics and applications in biodiversity analysis
journal, September 2004


Changes in occurrence, richness, and biological traits of dragonflies and damselflies (Odonata) in California and Nevada over the past century
journal, May 2014

  • Ball-Damerow, Joan E.; M’Gonigle, Leithen K.; Resh, Vincent H.
  • Biodiversity and Conservation, Vol. 23, Issue 8
  • DOI: 10.1007/s10531-014-0707-5

Data Leakage and Loss in Biodiversity Informatics
journal, November 2018

  • Peterson, A. Townsend; Asase, Alex; Canhos, Dora
  • Biodiversity Data Journal, Vol. 6
  • DOI: 10.3897/BDJ.6.e26826

Data archiving in ecology and evolution: best practices
journal, February 2011


Multidimensional biases, gaps and uncertainties in global plant occurrence information
journal, June 2016

  • Meyer, Carsten; Weigelt, Patrick; Kreft, Holger
  • Ecology Letters, Vol. 19, Issue 8
  • DOI: 10.1111/ele.12624

Whole-Drawer Imaging of Entomological Collections: Benefits, Limitations and Alternative Applications
journal, October 2014

  • Holovachov, Oleksandr; Zatushevsky, Andriy; Shydlovsky, Ihor
  • Journal of Conservation and Museum Studies, Vol. 12, Issue 1
  • DOI: 10.5334/jcms.1021218

The taxonomist - an endangered race. A practical proposal for its survival
journal, January 2011

  • Wägele, Heike; Klussmann-Kolb, Annette; Kuhlmann, Michael
  • Frontiers in Zoology, Vol. 8, Issue 1
  • DOI: 10.1186/1742-9994-8-25

Statistics for citizen science: extracting signals of change from noisy ecological data
journal, September 2014

  • Isaac, Nick J. B.; van Strien, Arco J.; August, Tom A.
  • Methods in Ecology and Evolution, Vol. 5, Issue 10
  • DOI: 10.1111/2041-210X.12254

The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016

  • Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
  • Scientific Data, Vol. 3, Issue 1
  • DOI: 10.1038/sdata.2016.18

Biodiversity in Mediterranean-climate streams of California
journal, November 2012


Current extinction rates of reptiles and amphibians
journal, October 2015


A global perspective on decadal challenges and priorities in biodiversity informatics
journal, May 2015


Leveraging the fullest potential of scientific collections through digitisation.
journal, October 2010


The data paper: a mechanism to incentivize data publishing in biodiversity science
journal, December 2011


Towards mainstreaming of biodiversity data publishing: recommendations of the GBIF Data Publishing Framework Task Group
journal, December 2011


New Push to Bring US Biological Collections to the World's Online Community
journal, September 2011


Widespread sampling biases in herbaria revealed from large-scale digitization
journal, October 2017

  • Daru, Barnabas H.; Park, Daniel S.; Primack, Richard B.
  • New Phytologist, Vol. 217, Issue 2
  • DOI: 10.1111/nph.14855

What's on the horizon for macroecology?
journal, May 2012


Digitization of Biodiversity Collections Reveals Biggest Data on Biodiversity
journal, August 2015

  • Page, Lawrence M.; MacFadden, Bruce J.; Fortes, Jose A.
  • BioScience, Vol. 65, Issue 9
  • DOI: 10.1093/biosci/biv104

The point-radius method for georeferencing locality descriptions and calculating associated uncertainty
journal, December 2004

  • Wieczorek, John; Guo, Qinghua; Hijmans, Robert
  • International Journal of Geographical Information Science, Vol. 18, Issue 8
  • DOI: 10.1080/13658810412331280211

Designing a High-Throughput Pipeline for Digitizing Pinned Insects
conference, October 2017

  • Hereld, Mark; Ferrier, Nicola J.; Agarwal, Nitin
  • 2017 IEEE 13th International Conference on e-Science (e-Science)
  • DOI: 10.1109/eScience.2017.88

The Impact of Conservation on the Status of the World’s Vertebrates
journal, October 2010

  • Hoffmann, Michael; Hilton-Taylor, Craig; Angulo, Ariadne
  • Science, Vol. 330, Issue 6010
  • DOI: 10.1126/science.1194442

Data exchange gaps in knowledge of biodiversity: implications for the management and conservation of Biosphere Reserves
journal, May 2014

  • Pino-del-Carpio, Andrea; Ariño, Arturo H.; Miranda, Rafael
  • Biodiversity and Conservation, Vol. 23, Issue 9
  • DOI: 10.1007/s10531-014-0718-2

The Anatomy of a Data Citation: Discovery, Reuse, and Credit
journal, May 2012

  • Mooney, Hailey; Newton, Mark P.
  • Journal of Librarianship and Scholarly Communication, Vol. 1, Issue 1
  • DOI: 10.7710/2162-3309.1035

Combining static and dynamic variables in species distribution models under climate change: Static and dynamic variables under climate change
journal, September 2011


Biological collections and ecological/environmental research: a review, some observations and a look to the future
journal, May 2010


Best practice for biodiversity data management and publication
journal, May 2014


A Computational- and Storage-Cloud for Integration of Biodiversity Collections
conference, October 2013

  • Matsunaga, Andrea; Thompson, Alex; Figueiredo, Renato J.
  • 2013 IEEE 9th International Conference on eScience (eScience), 2013 IEEE 9th International Conference on e-Science
  • DOI: 10.1109/eScience.2013.48

The Value of Museum Collections for Research and Society
journal, January 2004


Specimen Databases: A Case Study in Entomology using Web-based Software
journal, January 2010

  • Schuh, Randall T.; Hewson-Smith, Sheridan; Ascher, John S.
  • American Entomologist, Vol. 56, Issue 4
  • DOI: 10.1093/ae/56.4.206