DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Research applications of primary biodiversity databases in the digital age

Journal Article · PLoS ONE

Our world is in the midst of unprecedented change: climate shifts and sustained, widespread habitat degradation have led to dramatic declines in biodiversity rivaling historical extinction events. At the same time, new approaches to publishing and integrating previously disconnected data resources promise to help provide the evidence needed for more efficient and effective conservation and management. Stakeholders have invested considerable resources to contribute to online databases of species occurrences. However, estimates suggest that only 10% of biocollections are available in digital form. The biocollections community must therefore continue to promote digitization efforts, which in part requires demonstrating compelling applications of the data. Our overarching goal is therefore to determine trends in use of mobilized species occurrence data since 2010, as online systems have grown and now provide over one billion records. To do this, we characterized 501 papers that use openly accessible biodiversity databases. Our standardized tagging protocol was based on key topics of interest, including: database(s) used, taxa addressed, general uses of data, other data types linked to species occurrence data, and data quality issues addressed. We found that the most common uses of online biodiversity databases have been to estimate species distribution and richness, to outline data compilation and publication, and to assist in developing species checklists or describing new species. Only 69% of papers in our dataset addressed one or more aspects of data quality, which is low considering common errors and biases known to exist in opportunistic datasets. Globally, we find that biodiversity databases are still in the initial stages of data compilation. Novel and integrative applications are restricted to certain taxonomic groups and regions with higher numbers of quality records. Continued data digitization, publication, enhancement, and quality control efforts are necessary to make biodiversity science more efficient and relevant in our fast-changing environment.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1580870
Journal Information:
PLoS ONE, Vol. 14, Issue 9; ISSN 1932-6203
Publisher:
Public Library of Science
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 61 works
Citation information provided by
Web of Science

Cited By (5)

A New Overlooked Species of Cyrtopodium (Cymbidieae, Orchidaceae) from the Southern Andean Yungas and Chaco Serrano Ecoregions of Northern Argentina and Southwestern Bolivia journal March 2023
Check list of the Venezuelan millipedes species journal October 2019
Tests in a semi-natural environment suggest that bait and switch strategy could be used to control invasive Common Carp journal January 2020
The Odonata of Quebec: Specimen data from seven collections journal February 2020
Assessment of North American arthropod collections: prospects and challenges for addressing biodiversity research journal November 2019