DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Not just for programmers: How GitHub can accelerate collaborative and reproducible research in ecology and evolution

Journal Article · · Methods in Ecology and Evolution (Online)
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [3]; ORCiD logo [4]; ORCiD logo [3]; ORCiD logo [5]; ORCiD logo [6]; ORCiD logo [7]; ORCiD logo [8]; ORCiD logo [3]; ORCiD logo [9]; ORCiD logo [10]; ORCiD logo [11]; ORCiD logo [12]; ORCiD logo [13]; ORCiD logo [14]; ORCiD logo [13]
  1. Department of Biology Concordia University Montréal Québec Canada
  2. Département de Biologie Université de Sherbrooke Sherbrooke Québec Canada
  3. Department of Biology Carleton University Ottawa Ontario Canada
  4. Communications &, Cyber Technologies University of Arizona Arizona Tucson USA
  5. School of Natural Sciences University of California Merced California USA
  6. Terrestrial Biodiversity Norwegian Institute for Nature Research – NINA Trondheim Norway
  7. Department of Biology Georgetown University Washington District of Columbia USA
  8. Department of Human Behavior, Ecology and Culture Max Planck Institute for Evolutionary Anthropology Leipzig Germany
  9. Department of Biological Sciences University of Alberta Edmonton Alberta Canada
  10. Departments of Zoology and Botany University of British Columbia Vancouver British Columbia Canada
  11. Fisheries Department, Faculty of Natural Resources University of Guilan Sowmeh Sara Iran
  12. Department of Plant Protection, Faculty of Agriculture Ordu University Ordu Turkey
  13. Earth and Environmental Sciences Area, Lawrence Berkeley National Laboratory Berkeley California USA
  14. Cooperative Institute for Marine Resources Studies, Hatfield Marine Science Center Oregon State University Newport Oregon USA

Abstract Researchers in ecology and evolutionary biology are increasingly dependent on computational code to conduct research. Hence, the use of efficient methods to share, reproduce, and collaborate on code as well as document research is fundamental. GitHub is an online, cloud‐based service that can help researchers track, organize, discuss, share, and collaborate on software and other materials related to research production, including data, code for analyses, and protocols. Despite these benefits, the use of GitHub in ecology and evolution is not widespread. To help researchers in ecology and evolution adopt useful features from GitHub to improve their research workflows, we review 12 practical ways to use the platform. We outline features ranging from low to high technical difficulty, including storing code, managing projects, coding collaboratively, conducting peer review, writing a manuscript, and using automated and continuous integration to streamline analyses. Given that members of a research team may have different technical skills and responsibilities, we describe how the optimal use of GitHub features may vary among members of a research collaboration. As more ecologists and evolutionary biologists establish their workflows using GitHub, the field can continue to push the boundaries of collaborative, transparent, and open research.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Biological and Environmental Research (BER). Earth & Environmental Systems Science (EESS)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1971145
Journal Information:
Methods in Ecology and Evolution (Online), Journal Name: Methods in Ecology and Evolution (Online) Journal Issue: 6 Vol. 14; ISSN 2041-210X
Publisher:
Wiley-BlackwellCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (45)

A Beginner's Guide to Conducting Reproducible Research journal January 2021
The Availability of Research Data Declines Rapidly with Article Age journal January 2014
Elevating The Status of Code in Ecology journal January 2016
A solution for breaking the language barrier journal February 2022
A Guide to Using GitHub for Developing and Versioning Data Standards and Reporting Formats journal August 2021
1,500 scientists lift the lid on reproducibility journal May 2016
Democratic databases: science on GitHub journal October 2016
Reproducibility of computational workflows is automated using continuous analysis journal March 2017
If these data could talk journal September 2017
Iterative near-term ecological forecasting: Needs, opportunities, and challenges journal January 2018
Packaging Data Analytical Work Reproducibly Using R (and Friends) journal January 2018
Excuse Me, Do You Have a Moment to Talk About Version Control? journal January 2018
Why don't we share data and code? Perceived barriers and benefits to public archiving practices journal November 2022
Social network of software development at GitHub conference August 2016
Open Source-Style Collaborative Development Practices in Commercial Projects Using GitHub conference May 2015
How do scientists develop and use scientific software? conference May 2009
Git journal May 2012
Continuous Integration and Its Tools journal May 2014
PERFICT: A Re‐imagined foundation for predictive ecology journal March 2022
sPlotOpen – An environmentally balanced, open‐access, global dataset of vegetation plots journal June 2021
Historical and contemporary processes drive global phylogenetic structure across geographical scales: Insights from bat communities journal March 2023
A collaborative GIS programming course using GitHub Classroom journal July 2021
Ecology in the age of automation journal August 2021
Using Peer Code Review as an Educational Tool conference June 2020
Promoting Open Science Through Research Data Management journal July 2022
Rampant software errors may undermine scientific results journal July 2015
Unit testing, model validation, and biological simulation journal August 2016
Developing a modern data workflow for regularly updated data journal January 2019
Low availability of code in ecology: A call for urgent action journal July 2020
A Quick Introduction to Version Control with Git and GitHub journal January 2016
Ten Simple Rules for Taking Advantage of Git and GitHub journal July 2016
Open collaborative writing with Manubot journal June 2019
Sharing Detailed Research Data Is Associated with Increased Citation Rate journal March 2007
Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results journal November 2011
Changes in Data Sharing and Data Reuse Practices and Perceptions among Scientists Worldwide journal August 2015
Data sharing, management, use, and reuse: Practices and perceptions of scientists worldwide journal March 2020
Code for: Historical and contemporary processes drive global phylogenetic structure across geographical scales: insights from bat communities text January 2021
Code for: Not just for programmers: How GitHub can accelerate collaborative and reproducible research in ecology and evolution text January 2023
Increasing the reliability of ecological models using modern software engineering techniques
  • Scheller, Robert M.; Sturtevant, Brian R.; Gustafson, Eric J.
  • Frontiers in Ecology and the Environment, Vol. 8, Issue 5 https://doi.org/10.1890/080141
journal June 2010
Improving the culture of interdisciplinary collaboration in ecology by expanding measures of success journal February 2014
Assessing Pathways of Climate Change Effects in SpaDES: An Application to Boreal Landbirds of Northwest Territories Canada journal October 2021
Data Sharing: Convert Challenges into Opportunities journal December 2017
Foundational Practices of Research Data Management journal July 2020
Sustainable Research Software Hand-Over journal April 2021
A game theoretic analysis of research data sharing journal September 2015