DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies

Abstract

The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies andmore » popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.« less

Authors:
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [2]; ORCiD logo [3]; ORCiD logo [2]; ORCiD logo [1]
  1. Vanderbilt Univ., Nashville, TN (United States). Dept. of Biological Sciences
  2. Univ. of Wisconsin, Madison, WI (United States). Wisconsin Energy Inst., J. F. Crow Inst. for the Study of Evolution, Lab. of Genetics, Genome Center of Wisconsin, Dept. of Energy Great Lakes Bioenergy Research Center
  3. US Dept. of Agriculture (USDA)., Peoria, IL (United States). National Center for Agricultural Utilization Research, Agricultural Research Service, Mycotoxin Prevention and Applied Microbiology Research Unit
Publication Date:
Research Org.:
Argonne National Laboratory (ANL), Argonne, IL (United States). Advanced Photon Source (APS); Univ. of Wisconsin, Madison (United States); Lawrence Berkeley National Laboratory (LBNL), CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER); National Institutes of Health (NIH); National Science Foundation (NSF)
OSTI Identifier:
1373365
Alternate Identifier(s):
OSTI ID: 1378364
Grant/Contract Number:  
AC02-06CH11357; AC02-05CH11231; FC02-07ER64494
Resource Type:
Accepted Manuscript
Journal Name:
G3
Additional Journal Information:
Journal Volume: 6; Journal Issue: 11; Journal ID: ISSN 2160-1836
Publisher:
Genetics Society of America
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Genome Sequencing; High-Throughput Sequencing; De Novo Assembly; Experimental Design; Simulation; Nonmodel Organism

Citation Formats

Zhou, Xiaofan, Peris, David, Kominek, Jacek, Kurtzman, Cletus P., Hittinger, Chris Todd, and Rokas, A. in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies. United States: N. p., 2016. Web. doi:10.1534/g3.116.034249.
Zhou, Xiaofan, Peris, David, Kominek, Jacek, Kurtzman, Cletus P., Hittinger, Chris Todd, & Rokas, A. in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies. United States. https://doi.org/10.1534/g3.116.034249
Zhou, Xiaofan, Peris, David, Kominek, Jacek, Kurtzman, Cletus P., Hittinger, Chris Todd, and Rokas, A. Fri . "in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies". United States. https://doi.org/10.1534/g3.116.034249. https://www.osti.gov/servlets/purl/1373365.
@article{osti_1373365,
title = {in silico Whole Genome Sequencer & Analyzer (iWGS): A Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies},
author = {Zhou, Xiaofan and Peris, David and Kominek, Jacek and Kurtzman, Cletus P. and Hittinger, Chris Todd and Rokas, A.},
abstractNote = {The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms have augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to enable the user to have great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with a detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.},
doi = {10.1534/g3.116.034249},
journal = {G3},
number = 11,
volume = 6,
place = {United States},
year = {Fri Sep 16 00:00:00 EDT 2016},
month = {Fri Sep 16 00:00:00 EDT 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 25 works
Citation information provided by
Web of Science

Figures / Tables:

Figure 1 Figure 1: iWGS workflow. A typical iWGS analysis consists of four steps: (1) data simulation (optional); (2) preprocessing (optional); (3) de novo assembly; and (4) assembly evaluation. iWGS supports both Illumina short reads and PacBio long reads, and a wide selection of assemblers to enable de novo assembly using eithermore » or both types of data. Users can start the analysis simulating data drawn from a reference genome assembly or, alternatively, use real sequencing data as input and skip the simulation step. iWGS, in silico Whole Genome Sequencer and Analyzer; MP, mate-pair; PacBio, Pacific Biosciences; PE, paired-end.« less

Save / Share:

Works referenced in this record:

REAPR: a universal tool for genome assembly evaluation
journal, January 2013


Genome 10K: A Proposal to Obtain Whole-Genome Sequence for 10 000 Vertebrate Species
journal, November 2009


QUAST: quality assessment tool for genome assemblies
journal, February 2013


Harnessing genomics for evolutionary insights
journal, April 2009


Comprehensive variation discovery in single human genomes
journal, October 2014

  • Weisenfeld, Neil I.; Yin, Shuangye; Sharpe, Ted
  • Nature Genetics, Vol. 46, Issue 12
  • DOI: 10.1038/ng.3121

A complete bacterial genome assembled de novo using only nanopore sequencing data
journal, June 2015

  • Loman, Nicholas J.; Quick, Joshua; Simpson, Jared T.
  • Nature Methods, Vol. 12, Issue 8
  • DOI: 10.1038/nmeth.3444

Mitochondrial genome evolution in yeasts: an all-encompassing view
journal, May 2015

  • Freel, Kelle C.; Friedrich, Anne; Schacherer, Joseph
  • FEMS Yeast Research, Vol. 15, Issue 4
  • DOI: 10.1093/femsyr/fov023

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data
journal, May 2013

  • Chin, Chen-Shan; Alexander, David H.; Marks, Patrick
  • Nature Methods, Vol. 10, Issue 6
  • DOI: 10.1038/nmeth.2474

High-quality draft assemblies of mammalian genomes from massively parallel sequence data
journal, December 2010

  • Gnerre, S.; MacCallum, I.; Przybylski, D.
  • Proceedings of the National Academy of Sciences, Vol. 108, Issue 4
  • DOI: 10.1073/pnas.1017351108

Meraculous: De Novo Genome Assembly with Short Paired-End Reads
journal, August 2011


A Whole-Genome Assembly of Drosophila
journal, March 2000


Characterizing and measuring bias in sequence data
journal, January 2013


Genome sequence of the human malaria parasite Plasmodium falciparum
journal, October 2002

  • Gardner, Malcolm J.; Hall, Neil; Fung, Eula
  • Nature, Vol. 419, Issue 6906
  • DOI: 10.1038/nature01097

Field guide to next-generation DNA sequencers: FIELD GUIDE TO NEXT-GEN SEQUENCERS
journal, May 2011


Mutation rates in mammalian genomes
journal, January 2002

  • Kumar, S.; Subramanian, S.
  • Proceedings of the National Academy of Sciences, Vol. 99, Issue 2
  • DOI: 10.1073/pnas.022629899

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
journal, January 2012

  • Chang, Jia-Ming; Di Tommaso, Paolo; Taly, Jean-Francois
  • BMC Bioinformatics, Vol. 13, Issue Suppl 4
  • DOI: 10.1186/1471-2105-13-S4-S1

Comparative genomics reveals insights into avian genome evolution and adaptation
journal, December 2014


Sequence assembly demystified
journal, January 2013

  • Nagarajan, Niranjan; Pop, Mihai
  • Nature Reviews Genetics, Vol. 14, Issue 3
  • DOI: 10.1038/nrg3367

Metassembler: merging and optimizing de novo genome assemblies
journal, September 2015


Inferring ancient divergences requires genes with strong phylogenetic signals
journal, May 2013


ART: a next-generation sequencing read simulator
journal, December 2011


The Theory and Practice of Genome Sequence Assembly
journal, August 2015


DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies
journal, August 2016

  • Ye, Chengxi; Hill, Christopher M.; Wu, Shigang
  • Scientific Reports, Vol. 6, Issue 1
  • DOI: 10.1038/srep31900

Reducing assembly complexity of microbial genomes with single-molecule sequencing
journal, January 2013


Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
journal, February 2008


Genome-scale approaches to resolving incongruence in molecular phylogenies
journal, October 2003

  • Rokas, Antonis; Williams, Barry L.; King, Nicole
  • Nature, Vol. 425, Issue 6960
  • DOI: 10.1038/nature02053

Creating a Buzz About Insect Genomes
journal, March 2011


Patterns of Positive Selection in Seven Ant Genomes
journal, April 2014

  • Roux, Julien; Privman, Eyal; Moretti, Sébastien
  • Molecular Biology and Evolution, Vol. 31, Issue 7
  • DOI: 10.1093/molbev/msu141

Assemblathon 1: A competitive assessment of de novo short read assembly methods
journal, September 2011


Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads
journal, April 2014

  • Kajitani, Rei; Toshimoto, Kouta; Noguchi, Hideki
  • Genome Research, Vol. 24, Issue 8
  • DOI: 10.1101/gr.170720.113

Prevention, diagnosis and treatment of high-throughput sequencing data pathologies
journal, March 2014

  • Zhou, Xiaofan; Rokas, Antonis
  • Molecular Ecology, Vol. 23, Issue 7
  • DOI: 10.1111/mec.12680

Using Cascading Bloom Filters to Improve the Memory Usage for de Brujin Graphs
book, January 2013


SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
journal, May 2012

  • Bankevich, Anton; Nurk, Sergey; Antipov, Dmitry
  • Journal of Computational Biology, Vol. 19, Issue 5
  • DOI: 10.1089/cmb.2012.0021

Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage
journal, July 2016

  • Chakraborty, Mahul; Baldwin-Brown, James G.; Long, Anthony D.
  • Nucleic Acids Research
  • DOI: 10.1093/nar/gkw654

Erratum: Overview of the yeast genome
journal, June 1997

  • Mewes, H. W.; Albermann, K.; Bähr, M.
  • Nature, Vol. 387, Issue 6634
  • DOI: 10.1038/42755

The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification
journal, October 2014

  • Reddy, T. B. K.; Thomas, Alex D.; Stamatis, Dimitri
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku950

PBSIM: PacBio reads simulator—toward accurate genome assembly
journal, November 2012


Automated ensemble assembly and validation of microbial genomes
journal, May 2014

  • Koren, Sergey; Treangen, Todd J.; Hill, Christopher M.
  • BMC Bioinformatics, Vol. 15, Issue 1
  • DOI: 10.1186/1471-2105-15-126

Efficient de novo assembly of large genomes using compressed data structures
journal, December 2011


Genome sequencing of chimpanzee malaria parasites reveals possible pathways of adaptation to human hosts
journal, September 2014

  • Otto, Thomas D.; Rayner, Julian C.; Böhme, Ulrike
  • Nature Communications, Vol. 5, Issue 1
  • DOI: 10.1038/ncomms5754

Simulation of Genome-Wide Evolution under Heterogeneous Substitution Models and Complex Multispecies Coalescent Histories
journal, March 2014

  • Arenas, Miguel; Posada, David
  • Molecular Biology and Evolution, Vol. 31, Issue 5
  • DOI: 10.1093/molbev/msu078

The Genome Sequence of Saccharomyces eubayanus and the Domestication of Lager-Brewing Yeasts
journal, August 2015

  • Baker, EmilyClare; Wang, Bing; Bellora, Nicolas
  • Molecular Biology and Evolution, Vol. 32, Issue 11
  • DOI: 10.1093/molbev/msv168

Finished bacterial genomes from shotgun sequence data
journal, July 2012


SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler
journal, December 2012


Genomics and the making of yeast biodiversity
journal, December 2015

  • Hittinger, Chris Todd; Rokas, Antonis; Bai, Feng-Yan
  • Current Opinion in Genetics & Development, Vol. 35
  • DOI: 10.1016/j.gde.2015.10.008

Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
journal, June 2014


RAMPART: a workflow management system for de novo genome assembly
journal, January 2015


GAGE-B: an evaluation of genome assemblers for bacterial organisms
journal, May 2013


One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly
journal, February 2015


pIRS: Profile-based Illumina pair-end reads simulator
journal, April 2012


The MaSuRCA genome assembler
journal, August 2013


Informed and automated k-mer size selection for genome assembly
journal, June 2013


Genome Sequence and Analysis of a Stress-Tolerant, Wild-Derived Strain of Saccharomyces cerevisiae Used in Biofuels Research
journal, April 2016

  • McIlwain, Sean J.; Peris, David; Sardi, Maria
  • G3: Genes|Genomes|Genetics, Vol. 6, Issue 6
  • DOI: 10.1534/g3.116.029389

Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi
journal, December 2012


ABySS: A parallel assembler for short read sequence data
journal, February 2009


Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
journal, December 2000

  • Arabidopsis Genome Initiative,
  • Nature, Vol. 408, Issue 6814, p. 796-815
  • DOI: 10.1038/35048692

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing
journal, May 2015

  • Berlin, Konstantin; Koren, Sergey; Chin, Chen-Shan
  • Nature Biotechnology, Vol. 33, Issue 6
  • DOI: 10.1038/nbt.3238

Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement
journal, November 2014


The complete sequence of the mitochondrial genome of Saccharomyces cerevisiae
journal, December 1998


Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies
journal, December 2014


Mitochondrial inheritance in budding yeasts: towards an integrated understanding
journal, November 2010


GAGE: A critical evaluation of genome assemblies and assembly algorithms
journal, January 2012

  • Salzberg, S. L.; Phillippy, A. M.; Zimin, A.
  • Genome Research, Vol. 22, Issue 3
  • DOI: 10.1101/gr.131383.111

The Genome Sequence of Drosophila melanogaster
journal, March 2000


Overview of the yeast genome
journal, May 1997

  • Mewes, H. W.; Albermann, K.; Bähr, M.
  • Nature, Vol. 387, Issue S6632
  • DOI: 10.1038/387s007

Exploiting sparseness in de novo genome assembly
journal, April 2012


One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly
journal, February 2015


Mitochondrial inheritance in budding yeasts: towards an integrated understanding
journal, November 2010


Harnessing genomics for evolutionary insights
journal, April 2009


Genome sequence of the human malaria parasite Plasmodium falciparum
journal, October 2002

  • Gardner, Malcolm J.; Hall, Neil; Fung, Eula
  • Nature, Vol. 419, Issue 6906
  • DOI: 10.1038/nature01097

Genome-scale approaches to resolving incongruence in molecular phylogenies
journal, October 2003

  • Rokas, Antonis; Williams, Barry L.; King, Nicole
  • Nature, Vol. 425, Issue 6960
  • DOI: 10.1038/nature02053

Inferring ancient divergences requires genes with strong phylogenetic signals
journal, May 2013


Genome sequencing of chimpanzee malaria parasites reveals possible pathways of adaptation to human hosts
journal, September 2014

  • Otto, Thomas D.; Rayner, Julian C.; Böhme, Ulrike
  • Nature Communications, Vol. 5, Issue 1
  • DOI: 10.1038/ncomms5754

Comprehensive variation discovery in single human genomes
journal, October 2014

  • Weisenfeld, Neil I.; Yin, Shuangye; Sharpe, Ted
  • Nature Genetics, Vol. 46, Issue 12
  • DOI: 10.1038/ng.3121

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data
journal, May 2013

  • Chin, Chen-Shan; Alexander, David H.; Marks, Patrick
  • Nature Methods, Vol. 10, Issue 6
  • DOI: 10.1038/nmeth.2474

A complete bacterial genome assembled de novo using only nanopore sequencing data
journal, June 2015

  • Loman, Nicholas J.; Quick, Joshua; Simpson, Jared T.
  • Nature Methods, Vol. 12, Issue 8
  • DOI: 10.1038/nmeth.3444

Sequence assembly demystified
journal, January 2013

  • Nagarajan, Niranjan; Pop, Mihai
  • Nature Reviews Genetics, Vol. 14, Issue 3
  • DOI: 10.1038/nrg3367

DBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies
journal, August 2016

  • Ye, Chengxi; Hill, Christopher M.; Wu, Shigang
  • Scientific Reports, Vol. 6, Issue 1
  • DOI: 10.1038/srep31900

Mutation rates in mammalian genomes
journal, January 2002

  • Kumar, S.; Subramanian, S.
  • Proceedings of the National Academy of Sciences, Vol. 99, Issue 2
  • DOI: 10.1073/pnas.022629899

High-quality draft assemblies of mammalian genomes from massively parallel sequence data
journal, December 2010

  • Gnerre, S.; MacCallum, I.; Przybylski, D.
  • Proceedings of the National Academy of Sciences, Vol. 108, Issue 4
  • DOI: 10.1073/pnas.1017351108

SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
journal, May 2012

  • Bankevich, Anton; Nurk, Sergey; Antipov, Dmitry
  • Journal of Computational Biology, Vol. 19, Issue 5
  • DOI: 10.1089/cmb.2012.0021

ART: a next-generation sequencing read simulator
journal, December 2011


pIRS: Profile-based Illumina pair-end reads simulator
journal, April 2012


PBSIM: PacBio reads simulator—toward accurate genome assembly
journal, November 2012


QUAST: quality assessment tool for genome assemblies
journal, February 2013


GAGE-B: an evaluation of genome assemblers for bacterial organisms
journal, May 2013


The MaSuRCA genome assembler
journal, August 2013


Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences
journal, June 2014


RAMPART: a workflow management system for de novo genome assembly
journal, January 2015


Mitochondrial genome evolution in yeasts: an all-encompassing view
journal, May 2015

  • Freel, Kelle C.; Friedrich, Anne; Schacherer, Joseph
  • FEMS Yeast Research, Vol. 15, Issue 4
  • DOI: 10.1093/femsyr/fov023

Genome 10K: A Proposal to Obtain Whole-Genome Sequence for 10 000 Vertebrate Species
journal, November 2009


Simulation of Genome-Wide Evolution under Heterogeneous Substitution Models and Complex Multispecies Coalescent Histories
journal, March 2014

  • Arenas, Miguel; Posada, David
  • Molecular Biology and Evolution, Vol. 31, Issue 5
  • DOI: 10.1093/molbev/msu078

The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification
journal, October 2014

  • Reddy, T. B. K.; Thomas, Alex D.; Stamatis, Dimitri
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku950

Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage
journal, July 2016

  • Chakraborty, Mahul; Baldwin-Brown, James G.; Long, Anthony D.
  • Nucleic Acids Research
  • DOI: 10.1093/nar/gkw654

Metassembler: Merging and optimizing de novo genome assemblies
journal, March 2015

  • Wences, Alejandro Hernandez; Schatz, Michel C.
  • Genome Biology
  • DOI: 10.1101/016352

Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
journal, February 2008


ABySS: A parallel assembler for short read sequence data
journal, February 2009


Assemblathon 1: A competitive assessment of de novo short read assembly methods
journal, September 2011


Efficient de novo assembly of large genomes using compressed data structures
journal, December 2011


GAGE: A critical evaluation of genome assemblies and assembly algorithms
journal, January 2012

  • Salzberg, S. L.; Phillippy, A. M.; Zimin, A.
  • Genome Research, Vol. 22, Issue 3
  • DOI: 10.1101/gr.131383.111

Prevention, diagnosis and treatment of high-throughput sequencing data pathologies
journal, March 2014

  • Zhou, Xiaofan; Rokas, Antonis
  • Molecular Ecology, Vol. 23, Issue 7
  • DOI: 10.1111/mec.12680

Comparative genomics reveals insights into avian genome evolution and adaptation
journal, December 2014


The Genome Sequence of Drosophila melanogaster
journal, March 2000


Creating a Buzz About Insect Genomes
journal, March 2011


The Theory and Practice of Genome Sequence Assembly
journal, August 2015


Assessment of de novoassemblers for draft genomes: a case study with fungal genomes
journal, December 2014


Using cascading Bloom filters to improve the memory usage for de Brujin graphs
journal, January 2014

  • Salikhov, Kamil; Sacomoto, Gustavo; Kucherov, Gregory
  • Algorithms for Molecular Biology, Vol. 9, Issue 1
  • DOI: 10.1186/1748-7188-9-2

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
journal, July 2013

  • Bradnam, Keith R.; Fass, Joseph N.; Alexandrov, Anton
  • GigaScience, Vol. 2, Issue 1
  • DOI: 10.1186/2047-217x-2-10

REAPR: a universal tool for genome assembly evaluation
journal, January 2013


Characterizing and measuring bias in sequence data
journal, January 2013


Automated ensemble assembly and validation of microbial genomes
text, January 2014

  • Koren, Sergey; Treangen, Todd J.; Hill, Christopher M.
  • Springer Nature
  • DOI: 10.13016/4pfi-yvf6

Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies
journal, December 2014


Meraculous: De Novo Genome Assembly with Short Paired-End Reads
journal, August 2011


Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement
journal, November 2014


Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi
journal, December 2012


Informed and Automated k-Mer Size Selection for Genome Assembly
preprint, January 2013


Patterns of positive selection in seven ant genomes
preprint, January 2013


Works referencing / citing this record:

Repeat-aware evaluation of scaffolding tools
journal, March 2018


Eukaryotic Acquisition of a Bacterial Operon
journal, August 2018

  • Kominek, Jacek; Doering, Drew T.; Opulente, Dana A.
  • Cell
  • DOI: 10.1101/399394

sppIDer: A Species Identification Tool to Investigate Hybrid Genomes with High-Throughput Sequencing
journal, September 2018

  • Langdon, Quinn K.; Peris, David; Kyle, Brian
  • Molecular Biology and Evolution
  • DOI: 10.1093/molbev/msy166

Hybridization and adaptive evolution of diverse Saccharomyces species for cellulosic biofuel production
journal, March 2017

  • Peris, David; Moriarty, Ryan V.; Alexander, William G.
  • Biotechnology for Biofuels, Vol. 10, Issue 1
  • DOI: 10.1186/s13068-017-0763-7

Fermentation innovation through complex hybridization of wild and domesticated yeasts
journal, October 2019

  • Langdon, Quinn K.; Peris, David; Baker, EmilyClare P.
  • Nature Ecology & Evolution, Vol. 3, Issue 11
  • DOI: 10.1038/s41559-019-0998-8

Next-generation forward genetic screens: using simulated data to improve the design of mapping-by-sequencing experiments in Arabidopsis
journal, September 2019

  • Wilson-Sánchez, David; Lup, Samuel Daniel; Sarmiento-Mañús, Raquel
  • Nucleic Acids Research, Vol. 47, Issue 21
  • DOI: 10.1093/nar/gkz806

Evolutionary instability of CUG-Leu in the genetic code of budding yeasts
journal, May 2018

  • Krassowski, Tadeusz; Coughlan, Aisling Y.; Shen, Xing-Xing
  • Nature Communications, Vol. 9, Issue 1
  • DOI: 10.1038/s41467-018-04374-7

Mitochondria-encoded genes contribute to evolution of heat and cold tolerance in yeast
journal, January 2019

  • Li, Xueying C.; Peris, David; Hittinger, Chris Todd
  • Science Advances, Vol. 5, Issue 1
  • DOI: 10.1126/sciadv.aav1848

Solving scaffolding problem with repeats
posted_content, May 2018


Extensive loss of cell-cycle and DNA repair genes in an ancient lineage of bipolar budding yeasts
journal, May 2019


Evolutionary instability of CUG-Leu in the genetic code of budding yeasts
journal, May 2018

  • Krassowski, Tadeusz; Coughlan, Aisling Y.; Shen, Xing-Xing
  • Nature Communications, Vol. 9, Issue 1
  • DOI: 10.1038/s41467-018-04374-7

Repeat-aware evaluation of scaffolding tools
journal, March 2018


Next-generation forward genetic screens: using simulated data to improve the design of mapping-by-sequencing experiments in Arabidopsis
journal, September 2019

  • Wilson-Sánchez, David; Lup, Samuel Daniel; Sarmiento-Mañús, Raquel
  • Nucleic Acids Research, Vol. 47, Issue 21
  • DOI: 10.1093/nar/gkz806

Solving scaffolding problem with repeats
posted_content, May 2018


Mitochondria-encoded genes contribute to evolution of heat and cold tolerance in yeast
journal, January 2019

  • Li, Xueying C.; Peris, David; Hittinger, Chris Todd
  • Science Advances, Vol. 5, Issue 1
  • DOI: 10.1126/sciadv.aav1848

Extensive loss of cell-cycle and DNA repair genes in an ancient lineage of bipolar budding yeasts
journal, May 2019


Figures/Tables have been extracted from DOE-funded journal article accepted manuscripts.