Title: Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

Journal Article · Nature (London)
DOI: https://doi.org/10.1038/nature15714 · OSTI ID: 1436509
 [1];  [1];  [2];  [3];  [4];  [5];  [6];  [6];  [6];  [7];  [4];  [8];  [9];  [9];  [10];  [1]
  1. Donald Danforth Plant Science Center, St. Louis, MO (United States)
  2. Univ. of California, Berkeley, CA (United States); Michigan State Univ., East Lansing, MI (United States)
  3. Univ. of Arizona, Tucson, AZ (United States); Fujian Agriculture and Forestry Univ., Fuzhou (China)
  4. Univ. of California, Berkeley, CA (United States)
  5. Univ. of Bonn, Bonn (Germany); Central Univ. of Tamil Nadu, Thiruvarur (India)
  6. Pacific Biosciences, Menlo Park, CA (United States)
  7. Univ. of Arizona, Tucson, AZ (United States)
  8. Univ. of Bonn, Bonn (Germany)
  9. BioNano Genomics, San Diego, CA (United States)
  10. Ibis Biosciences, Carlsbad, CA (United States)

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly [1]. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
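The abstract summarizes assembly contiguity with an N50 length (2.4 megabases across 625 contigs). For readers unfamiliar with the metric, the following is a minimal sketch of how an N50 is conventionally computed: the contig length at which contigs of that size or longer hold at least half of the assembled bases. The function name `n50` and the toy contig lengths are illustrative assumptions, not data or code from the paper. The last line only repeats the arithmetic implied by the abstract's statement that 244 megabases represent 99% of the genome.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of all assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)   # longest contigs first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy contig lengths (hypothetical, NOT the Oropetium contigs).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Arithmetic implied by the abstract: 244 Mb assembled is stated to be 99%
# of the genome, which implies a total size of roughly 246 Mb.
implied_genome_mb = 244 / 0.99
```

In the toy example the total is 300, so the running sum first reaches half (150) at the second-longest contig, giving an N50 of 70. Because a few long contigs dominate the running sum, N50 rewards contiguity rather than the number of contigs.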

Research Organization: Donald Danforth Plant Science Center, St. Louis, MO (United States)
Sponsoring Organization: USDOE
Grant/Contract Number: SC0012639
OSTI ID: 1436509
Journal Information: Nature (London), Vol. 527, Issue 7579; ISSN 0028-0836
Publisher: Nature Publishing Group
Country of Publication: United States
Language: English
Citation Metrics: Cited by 209 works (citation information provided by Web of Science)

References (60)

The Sorghum bicolor genome and the diversification of grasses journal January 2009
Topological analysis and interactive visualization of biological networks and protein structures journal March 2012
Do Plants Have a One-Way Ticket to Genomic Obesity? journal September 1997
Fast and accurate short read alignment with Burrows-Wheeler transform journal May 2009
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data journal May 2013
Trimmomatic: a flexible trimmer for Illumina sequence data journal April 2014
The map-based sequence of the rice genome journal August 2005
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks journal March 2012
A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome journal July 2014
Full-length transcriptome assembly from RNA-Seq data without a reference genome journal May 2011
Architecture and evolution of a minute plant genome journal May 2013
How to usefully compare homologous plant genes and chromosomes as DNA sequences: How to usefully compare plant genomes journal February 2008
Screening synteny blocks in pairwise genome comparisons through integer programming journal April 2011
The Spirodela polyrhiza genome reveals insights into its neotenous reduction fast growth and aquatic lifestyle journal February 2014
The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus) journal April 2008
Adaptive seeds tame genomic sequence comparison journal January 2011
Tandem repeats finder: a program to analyze DNA sequences journal January 1999
Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution journal January 2013
A Solution to the C-Value Paradox and the Function of Junk DNA: The Genome Balance Hypothesis journal June 2015
STRING v9.1: protein-protein interaction networks, with increased coverage and integration journal November 2012
Genome size is a strong predictor of cell size and stomatal density in angiosperms journal September 2008
LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons journal January 2008
Reference genome sequence of the model plant Setaria journal May 2012
Preparation of megabase-size DNA from plant nuclei journal January 1995
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana journal December 2000
Pfam: the protein families database journal November 2013
The B73 Maize Genome: Complexity, Diversity, and Dynamics journal November 2009
CD-HIT Suite: a web server for clustering and comparing biological sequences journal January 2010
Repbase Update, a database of eukaryotic repetitive elements journal January 2005
Patching gaps in plant genomes results in gene movement and erosion of colinearity journal June 2010
InterProScan: protein domains identifier journal July 2005
MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes journal November 2007
Rice by the numbers: A good grain journal October 2014
Genome sequencing and analysis of the model grass Brachypodium distachyon journal February 2010
Considering Transposable Element Diversification in De Novo Annotation Approaches journal January 2011
The miniature genome of a carnivorous plant Genlisea aurea contains a low number of genes and short non-coding sequences journal January 2013
Comparative Genomic Paleontology across Plant Kingdom Reveals the Dynamics of TE-Driven Genome Evolution journal February 2013
Plant genome size variation: bloating and purging DNA journal March 2014
OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes journal September 2003
Angiosperm genome comparisons reveal early polyploidy in the monocot lineage journal December 2009
Resolving the complexity of the human genome using single-molecule sequencing journal November 2014
The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools journal December 2011
A travel guide to Cytoscape plugins journal November 2012
Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology journal December 2014
De novo identification of repeat families in large genomes journal June 2005
The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data journal July 2010
Genome conflict in the gramineae journal November 2004
Progress, challenges and the future of crop genomes journal April 2015
The Universal Protein Resource (UniProt): an expanding universe of protein information journal January 2006
Defining functional DNA elements in the human genome journal April 2014
Assembling large genomes with single-molecule sequencing and locality-sensitive hashing journal May 2015
Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly journal July 2012
Characterization of the human ESC transcriptome by hybrid sequencing journal November 2013
Ancient genomes reveal social and genetic structure of Late Neolithic Switzerland journal April 2020
Characterization and functional analysis of phytoene synthase gene family in tobacco journal January 2021
