skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A survey of the sorghum transcriptome using single-molecule long reads

Journal Article · · Nature Communications
DOI:https://doi.org/10.1038/ncomms11706· OSTI ID:1306471

Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.

Research Organization:
Colorado State Univ., Fort Collins, CO (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
SC0010733
OSTI ID:
1306471
Journal Information:
Nature Communications, Vol. 7; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 345 works
Citation information provided by
Web of Science

References (59)

Functional consequences of developmentally regulated alternative splicing journal September 2011
Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation journal July 2012
Genome-wide landscape of polyadenylation in Arabidopsis provides evidence for extensive alternative polyadenylation journal July 2011
Complexity of the Alternative Splicing Landscape in Plants journal October 2013
Alternative cleavage and polyadenylation: extent, regulation and function journal June 2013
Regulation of Alternative Splicing Through Coupling with Transcription and Chromatin Structure journal June 2015
Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing journal November 2008
Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis journal March 2012
Alternative Splicing of Pre-Messenger RNAs in Plants in the Genomic Era journal June 2007
Alternative splicing of pre-mRNAs of Arabidopsis serine/arginine-rich proteins: regulation by hormones and stresses: Stress regulation of alternative splicing of SR genes journal February 2007
Ectopic Expression of atRSZ33 Reveals Its Function in Splicing and Causes Pleiotropic Changes in Development journal September 2003
Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins journal August 2006
atSRp30, one of two SF2/ASF-like proteins from Arabidopsis thaliana, regulates splicing of specific plant genes journal April 1999
The Arabidopsis splicing factor SR1 is regulated by alternative splicing journal January 2000
Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences journal September 2004
Genomewide comparative analysis of alternative splicing in plants journal April 2006
Genome-wide mapping of alternative splicing in Arabidopsis thaliana journal October 2009
Discovery and Expression Analysis of Alternative Splicing Events Conserved among Plant SR Proteins journal December 2013
Identification of an intronic splicing regulatory element involved in auto-regulation of alternative splicing of SCL33 pre-mRNA journal October 2012
Genome-Wide Analysis of Alternative Splicing Landscapes Modulated during Plant-Virus Interactions in Brachypodium distachyon journal January 2015
Genome-Wide Analysis of Alternative Splicing in Zea mays: Landscape and Genetic Regulation journal September 2014
Assessment of transcript reconstruction methods for RNA-seq journal November 2013
A single-molecule long-read survey of the human transcriptome journal October 2013
Characterization of the human ESC transcriptome by hybrid sequencing journal November 2013
Exploiting single-molecule transcript sequencing for eukaryotic gene prediction journal September 2015
Full-length transcriptome sequences and splice variants obtained by a combination of sequencing platforms applied to different root tissues of Salvia miltiorrhiza and tanshinone biosynthesis journal May 2015
A near complete snapshot of the Zea mays seedling transcriptome revealed from ultra-deep sequencing journal March 2014
The Sorghum bicolor genome and the diversification of grasses journal January 2009
Whole-genome sequencing reveals untapped genetic potential in Africa’s indigenous cereal crop sorghum journal August 2013
Bio-Fuel Crops Research for Energy Security and Rural Development in Developing Countries journal October 2008
Sweet sorghum as a model system for bioenergy crops journal June 2012
LoRDEC: accurate and efficient long read error correction journal August 2014
proovread : large-scale high-accuracy PacBio correction through iterative short read consensus journal July 2014
Genome-wide survey of Alternative Splicing in Sorghum Bicolor journal June 2014
Coupling mRNA processing with transcription in time and space journal February 2014
Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylation journal April 2008
Transcriptome dynamics through alternative polyadenylation in developmental and environmental responses in plants revealed by deep sequencing journal August 2011
FY Is an RNA 3′ End-Processing Factor that Interacts with FCA to Control the Arabidopsis Floral Transition journal June 2003
Targeted 3' Processing of Antisense Transcripts Triggers Arabidopsis FLC Chromatin Silencing journal December 2009
Poly(A)-tail profiling reveals an embryonic switch in translational control journal January 2014
Regulation of mRNA Translation and Stability by microRNAs journal June 2010
Both introns and long 3′-UTRs operate as cis-acting elements to trigger nonsense-mediated decay in plants journal November 2006
MEME SUITE: tools for motif discovery and searching journal May 2009
Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures journal June 2005
The contribution of AAUAAA and the upstream element UUUGUA to the efficiency of mRNA 3′-end formation in plants. journal May 1994
GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data journal August 2012
miRBase: microRNA sequences, targets and gene nomenclature journal January 2006
miRBase: tools for microRNA genomics journal December 2007
miRBase: integrating microRNA annotation and deep-sequencing data journal October 2010
miRBase: annotating high confidence microRNAs using deep sequencing data journal November 2013
Long non-coding RNAs and their functions in plants journal October 2015
Genome-wide identification of long noncoding natural antisense transcripts and their responses to light in Arabidopsis journal January 2014
GMAP: a genomic mapping and alignment program for mRNA and EST sequences journal February 2005
Functional annotation of the transcriptome of Sorghum bicolor in response to osmotic stress and abscisic acid journal October 2011
SpliceGrapher: detecting patterns of alternative splicing from RNA-Seq data in the context of gene models and EST data journal January 2012
miRBase: tools for microRNA genomics. text January 2008
Assessment of transcript reconstruction methods for RNA-seq journalarticle January 2018
miRBase: microRNA sequences, targets and gene nomenclature. text January 2006
The Sorghum bicolor genome and the diversification of grasses text January 2009

Cited By (108)

UNAGI: an automated pipeline for nanopore full-length cDNA sequencing uncovers novel transcripts and isoforms in yeast journal January 2020
Physiological and RNA-seq analyses provide insights into the response mechanism of the Cf-10-mediated resistance to Cladosporium fulvum infection in tomato journal January 2018
Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis journal August 2017
Multi-strategic RNA-seq analysis reveals a high-resolution transcriptional landscape in cotton journal October 2019
Poly(A) inclusive RNA isoform sequencing (PAIso−seq) reveals wide-spread non-adenosine residues within RNA poly(A) tails journal November 2019
RNA sequencing and swarm intelligence–enhanced classification algorithm development for blood-based disease diagnostics using spliced blood platelet RNA journal March 2019
De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species journal February 2017
Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes journal March 2017
Onco-proteogenomics: Multi-omics level data integration for accurate phenotype prediction journal August 2017
The complexity of alternative splicing and landscape of tissue-specific expression in lotus (Nelumbo nucifera) unveiled by Illumina- and single-molecule real-time-based RNA-sequencing journal June 2019
Characterization and analysis of the transcriptome in Gymnocypris selincuoensis on the Qinghai-Tibetan Plateau using single-molecule long-read sequencing and RNA-seq journal July 2019
Single-molecule, full-length transcript sequencing provides insight into the extreme metabolism of the ruby-throated hummingbird Archilochus colubris journal February 2018
Exploring the fate of mRNA in aging seeds: protection, destruction, or slow decay? journal June 2018
Analysis of Transcripts and splice isoforms in Red Clover (Trifolium pratense L.) by single-molecule long-read sequencing posted_content May 2018
Metabolic labeling of RNAs uncovers hidden features and dynamics of the Arabidopsis thaliana transcriptome posted_content March 2019
A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification posted_content March 2020
Illuminating the dark side of the human transcriptome with TAMA Iso-Seq analysis posted_content September 2019
High resolution annotation of zebrafish transcriptome using long-read sequencing journal July 2018
A comparative transcriptional landscape of maize and sorghum obtained by single-molecule sequencing journal April 2018
Plant ISOform sequencing database (PISO): a comprehensive repertory of full‐length transcripts in plants journal January 2019
Araport11: a complete reannotation of the Arabidopsis thaliana reference genome journal February 2017
Global identification of alternative splicing via comparative analysis of SMRT- and Illumina-based RNA-seq in strawberry journal February 2017
Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings journal June 2017
Comprehensive profiling of rhizome-associated alternative splicing and alternative polyadenylation in moso bamboo ( Phyllostachys edulis ) journal June 2017
De novo genome assembly of the stress tolerant forest species Casuarina equisetifolia provides insight into secondary growth journal December 2018
Wide‐ranging transcriptome remodelling mediated by alternative polyadenylation in response to abiotic stresses in Sorghum journal January 2020
ISOdb: A Comprehensive Database of Full-Length Isoforms Generated by Iso-Seq journal November 2018
Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human journal April 2017
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing journal May 2017
Full-length transcriptome sequences of ephemeral plant Arabidopsis pumila provides insight into gene expression dynamics during continuous salt stress journal September 2018
Genome-wide profiling of the alternative splicing provides insights into development in Plutella xylostella journal June 2019
Comparative transcriptome and coexpression network analysis of carpel quantitative variation in Paeonia rockii journal August 2019
Single-molecule real-time transcript sequencing identified flowering regulatory genes in Crocus sativus journal November 2019
SMRT sequencing of a full-length transcriptome reveals transcript variants involved in C18 unsaturated fatty acid biosynthesis and metabolism pathways at chilling temperature in Pennisetum giganteum journal January 2020
PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome journal July 2020
Full-length transcript sequencing and comparative transcriptomic analysis to evaluate the contribution of osmotic and ionic stress components towards salinity tolerance in the roots of cultivated alfalfa (Medicago sativa L.) journal January 2019
Reconstruction of the full-length transcriptome atlas using PacBio Iso-Seq provides insight into the alternative splicing in Gossypium australe journal August 2019
Single-molecule real-time sequencing facilitates the analysis of transcripts and splice isoforms of anthers in Chinese cabbage (Brassica rapa L. ssp. pekinensis) journal November 2019
Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the Arabidopsis transcriptome journal June 2020
Bridging the gap between reference and real transcriptomes journal June 2019
Transcriptome assembly from long-read RNA-seq alignments with StringTie2 journal December 2019
Opportunities and challenges in long-read sequencing data analysis journal February 2020
Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing journal June 2018
Comprehensive comparison of Pacific Biosciences and Oxford Nanopore Technologies and their applications to transcriptome analysis journal January 2017
Comprehensive comparison of Pacific Biosciences and Oxford Nanopore Technologies and their applications to transcriptome analysis journal January 2017
Screening and characterization of long noncoding RNAs involved in the albinism of Ananas comosus var. bracteatus leaves journal November 2019
Analysis of Transcriptome and Epitranscriptome in Plants Using PacBio Iso-Seq and Nanopore-Based Direct RNA Sequencing journal March 2019
Reviving the Transcriptome Studies: An Insight Into the Emergence of Single-Molecule Transcriptome Sequencing journal April 2019
Transcriptome Profiling Using Single-Molecule Direct RNA Sequencing Approach for In-depth Understanding of Genes in Secondary Metabolism Pathways of Camellia sinensis journal July 2017
Abiotic Stresses Modulate Landscape of Poplar Transcriptome via Alternative Splicing, Differential Intron Retention, and Isoform Ratio Switching journal February 2018
Molecular Mechanisms of Acclimatization to Phosphorus Starvation and Recovery Underlying Full-Length Transcriptome Profiling in Barley (Hordeum vulgare L.) journal April 2018
Expanding Alternative Splicing Identification by Integrating Multiple Sources of Transcription Data in Tomato journal May 2019
Large Scale Profiling of Protein Isoforms Using Label-Free Quantitative Proteomics Revealed the Regulation of Nonsense-Mediated Decay in Moso Bamboo (Phyllostachys edulis) journal July 2019
Isoform Sequencing and State-of-Art Applications for Unravelling Complexity of Plant Transcriptomes journal January 2018
Full-Length Transcriptome Sequencing and Different Chemotype Expression Profile Analysis of Genes Related to Monoterpenoid Biosynthesis in Cinnamomum porrectum journal December 2019
SMRT sequencing analysis reveals the full-length transcripts and alternative splicing patterns in Ananas comosus var. bracteatus journal January 2019
PacBio single-molecule long-read sequencing shed new light on the transcripts and splice isoforms of the perennial ryegrass journal January 2020
Analysis of transcripts and splice isoforms in Medicago sativa L. by single-molecule long-read sequencing journal January 2019
Piercing the dark matter: bioinformatics of long-range sequencing and mapping journal March 2018
A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing journal August 2017
A survey of transcriptome complexity in Sus scrofa using single-molecule long-read sequencing journal May 2018
Genome and Transcriptome Sequencing of the Astaxanthin-Producing Green Microalga, Haematococcus pluvialis journal November 2018
De novo clustering of long reads by gene from transcriptomics data journal September 2018
IsoCon: Deciphering highly similar multigene family transcripts from Iso-Seq data posted_content January 2018
Transcriptome assembly from long-read RNA-seq alignments with StringTie2 posted_content July 2019
A global survey of alternative splicing in allopolyploid cotton: landscape, complexity and regulation journal September 2017
The developmental dynamics of the Populus stem transcriptome journal June 2018
Genome assembly provides insights into the genome evolution and flowering regulation of orchardgrass journal July 2019
A survey of transcriptome complexity using PacBio single-molecule real-time analysis combined with Illumina RNA sequencing for a better understanding of ricinoleic acid biosynthesis in Ricinus communis journal June 2019
Combining next-generation sequencing and single-molecule sequencing to explore brown plant hopper responses to contrasting genotypes of japonica rice journal August 2019
Candidate genes for grape white rot resistance based on SMRT and Illumina sequencing journal November 2019
Root Hair Single Cell Type Specific Profiles of Gene Expression and Alternative Polyadenylation Under Cadmium Stress journal May 2019
PacBio Long-Read Sequencing Reveals the Transcriptomic Complexity and Aux/IAA Gene Evolution in Gnetum (Gnetales) journal November 2019
Investigation of RNA Editing Sites within Bound Regions of RNA-Binding Proteins journal November 2019
Dynamic Changes in Metabolite Accumulation and the Transcriptome during Leaf Growth and Development in Eucommia ulmoides journal August 2019
Transcriptome Analysis of Drought-Resistant and Drought-Sensitive Sorghum (Sorghum bicolor) Genotypes in Response to PEG-Induced Drought Stress journal January 2020
The Complete Chloroplast Genome Sequence of the Medicinal Plant Swertia mussotii Using the PacBio RS II Platform journal August 2016
Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon journal November 2018
Transcriptome profiling for floral development in reblooming cultivar ‘High Noon’ of Paeonia suffruticosa journal October 2019
Single-molecule long-read transcriptome profiling of Platysternon megacephalum mitochondrial genome with gene rearrangement and control region duplication journal September 2018
rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data journal September 2019
Detecting alternatively spliced transcript isoforms from single-molecule long-read sequences without a reference genome journal June 2017
Transcriptome analysis based on a combination of sequencing platforms provides insights into leaf pigmentation in Acer rubrum journal June 2019
SMRT Sequencing of a Full-length Transcriptome Reveals Transcript Variants Involved in C18 Unsaturated Fatty Acid Biosynthesis and Metabolism Pathways at Chilling Temperature in Pennisetum giganteum posted_content January 2020
PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice journal December 2018
ReorientExpress: reference-free orientation of nanopore cDNA reads with deep learning journal November 2019
SMRT Sequencing of a Full-length Transcriptome Reveals Transcript Variants Involved in C18 Unsaturated Fatty Acid Biosynthesis and Metabolism Pathways at Chilling Temperature in Pennisetum giganteum posted_content June 2019
Analysis of transcripts and splice isoforms in red clover (Trifolium pratense L.) by single-molecule long-read sequencing journal November 2018
Transcriptome analysis of heat stress and drought stress in pearl millet based on Pacbio full-length transcriptome sequencing journal July 2020
Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the Arabidopsis transcriptome posted_content June 2020
Transcriptomic profiles of 33 opium poppy samples in different tissues, growth phases, and cultivars journal May 2019
Long noncoding RNAs in the model species Brachypodium distachyon journal September 2017
Unveiling novel targets of paclitaxel resistance by single molecule long-read RNA sequencing in breast cancer journal April 2019
Variant phasing and haplotypic expression from long-read sequencing in maize journal February 2020
TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts journal June 2018
Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts journal August 2017
Inspecting abundantly expressed genes in male strobili in sugi (Cryptomeria japonica D. Don) via a highly accurate cDNA assembly journal April 2020
Hybrid sequencing reveals insight into heat sensing and signaling of bread wheat journal April 2019
Characterization and Functional Analysis of Polyadenylation Sites in Fast and Slow Muscles journal March 2020
Dual Platform Long-Read RNA-Sequencing Dataset of the Human Cytomegalovirus Lytic Transcriptome journal September 2018
Getting the Entire Message: Progress in Isoform Sequencing journal August 2019
Long Non-coding RNAs in Endothelial Biology journal May 2018
Integrative Analysis of Three RNA Sequencing Methods Identifies Mutually Exclusive Exons of MADS-Box Isoforms During Early Bud Development in Picea abies journal November 2018
Full-Length Transcriptome Assembly of Italian Ryegrass Root Integrated with RNA-Seq to Identify Genes in Response to Plant Cadmium Stress journal February 2020
Co-Expression Network Analysis of Spleen Transcriptome in Rock Bream (Oplegnathus fasciatus) Naturally Infected with Rock Bream Iridovirus (RBIV) journal March 2020
Discovery of Geranylgeranyl Pyrophosphate Synthase (GGPPS) Paralogs from Haematococcus pluvialis Based on Iso-Seq Analysis and Their Function on Astaxanthin Biosynthesis journal December 2019
Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies journal April 2017
Uncovering full-length transcript isoforms of sugarcane cultivar Khon Kaen 3 using single-molecule long-read sequencing journal October 2018