U.S. Department of Energy
Office of Scientific and Technical Information

Nucleotide polymorphism and copy number variant detection using exome capture and next-generation sequencing in the polyploid grass Panicum virgatum

Journal Article · The Plant Journal
DOI: https://doi.org/10.1111/tpj.12601 · OSTI ID: 1625935
 [1];  [2];  [2];  [2];  [2];  [3];  [4];  [4];  [4];  [2];  [5];  [6]
  1. Michigan State Univ., East Lansing, MI (United States). Dept. of Energy Great Lakes Bioenergy Research Center; Michigan State Univ., East Lansing, MI (United States). Dept. of Plant Biology; DOE/OSTI
  2. Michigan State Univ., East Lansing, MI (United States). Dept. of Energy Great Lakes Bioenergy Research Center; Michigan State Univ., East Lansing, MI (United States). Dept. of Plant Biology
  3. Great Lakes Bioenergy Research Center (GLBRC), Madison, WI (United States); US Dept. of Agriculture (USDA), Madison, WI (United States). Agricultural Research Service (ARS). US Dairy Forage Research Center
  4. Roche-NimbleGen, Madison, WI (United States)
  5. Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison WI 53706 USA; US Dairy Forage Research Center, USDA-ARS, 1925 Linden Dr. Madison WI 53706-1108 USA
  6. Department of Energy Great Lakes Bioenergy Research Center, Michigan State University, East Lansing MI 48824 USA; Department of Plant Biology, Michigan State University, East Lansing MI 48824 USA

Switchgrass (Panicum virgatum) is a polyploid, outcrossing grass species native to North America that has recently been recognized as a potential biofuel feedstock crop. Significant phenotypic variation, including ploidy, is present across the two primary ecotypes of switchgrass, referred to as upland and lowland switchgrass. The tetraploid switchgrass genome is approximately 1400 Mbp, split between two subgenomes, with significant repetitive sequence content limiting the efficiency of re-sequencing approaches for determining genome diversity. To characterize genetic diversity in upland and lowland switchgrass as a first step in linking genotype to phenotype, we designed an exome capture probe set based on transcript assemblies that represent approximately 50 Mb of annotated switchgrass exome sequences. We then evaluated and optimized the probe set using solid-phase comparative genome hybridization and liquid-phase exome capture followed by next-generation sequencing. Using the optimized probe set, we assessed variation in the exomes of eight switchgrass genotypes representing tetraploid lowland and octoploid upland cultivars to benchmark our exome capture probe set design. We identified ample variation in the switchgrass genome, including 1,395,501 single nucleotide polymorphisms (SNPs), 8173 putative copy number variants and 3336 presence/absence variants. While the majority of the detected SNPs (84%) were bi-allelic, a substantial number were tri-allelic, with limited occurrence of tetra-allelic polymorphisms, consistent with the heterozygous and polyploid nature of the switchgrass genome. Collectively, these data demonstrate the efficacy of exome capture for discovery of genome variation in a polyploid species with a large, repetitive and heterozygous genome.
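
The bi-, tri- and tetra-allelic categories mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the input format (a list of base calls pooled across genotypes for each site) and the names `classify_site` and `summarize` are assumptions made purely for illustration.

```python
from collections import Counter

# Labels keyed by the number of distinct alleles observed at a site.
ALLELE_CLASS = {2: "bi-allelic", 3: "tri-allelic", 4: "tetra-allelic"}
VALID_BASES = set("ACGT")


def classify_site(alleles):
    """Classify one site by how many distinct bases were observed.

    `alleles` is an iterable of single-base calls pooled across genotypes
    (an assumed input format, not taken from the article).
    """
    distinct = {a.upper() for a in alleles} & VALID_BASES
    # Fewer than two distinct bases means the site is not polymorphic.
    return ALLELE_CLASS.get(len(distinct), "monomorphic")


def summarize(sites):
    """Count how many sites fall into each allelic class."""
    return Counter(classify_site(site) for site in sites)


# Toy data, invented for this example only.
toy_sites = [
    ["A", "A", "G"],       # two distinct bases
    ["A", "C", "T"],       # three distinct bases
    ["A", "C", "G", "T"],  # four distinct bases
    ["A", "A"],            # one distinct base
]
summary = summarize(toy_sites)
```

In a polyploid, heterozygous genome more than two alleles can segregate at a single site, which is why the three- and four-allele classes are tracked separately here rather than being discarded as errors.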

Research Organization:
Univ. of Wisconsin, Madison, WI (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
Grant/Contract Number:
FC02-07ER64494
OSTI ID:
1625935
Journal Information:
The Plant Journal, Vol. 79, Issue 6; ISSN 0960-7412
Publisher:
Society for Experimental Biology
Country of Publication:
United States
Language:
English
