skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Sub genome anchored physical frameworks of the allotetraploid Upland cotton (Gossypium hirsutum L.) genome, and an approach toward reference-grade assemblies of polyploids

Journal Article · · Scientific Reports
 [1]; ORCiD logo [2];  [3];  [3];  [4];  [4]; ORCiD logo [3];  [5];  [6];  [7];  [8];  [6]; ORCiD logo [4]
  1. Clemson Univ., SC (United States)
  2. US Dept. of Agriculture-Agricultural Research Service (USDA-ARS), Stoneville, MS (United States). Genomics and Bioinformatics Research Unit
  3. Texas A & M Univ., College Station, TX (United States)
  4. Univ. of Texas, Austin, TX (United States)
  5. US Dept. of Agriculture-Agricultural Research Service (USDA-ARS), Stoneville, MS (United States). Crop Genetics Research Unit
  6. HudsonAlpha Inst. for Biotechnology, Huntsville, AL (United States)
  7. Cotton Inc., Cary, NC (United States). Agriculture and Environmental Research
  8. Mississippi State Univ., Mississippi State, MS (United States)

Like those of many agricultural crops, the cultivated cotton is an allotetraploid and has a large genome (~2.5 gigabase pairs). The two sub genomes, A and D, are highly similar but unequally sized and repeat-rich, which pose significant challenges for accurate genome reconstruction using standard approaches. Here we report the development of BAC libraries, sub genome specific physical maps, and a new-generation sequencing approach that will lead to a reference-grade genome assembly for Upland cotton. Three BAC libraries were constructed, fingerprinted, and integrated with BAC-end sequences (BES) to produce a de novo whole-genome physical map. The BAC map was partitioned by sub genomes through alignment to the diploid progenitor D-genome reference sequence with densely spaced BES anchor points and computational filtering. The physical maps were validated with FISH and genetic mapping of SNP markers derived from BES. Two pairs of homeologous chromosomes, A11/D11 and A12/D12, were used to assess multiplex sequencing approaches for completeness and scalability. The results represent the first sub genome anchored physical maps of Upland cotton, and a new-generation approach to the whole-genome sequencing, which will lead to the reference-grade assembly of allopolyploid cotton and serve as a general strategy for sequencing other polyploid species.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC); National Science Foundation (NSF)
OSTI ID:
1543775
Journal Information:
Scientific Reports, Vol. 7, Issue 1; ISSN 2045-2322
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 18 works
Citation information provided by
Web of Science

References (58)

A genetically anchored physical framework for Theobroma cacao cv. Matina 1-6 journal August 2011
Microscopy and Bioinformatic Analyses of Lipid Metabolism Implicate a Sporophytic Signaling Network Supporting Pollen Development in Arabidopsis journal July 2008
Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome journal August 2014
Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp. journal April 2015
Five-Color-Based High-Information-Content Fingerprinting of Bacterial Artificial Chromosome Clones Using Type IIS Restriction Endonucleases journal June 2001
Duplicate gene evolution, homoeologous recombination, and transcriptome characterization in allopolyploid cotton journal January 2012
A comprehensive meta QTL analysis for fiber quality, yield, yield related and morphological traits, drought tolerance, and disease resistance in tetraploid cotton journal January 2013
The B73 Maize Genome: Complexity, Diversity, and Dynamics journal November 2009
Trimmomatic: a flexible trimmer for Illumina sequence data journal April 2014
BAC Sequencing Using Pooled Methods book September 2014
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs journal September 1997
An Improved Approach for Construction of Bacterial Artificial Chromosome Libraries journal August 1998
A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome journal July 2014
Chromosome structural changes in diploid and tetraploid A genomes of Gossypium journal April 2006
Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres journal December 2012
Estimation of the Nuclear DNA Content of Gossypium Species journal February 2005
The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution journal March 2013
De novo assembly of human genomes with massively parallel short read sequencing journal December 2009
The genome sequence of Sea-Island cotton (Gossypium barbadense) provides insights into the allopolyploidization and development of superior spinnable fibres journal December 2015
Construction and Identification of Bacterial Artificial Chromosome Library for 0-613-2R in Upland Cotton journal February 2006
Development of molecular markers for genetic male sterility in Gossypium hirsutum journal June 2015
Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement journal April 2015
The tobacco genome sequence and its comparison with those of tomato and potato journal May 2014
Genome sequence of the cultivated cotton Gossypium arboreum journal May 2014
Genome sequence of the palaeopolyploid soybean journal January 2010
Cytochrome b5 Reductase Encoded by CBR1 Is Essential for a Functional Male Gametophyte in Arabidopsis journal August 2013
Assembly algorithms for next-generation sequencing data journal June 2010
Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution journal April 2015
Construction of a bacterial artificial chromosome library for Gossypium herbaceum var. africanum journal June 2013
Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss journal February 2011
Allele-Specific, Bidirectional Silencing of an Alcohol Dehydrogenase Gene in Different Organs of Interspecific Diploid Cotton Hybrids journal September 2005
Molecular mapping of genic male-sterile genes ms 15 , ms 5 and ms 6 in tetraploid cotton journal April 2009
The draft genome of a diploid cotton Gossypium raimondii journal August 2012
Functional analysis of Hsp70 superfamily proteins of rice (Oryza sativa) journal December 2012
Serine/threonine protein phosphatases: Multi-purpose enzymes in control of defense mechanisms journal December 2011
Genetic Diversity in Gossypium hirsutum and the Origin of Upland Cotton journal November 1992
Dissecting the genome of the polyploid crop oilseed rape by transcriptome sequencing journal July 2011
Consed: A Graphical Tool for Sequence Finishing journal March 1998
Plant disease resistance genes: Current status and future directions journal April 2012
ABySS: A parallel assembler for short read sequence data journal February 2009
The coffee genome provides insight into the convergent evolution of caffeine biosynthesis journal September 2014
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana journal December 2000
The Origin of American Tetraploid Gossypium Species journal May 1940
Gene Expression Changes and Early Events in Cotton Fibre Development journal September 2007
Construction of a plant-transformation-competent BIBAC library and genome sequence analysis of polyploid Upland cotton (Gossypium hirsutum L.) journal January 2013
A cysteine-rich receptor-like kinase NCRK and a pathogen-induced protein kinase RBK1 are Rop GTPase interactors: Rop GTPases interacting with protein kinases journal December 2007
Construction and characterization of a bacterial artificial chromosome library for the allotetraploid Gossypium tomentosum journal January 2015
The wondrous cycles of polyploidy in plants journal October 2015
Cytological studies in cotton: IV. Chromosome conjugation in interspecific hybrids journal February 1937
Toward Sequencing Cotton ( Gossypium ) Genomes: Figure 1. journal December 2007
Prospecting for Genes involved in transcriptional regulation of plant defenses, a bioinformatics approach journal January 2011
Comparative development of fiber in wild and cultivated cotton journal January 2001
A Draft Sequence of the Rice Genome (Oryza sativa L. ssp. japonica) journal April 2002
Roles for mannitol and mannitol dehydrogenase in active oxygen-mediated plant defense journal December 1998
Contigs Built with Fingerprints, Markers, and FPC V4.7 journal November 2000
BAC-End Sequence-Based SNP Mining in Allotetraploid Cotton (Gossypium) Utilizing Resequencing Data, Phylogenetic Inferences, and Perspectives for Genetic Mapping journal April 2015
Construction of integrated genetic linkage maps by means of a new computer package: Join Map journal May 1993
Cotton: Origin, History, Technology, and Production journal October 2000

Cited By (6)

Cotton CENTRORADIALIS/TERMINAL FLOWER 1/SELF-PRUNING genes functionally diverged to differentially impact plant architecture journal September 2018
Impact of Chromosomal Rearrangements on the Interpretation of Lupin Karyotype Evolution journal April 2019
Unraveling cis and trans regulatory evolution during cotton domestication journal November 2019
Functional divergence of cellulose synthase orthologs in between wild Gossypium raimondii and domesticated G. arboreum diploid cotton species journal September 2019
Genetic Analysis of the Transition from Wild to Domesticated Cotton ( Gossypium hirsutum L.) journal February 2020
Identification of cotton MOTHER OF FT AND TFL1 homologs, GhMFT1 and GhMFT2, involved in seed germination journal April 2019