Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Generation of a chromosome-scale genome assembly of the insect-repellent terpenoid-producing Lamiaceae species, Callicarpa americana

Journal Article · · GigaScience
 [1];  [2];  [3];  [3];  [2];  [3];  [3];  [3];  [3];  [2];  [2];  [3];  [3]
  1. Michigan State Univ., East Lansing, MI (United States); Michigan State University Department of Plant Biology
  2. Univ. of Florida, Gainesville, FL (United States)
  3. Michigan State Univ., East Lansing, MI (United States)
Plants exhibit wide chemical diversity due to the production of specialized metabolites that function as pollinator attractants, defensive compounds, and signaling molecules. Lamiaceae (mints) are known for their chemodiversity and have been cultivated for use as culinary herbs, as well as sources of insect repellents, health-promoting compounds, and fragrance. We report the chromosome-scale genome assembly of Callicarpa americana L. (American beautyberry), a species within the early-diverging Callicarpoideae clade of Lamiaceae, known for its metallic purple fruits and use as an insect repellent due to its production of terpenoids. Using long-read sequencing and Hi-C scaffolding, we generated a 506.1-Mb assembly spanning 17 pseudomolecules with N50 contig and N50 scaffold sizes of 7.5 and 29.0 Mb, respectively. In all, 32,164 genes were annotated, including 53 candidate terpene synthases and 47 putative clusters of specialized metabolite biosynthetic pathways. Our analyses revealed 3 putative whole-genome duplication events, which, together with local tandem duplications, contributed to gene family expansion of terpene synthases. Kolavenyl diphosphate is a gateway to many of the bioactive terpenoids in C. americana; experimental validation confirmed that CamTPS2 encodes kolavenyl diphosphate synthase. Syntenic analyses with Tectona grandis L. f. (teak), a member of the Tectonoideae clade of Lamiaceae known for exceptionally strong wood resistant to insects, revealed 963 collinear blocks and 21,297 C. americana syntelogs. Access to the C. americana genome provides a road map for rapid discovery of genes encoding plant-derived agrichemicals and a key resource for understanding the evolution of chemical diversity in Lamiaceae.
Research Organization:
Great Lakes Bioenergy Research Center, Madison, WI (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
FC02-07ER64494; SC0018409
OSTI ID:
1764716
Journal Information:
GigaScience, Journal Name: GigaScience Journal Issue: 9 Vol. 9; ISSN 2047-217X
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English

References (53)

GC-MS data set for Generation of a chromosome-scale genome assembly of the insect-repellant terpenoid-producing Lamiaceae species, Callicarpa americana dataset January 2020
Expanding the Landscape of Diterpene Structural Diversity through Stereochemically Controlled Combinatorial Biosynthesis journal January 2016
De novo sequencing of the Lavandula angustifolia genome reveals highly duplicated and optimized features for essential oil production journal September 2018
The Maize An2 Gene is Induced by Fusarium Attack and Encodes an ent-Copalyl Diphosphate Synthase journal December 2005
Analysis of the Genome Sequence of the Medicinal Plant Salvia miltiorrhiza journal June 2016
Phylogenomic Mining of the Mints Reveals Multiple Mechanisms Contributing to the Evolution of Chemical Diversity in Lamiaceae journal August 2018
The Reference Genome Sequence of Scutellaria baicalensis Provides Insights into the Evolution of Wogonin Biosynthesis journal July 2019
Isolation and Identification of Mosquito Bite Deterrent Terpenoids from Leaves of American ( Callicarpa americana ) and Japanese ( Callicarpa japonica ) Beautyberry journal July 2005
Full-length transcriptome assembly from RNA-Seq data without a reference genome journal May 2011
Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions journal November 2013
Near-optimal probabilistic RNA-seq quantification journal April 2016
Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome journal March 2017
HISAT: a fast spliced aligner with low memory requirements journal March 2015
Chromosome-scale scaffolding of the black raspberry (Rubus occidentalis L.) genome based on chromatin interaction data journal February 2018
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype journal August 2019
SiZer for Exploration of Structures in Curves journal September 1999
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers journal January 2011
InterProScan 5: genome-scale protein function classification journal January 2014
BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs journal June 2015
GenomeScope: fast reference-free genome profiling from short reads journal March 2017
A Comprehensive Survey on the Terpene Synthase Gene Family Provides New Insight into Its Evolutionary Patterns journal July 2019
High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant journal June 2018
A chromosomal-scale genome assembly of Tectona grandis reveals the importance of tandem gene duplication and enables discovery of genes in natural product biosynthetic pathways journal January 2019
A (–)-kolavenyl diphosphate synthase catalyzes the first step of salvinorin A biosynthesis in Salvia divinorum journal February 2017
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies journal October 2003
MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity journal January 2012
The Pfam protein families database: towards a more sustainable future journal December 2015
plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters journal April 2017
Plant P450s as versatile drivers for evolution of species-specific chemical diversity journal February 2013
Canu: scalable and accurate long-read assembly via adaptive k -mer weighting and repeat separation journal March 2017
Manoyl Oxide (13R), the Biosynthetic Precursor of Forskolin, Is Synthesized in Specialized Root Cork Cells in Coleus forskohlii journal January 2014
MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations journal December 2013
Comparative transcriptomics of three Poaceae species reveals patterns of gene expression evolution: Comparative transcriptome analyses in grasses journal June 2012
Plant metabolic clusters – from genetics to genomics journal April 2016
Drivers of metabolic diversification: how dynamic genomic neighbourhoods generate new biosynthetic pathways in the Brassicaceae journal December 2019
The terpene synthase gene family in Tripterygium wilfordii harbors a labdane-type diterpene synthase among the monoterpene synthase TPS-b subfamily journal February 2017
Araport11: a complete reannotation of the Arabidopsis thaliana reference genome journal February 2017
Biosynthesis of the psychotropic plant diterpene salvinorin A: Discovery and characterization of the Salvia divinorum clerodienyl diphosphate synthase journal February 2017
The Amborella Genome and the Evolution of Flowering Plants journal December 2013
Keeping the Bugs at Bay journal July 2006
Repbase Update, a database of eukaryotic repetitive elements journal January 2005
Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis journal December 2006
Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data journal January 2013
OrthoFinder: phylogenetic orthology inference for comparative genomics journal November 2019
Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement journal November 2014
Cutadapt removes adapter sequences from high-throughput sequencing reads journal May 2011
Comparative Oligo-FISH Mapping: An Efficient and Powerful Methodology To Reveal Karyotypic and Chromosomal Evolution journal December 2017
mixtools : An R Package for Analyzing Finite Mixture Models journal January 2009
Biologically Active Natural Products of the Genus Callicarpa journal June 2008
EvoPipes.net: Bioinformatic Tools for Ecological and Evolutionary Genomics journal January 2010
Generation of a chromosome-scale genome assembly of the insect-repellant terpenoid-producing Lamiaceae species, Callicarpa americana dataset January 2020
Phylotranscriptomic analyses reveal asymmetrical gene duplication dynamics and signatures of ancient polyploidy in mints dataset January 2019
Supporting data for "Generation of a chromosome-scale genome assembly of the insect-repellent terpenoid-producing Lamiaceae species, Callicarpa americana" dataset January 2020