Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A haplotype-resolved reference genome for Eucalyptus grandis

Journal Article · · G3: Genes, Genomes, Genetics

Eucalyptus grandis is a hardwood tree used worldwide as pure species or hybrid partner to breed fast-growing plantation forestry crops that serve as feedstocks of timber and lignocellulosic biomass for pulp, paper, biomaterials, and biorefinery products. The current v2.0 genome reference for the species served as the first reference for the genus and has helped drive the development of molecular breeding tools for eucalypts. Using PacBio HiFi long reads and Omni-C proximity ligation sequencing, we produced an improved, haplotype-phased assembly (v4.0) for TAG0014, an early-generation selection of E. grandis. The 2 haplotypes are 571 Mbp (HAP1) and 552 Mbp (HAP2) in size and consist of 37 and 46 contigs scaffolded onto 11 chromosomes (contig N50 of 28.9 and 16.7 Mbp), respectively. These haplotype assemblies are 70-90 Mbp smaller than the diploid v2.0 assembly but capture all except one of the 22 telomeres, suggesting that substantial redundant sequence was included in the previous assembly. A total of 35,929 (HAP1) and 35,583 (HAP2) gene models were annotated, of which 438 and 472 contain long introns (>10 kbp) in gene models previously (v2.0) identified as multiple smaller genes. These and other improvements have increased gene annotation completeness levels from 93.8 to 99.4% in the v4.0 assembly. We found that 6,493 and 6,346 genes are within tandem duplicate arrays (HAP1 and HAP2, respectively, 18.4 and 17.8% of the total) and >43.8% of the haplotype assemblies consists of repeat elements. Analysis of synteny between the haplotypes and the E. grandis v2.0 reference genome revealed extensive regions of collinearity, but also some major rearrangements, and provided a preview of population and pangenome variation in the species.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
US Department of Energy; USDOE Office of Science (SC), Biological and Environmental Research (BER) (SC-23), Biological Systems Science Division (SC-23.2 )
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2573992
Journal Information:
G3: Genes, Genomes, Genetics, Journal Name: G3: Genes, Genomes, Genetics Journal Issue: 7 Vol. 15
Country of Publication:
United States
Language:
English

Similar Records

The genome of Eucalyptus grandis
Journal Article · Wed Jun 11 00:00:00 EDT 2014 · Nature (London) · OSTI ID:1148848

Four chromosome scale genomes and a pan-genome annotation to accelerate pecan tree breeding
Journal Article · Mon Jul 05 00:00:00 EDT 2021 · Nature Communications · OSTI ID:1816167

A chromosome-level genome assembly of the Chinese cork oak (Quercus variabilis)
Journal Article · Fri Sep 23 00:00:00 EDT 2022 · Frontiers in Plant Science · OSTI ID:1889313

Related Subjects