Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms
Abstract
Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.
- Authors:
-
- PSL Univ., Paris (France)
- Wellcome Trust Genome Campus, Cambridge (United Kingdom)
- Univ. Paris-Saclay, Versailles (France)
- Rutgers Univ., Newark, NJ (United States)
- J. Craig Venter Institute Inc., Rockville, MD (United States)
- J. Craig Venter Institute Inc., Rockville, MD (United States); Univ. of California, San Diego, CA (United States)
- Publication Date:
- Research Org.:
- J. Craig Venter Institute Inc., Rockville, MD (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1500040
- Grant/Contract Number:
- SC0008593
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Scientific Reports
- Additional Journal Information:
- Journal Volume: 8; Journal Issue: 1; Journal ID: ISSN 2045-2322
- Publisher:
- Nature Publishing Group
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES
Citation Formats
Rastogi, Achal, Maheswari, Uma, Dorrell, Richard G., Vieira, Fabio Rocha Jimenez, Maumus, Florian, Kustka, Adam, McCarthy, James, Allen, Andy E., Kersey, Paul, Bowler, Chris, and Tirichine, Leila. Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms. United States: N. p., 2018.
Web. doi:10.1038/s41598-018-23106-x.
Rastogi, Achal, Maheswari, Uma, Dorrell, Richard G., Vieira, Fabio Rocha Jimenez, Maumus, Florian, Kustka, Adam, McCarthy, James, Allen, Andy E., Kersey, Paul, Bowler, Chris, & Tirichine, Leila. Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms. United States. https://doi.org/10.1038/s41598-018-23106-x
Rastogi, Achal, Maheswari, Uma, Dorrell, Richard G., Vieira, Fabio Rocha Jimenez, Maumus, Florian, Kustka, Adam, McCarthy, James, Allen, Andy E., Kersey, Paul, Bowler, Chris, and Tirichine, Leila. Mon .
"Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms". United States. https://doi.org/10.1038/s41598-018-23106-x. https://www.osti.gov/servlets/purl/1500040.
@article{osti_1500040,
title = {Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms},
author = {Rastogi, Achal and Maheswari, Uma and Dorrell, Richard G. and Vieira, Fabio Rocha Jimenez and Maumus, Florian and Kustka, Adam and McCarthy, James and Allen, Andy E. and Kersey, Paul and Bowler, Chris and Tirichine, Leila},
abstractNote = {Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology.},
doi = {10.1038/s41598-018-23106-x},
journal = {Scientific Reports},
number = 1,
volume = 8,
place = {United States},
year = {Mon Mar 19 00:00:00 EDT 2018},
month = {Mon Mar 19 00:00:00 EDT 2018}
}
Web of Science
Works referenced in this record:
A multi-objective optimization approach accurately resolves protein domain architectures
journal, October 2015
- Bernardes, J. S.; Vieira, F. R. J.; Zaverucha, G.
- Bioinformatics, Vol. 32, Issue 3
Glycosyltransferase Family 43 Is Also Found in Early Eukaryotes and Has Three Subfamilies in Charophycean Green Algae
journal, May 2015
- Taujale, Rahil; Yin, Yanbin
- PLOS ONE, Vol. 10, Issue 5
Ensembl Genomes 2016: more genomes, more complexity
journal, November 2015
- Kersey, Paul Julian; Allen, James E.; Armean, Irina
- Nucleic Acids Research, Vol. 44, Issue D1
Short interspersed elements (SINEs) are a major source of canine genomic diversity
journal, December 2005
- Wang, W.
- Genome Research, Vol. 15, Issue 12
MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects
journal, December 2011
- Holt, Carson; Yandell, Mark
- BMC Bioinformatics, Vol. 12, Issue 1
An integrative analysis of post-translational histone modifications in the marine diatom Phaeodactylum tricornutum
journal, May 2015
- Veluchamy, Alaguraj; Rastogi, Achal; Lin, Xin
- Genome Biology, Vol. 16, Issue 1
Probing the evolutionary history of epigenetic mechanisms: what can we learn from marine diatoms
journal, January 2015
- Rastogi, Achal; Lin, Xin; Lombard, Bérangère
- AIMS Genetics, Vol. 2, Issue 3
Dnmt1-Independent CG Methylation Contributes to Nucleosome Positioning in Diverse Eukaryotes
journal, March 2014
- Huff, Jason T.; Zilberman, Daniel
- Cell, Vol. 156, Issue 6
The Genome of the Diatom Thalassiosira Pseudonana: Ecology, Evolution, and Metabolism
journal, October 2004
- Armbrust, E. V.
- Science, Vol. 306, Issue 5693
Reverse transcriptase genes are highly abundant and transcriptionally active in marine plankton assemblages
journal, November 2015
- Lescot, Magali; Hingamp, Pascal; Kojima, Kenji K.
- The ISME Journal, Vol. 10, Issue 5
Comparative genomic analysis of fungal genomes reveals intron-rich ancestors
journal, January 2007
- Stajich, Jason E.; Dietrich, Fred S.; Roy, Scott W.
- Genome Biology, Vol. 8, Issue 10
Genomic Footprints of a Cryptic Plastid Endosymbiosis in Diatoms
journal, June 2009
- Moustafa, A.; Beszteri, B.; Maier, U. G.
- Science, Vol. 324, Issue 5935
Horizontal gene transfer in eukaryotic evolution
journal, August 2008
- Keeling, Patrick J.; Palmer, Jeffrey D.
- Nature Reviews Genetics, Vol. 9, Issue 8
HECTAR: A method to predict subcellular targeting in heterokonts
journal, January 2008
- Gschloessl, Bernhard; Guermeur, Yann; Cock, J. Mark
- BMC Bioinformatics, Vol. 9, Issue 1
Fast and SNP-tolerant detection of complex variants and splicing in short reads
journal, February 2010
- Wu, T. D.; Nacu, S.
- Bioinformatics, Vol. 26, Issue 7, p. 873-881
Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
journal, January 2017
- Ruffier, Magali; Kähäri, Andreas; Komorowska, Monika
- Database, Vol. 2017
Regulatory activities of transposable elements: from conflicts to benefits
journal, November 2016
- Chuong, Edward B.; Elde, Nels C.; Feschotte, Cédric
- Nature Reviews Genetics, Vol. 18, Issue 2
The Phaeodactylum genome reveals the evolutionary history of diatom genomes
journal, October 2008
- Bowler, Chris; Allen, Andrew E.; Badger, Jonathan H.
- Nature, Vol. 456, Issue 7219
Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage
journal, January 2015
- Gruber, Ansgar; Rocap, Gabrielle; Kroth, Peter G.
- The Plant Journal, Vol. 81, Issue 3
Autophagy in plants and algae
journal, December 2014
- Bassham, Diane C.; Crespo, Jose L.
- Frontiers in Plant Science, Vol. 5
Transposable Element Domestication As an Adaptation to Evolutionary Conflicts
journal, November 2017
- Jangam, Diwash; Feschotte, Cédric; Betrán, Esther
- Trends in Genetics, Vol. 33, Issue 11
Improvement in Protein Domain Identification Is Reached by Breaking Consensus, with the Agreement of Many Profiles and Domain Co-occurrence
journal, July 2016
- Bernardes, Juliana; Zaverucha, Gerson; Vaquero, Catherine
- PLOS Computational Biology, Vol. 12, Issue 7
Considering Transposable Element Diversification in De Novo Annotation Approaches
journal, January 2011
- Flutre, Timothée; Duprat, Elodie; Feuillet, Catherine
- PLoS ONE, Vol. 6, Issue 1
Widespread intron retention in mammals functionally tunes transcriptomes
journal, September 2014
- Braunschweig, Ulrich; Barbosa-Morais, Nuno L.; Pan, Qun
- Genome Research, Vol. 24, Issue 11
Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses
journal, April 2012
- Jourdren, Laurent; Bernard, Maria; Dillies, Marie-Agnès
- Bioinformatics, Vol. 28, Issue 11
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks
journal, March 2012
- Trapnell, Cole; Roberts, Adam; Goff, Loyal
- Nature Protocols, Vol. 7, Issue 3
Reevaluating the Green Contribution to Diatom Genomes
journal, January 2012
- Deschamps, Philippe; Moreira, David
- Genome Biology and Evolution, Vol. 4, Issue 7
N -Glycans of Phaeodactylum tricornutum Diatom and Functional Characterization of Its N -Acetylglucosaminyltransferase I Enzyme
journal, December 2010
- Baïet, Bérengère; Burel, Carole; Saint-Jean, Bruno
- Journal of Biological Chemistry, Vol. 286, Issue 8
InterPro: the integrative protein signature database
journal, January 2009
- Hunter, S.; Apweiler, R.; Attwood, T. K.
- Nucleic Acids Research, Vol. 37, Issue Database
Oceanographic and Biogeochemical Insights from Diatom Genomes
journal, January 2010
- Bowler, Chris; Vardi, Assaf; Allen, Andrew E.
- Annual Review of Marine Science, Vol. 2, Issue 1
Epigenetic regulation of intragenic transposable elements impacts gene transcription in Arabidopsis thaliana
journal, March 2015
- Le, Tu N.; Miyazaki, Yuji; Takuno, Shohei
- Nucleic Acids Research, Vol. 43, Issue 8
The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing
journal, June 2014
- Keeling, Patrick J.; Burki, Fabien; Wilcox, Heather M.
- PLoS Biology, Vol. 12, Issue 6
UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches
journal, November 2014
- Suzek, B. E.; Wang, Y.; Huang, H.
- Bioinformatics, Vol. 31, Issue 6
The Evolution of Silicon Transport in Eukaryotes
journal, October 2016
- Marron, Alan O.; Ratcliffe, Sarah; Wheeler, Glen L.
- Molecular Biology and Evolution, Vol. 33, Issue 12
Amino Acid Biosynthesis Pathways in Diatoms
journal, April 2013
- Bromke, Mariusz
- Metabolites, Vol. 3, Issue 2
Digital expression profiling of novel diatom transcripts provides insight into their biological functions
journal, January 2010
- Maheswari, Uma; Jabbari, Kamel; Petit, Jean-Louis
- Genome Biology, Vol. 11, Issue 8
Gene prediction with a hidden Markov model and a new intron submodel
journal, September 2003
- Stanke, M.; Waack, S.
- Bioinformatics, Vol. 19, Issue Suppl 2
Cross-kingdom patterns of alternative splicing and splice recognition
journal, January 2008
- McGuire, Abigail M.; Pearson, Matthew D.; Neafsey, Daniel E.
- Genome Biology, Vol. 9, Issue 3
Zinc finger proteins: new insights into structural and functional diversity
journal, February 2001
- Laity, John H.; Lee, Brian M.; Wright, Peter E.
- Current Opinion in Structural Biology, Vol. 11, Issue 1
Efficient mapping of Applied Biosystems SOLiD sequence data to a reference genome for functional genomic applications
journal, October 2008
- Ondov, Brian D.; Varadarajan, Anjana; Passalacqua, Karla D.
- Bioinformatics, Vol. 24, Issue 23
Finding a partner in the ocean: molecular and evolutionary bases of the response to sexual cues in a planktonic diatom
journal, April 2017
- Basu, Swaraj; Patil, Shrikant; Mapleson, Daniel
- New Phytologist, Vol. 215, Issue 1
The ankyrin repeat: a diversity of interactions on a common structural framework
journal, August 1999
- Sedgwick, Steven G.; Smerdon, Stephen J.
- Trends in Biochemical Sciences, Vol. 24, Issue 8
Insights into global diatom distribution and diversity in the world’s ocean
journal, February 2016
- Malviya, Shruti; Scalco, Eleonora; Audic, Stéphane
- Proceedings of the National Academy of Sciences, Vol. 113, Issue 11
Primary Production of the Biosphere: Integrating Terrestrial and Oceanic Components
journal, July 1998
- Field, C. B.
- Science, Vol. 281, Issue 5374
De novo identification of repeat families in large genomes
journal, June 2005
- Price, A. L.; Jones, N. C.; Pevzner, P. A.
- Bioinformatics, Vol. 21, Issue Suppl 1
Insights into the role of DNA methylation in diatoms by genome-wide profiling in Phaeodactylum tricornutum
journal, July 2013
- Veluchamy, Alaguraj; Lin, Xin; Maumus, Florian
- Nature Communications, Vol. 4, Issue 1
Evaluation and improvements of clustering algorithms for detecting remote homologous protein families
journal, February 2015
- Bernardes, Juliana S.; Vieira, Fabio RJ; Costa, Lygia MM
- BMC Bioinformatics, Vol. 16, Issue 1
The Revised Classification of Eukaryotes
journal, September 2012
- Adl, Sina M.; Simpson, Alastair G. B.; Lane, Christopher E.
- Journal of Eukaryotic Microbiology, Vol. 59, Issue 5
The plastid genome of some eustigmatophyte algae harbours a bacteria-derived six-gene cluster for biosynthesis of a novel secondary metabolite
journal, November 2016
- Yurchenko, Tatiana; Ševčíková, Tereza; Strnad, Hynek
- Open Biology, Vol. 6, Issue 11
Origin and evolution of SINEs in eukaryotic genomes
journal, June 2011
- Kramerov, D. A.; Vassetzky, N. S.
- Heredity, Vol. 107, Issue 6
The Evolution of Intron Size in Amniotes: A Role for Powered Flight?
journal, January 2012
- Zhang, Qu; Edwards, Scott V.
- Genome Biology and Evolution, Vol. 4, Issue 10
Recent progress in diatom genomics and epigenomics
journal, April 2017
- Tirichine, Leila; Rastogi, Achal; Bowler, Chris
- Current Opinion in Plant Biology, Vol. 36
Evolutionary Dynamics of Intron Size, Genome Size, and Physiological Correlates in Archosaurs
journal, November 2002
- Waltari, Eric; Edwards, Scott V.
- The American Naturalist, Vol. 160, Issue 5
Algal endosymbionts as vectors of horizontal gene transfer in photosynthetic eukaryotes
journal, January 2013
- Qiu, Huan; Yoon, Hwan Su; Bhattacharya, Debashish
- Frontiers in Plant Science, Vol. 4
Endosymbiotic origin and differential loss of eukaryotic genes
journal, August 2015
- Ku, Chuan; Nelson-Sathi, Shijulal; Roettger, Mayo
- Nature, Vol. 524, Issue 7566
Web Apollo: a web-based genomic annotation editing platform
journal, January 2013
- Lee, Eduardo; Helt, Gregg A.; Reese, Justin T.
- Genome Biology, Vol. 14, Issue 8
The Response of Diatom Central Carbon Metabolism to Nitrogen Starvation Is Different from That of Green Algae and Higher Plants
journal, November 2011
- Hockin, Nicola Louise; Mock, Thomas; Mulholland, Francis
- Plant Physiology, Vol. 158, Issue 1
Loss of Nucleosomal DNA Condensation Coincides with Appearance of a Novel Nuclear Protein in Dinoflagellates
journal, December 2012
- Gornik, Sebastian G.; Ford, Kristina L.; Mulhern, Terrence D.
- Current Biology, Vol. 22, Issue 24
Cell-cycle response to nutrient starvation in two phytoplankton species, Thalassiosira weissflogii and Hymenomonas carterae
journal, September 1987
- Vaulot, D.; Olson, R. J.; Merkel, S.
- Marine Biology, Vol. 95, Issue 4
Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus
journal, January 2017
- Mock, Thomas; Otillar, Robert P.; Strauss, Jan
- Nature, Vol. 541, Issue 7638
Eukaryotic plankton diversity in the sunlit ocean
journal, May 2015
- de Vargas, C.; Audic, S.; Henry, N.
- Science, Vol. 348, Issue 6237
Protein networks identify novel symbiogenetic genes resulting from plastid endosymbiosis
journal, March 2016
- Méheust, Raphaël; Zelzion, Ehud; Bhattacharya, Debashish
- Proceedings of the National Academy of Sciences, Vol. 113, Issue 13
Repbase Update, a database of eukaryotic repetitive elements
journal, January 2005
- Jurka, J.; Kapitonov, V. V.; Pavlicek, A.
- Cytogenetic and Genome Research, Vol. 110, Issue 1-4
Chimeric origins of ochrophytes and haptophytes revealed through an ancient plastid proteome
journal, May 2017
- Dorrell, Richard G.; Gile, Gillian; McCallum, Giselle
- eLife, Vol. 6
Potential impact of stress activated retrotransposons on genome evolution in a marine diatom
journal, January 2009
- Maumus, Florian; Allen, Andrew E.; Mhiri, Corinne
- BMC Genomics, Vol. 10, Issue 1
Comparative genomic analysis of fungal genomes reveals intron-rich ancestors
text, January 2007
- S., Dietrich, Fred; E., Stajich, Jason; W., Roy, Scott
- BioMed Central Ltd
The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing
text, January 2014
- Keeling, Patrick J.; Burki, Fabien; Wilcox, Heather M.
- Columbia University
The Evolution of Silicon Transport in Eukaryotes.
text, January 2016
- Marron, Alan; Ratcliffe, Sarah; Wheeler, Glen L.
- Apollo - University of Cambridge Repository
Works referencing / citing this record:
Genome streamlining via complete loss of introns has occurred multiple times in lichenized fungal mitochondria
journal, March 2019
- Pogoda, Cloe S.; Keepers, Kyle G.; Nadiadi, Arif Y.
- Ecology and Evolution, Vol. 9, Issue 7
A genomics approach reveals the global genetic polymorphism, structure, and functional diversity of ten accessions of the marine model diatom Phaeodactylum tricornutum
journal, October 2019
- Rastogi, Achal; Vieira, Fabio Rocha Jimenez; Deton-Cabanillas, Anne-Flore
- The ISME Journal, Vol. 14, Issue 2
De novo transcriptome assembly and analysis of the freshwater araphid diatom Fragilaria radians, Lake Baikal
journal, September 2019
- Galachyants, Yuri Pavlovich; Zakharova, Yulia Robertovna; Volokitina, Nadezda Antonovna
- Scientific Data, Vol. 6, Issue 1
Comparative in depth RNA sequencing of P. tricornutum’s morphotypes reveals specific features of the oval morphotype
journal, September 2018
- Ovide, Clément; Kiefer-Meyer, Marie-Christine; Bérard, Caroline
- Scientific Reports, Vol. 8, Issue 1
Different iron storage strategies among bloom-forming diatoms
journal, December 2018
- Lampe, Robert H.; Mann, Elizabeth L.; Cohen, Natalie R.
- Proceedings of the National Academy of Sciences, Vol. 115, Issue 52
Downregulation of mitochondrial alternative oxidase affects chloroplast function, redox status and stress response in a marine diatom
journal, November 2018
- Murik, Omer; Tirichine, Leila; Prihoda, Judit
- New Phytologist, Vol. 221, Issue 3
Isoprenoid biosynthesis in the diatom Haslea ostrearia
journal, December 2018
- Athanasakoglou, Anastasia; Grypioti, Emilia; Michailidou, Sofia
- New Phytologist, Vol. 222, Issue 1
Endocytosis-mediated siderophore uptake as a strategy for Fe acquisition in diatoms
journal, May 2018
- Kazamia, Elena; Sutak, Robert; Paz-Yepes, Javier
- Science Advances, Vol. 4, Issue 5
Enhanced pyruvate metabolism in plastids by overexpression of putative plastidial pyruvate transporter in Phaeodactylum tricornutum
journal, July 2020
- Seo, Seungbeom; Kim, Joon; Lee, Jun-Woo
- Biotechnology for Biofuels, Vol. 13, Issue 1
Extensive chloroplast genome rearrangement amongst three closely related Halamphora spp. (Bacillariophyceae), and evidence for rapid evolution as compared to land plants
journal, July 2019
- Hamsher, Sarah E.; Keepers, Kyle G.; Pogoda, Cloe S.
- PLOS ONE, Vol. 14, Issue 7
Update of the list of QPS‐recommended biological agents intentionally added to food or feed as notified to EFSA 10: Suitability of taxonomic units notified to EFSA until March 2019
journal, July 2019
- Koutsoumanis, Kostas; Allende, Ana; Alvarez‐Ordóñez, Avelino
- EFSA Journal, Vol. 17, Issue 7
A Potential Role for Epigenetic Processes in the Acclimation Response to Elevated pCO2 in the Model Diatom Phaeodactylum tricornutum
journal, January 2019
- Huang, Ruiping; Ding, Jiancheng; Gao, Kunshan
- Frontiers in Microbiology, Vol. 9
Blasticidin-S deaminase, a new selection marker for genetic transformation of the diatom Phaeodactylum tricornutum
journal, January 2018
- Buck, Jochen M.; Río Bártulos, Carolina; Gruber, Ansgar
- PeerJ, Vol. 6