DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits

Journal Article · · Scientific Reports

Abstract Invasive plant pathogenic fungi have a global impact, with devastating economic and environmental effects on crops and forests. Biosurveillance, a critical component of threat mitigation, requires risk prediction based on fungal lifestyles and traits. Recent studies have revealed distinct genomic patterns associated with specific groups of plant pathogenic fungi. We sought to establish whether these phytopathogenic genomic patterns hold across diverse taxonomic and ecological groups from the Ascomycota and Basidiomycota, and furthermore, if those patterns can be used in a predictive capacity for biosurveillance. Using a supervised machine learning approach that integrates phylogenetic and genomic data, we analyzed 387 fungal genomes to test a proof-of-concept for the use of genomic signatures in predicting fungal phytopathogenic lifestyles and traits during biosurveillance activities. Our machine learning feature sets were derived from genome annotation data of carbohydrate-active enzymes (CAZymes), peptidases, secondary metabolite clusters (SMCs), transporters, and transcription factors. We found that machine learning could successfully predict fungal lifestyles and traits across taxonomic groups, with the best predictive performance coming from feature sets comprising CAZyme, peptidase, and SMC data. While phylogeny was an important component in most predictions, the inclusion of genomic data improved prediction performance for every lifestyle and trait tested. Plant pathogenicity was one of the best-predicted traits, showing the promise of predictive genomics for biosurveillance applications. Furthermore, our machine learning approach revealed expansions in the number of genes from specific CAZyme and peptidase families in the genomes of plant pathogens compared to non-phytopathogenic genomes (saprotrophs, endo- and ectomycorrhizal fungi). Such genomic feature profiles give insight into the evolution of fungal phytopathogenicity and could be useful to predict the risks of unknown fungi in future biosurveillance activities.

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2009003
Journal Information:
Scientific Reports, Journal Name: Scientific Reports Journal Issue: 1 Vol. 13; ISSN 2045-2322
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (80)

The cellulase encoded by the native plasmid of Clavibacter michiganensis ssp. sepedonicus plays a role in virulence and contains an expansin-like domain journal November 2000
Characterisation of Aspergillus niger prolyl aminopeptidase journal January 2005
Proteases from phytopathogenic fungi and their importance in phytopathogenicity journal August 2016
Biosurveillance of forest insects: part II—adoption of genomic tools by end user communities and barriers to integration journal July 2018
Biosurveillance of forest insects: part I—integration and application of genomic tools to the surveillance of non-native forest insects journal August 2018
Ecological impacts of non-indigenous invasive fungi as forest pathogens journal July 2008
Recent approaches on the genomic analysis of the phytopathogenic fungus Colletotrichum spp. journal May 2019
Plant–pathogen arms races at the molecular level journal August 2000
Dating the molecular clock in fungi – how close are we? journal February 2010
The Botrytis cinerea aspartic proteinase family journal January 2010
Host–pathogen warfare at the plant cell wall journal August 2009
Plant expansins: diversity and interactions with plant cell walls journal June 2015
Pathogenic attributes of Sclerotinia sclerotiorum : Switching from a biotrophic to necrotrophic lifestyle journal April 2015
101 Dothideomycetes genomes: A test case for predicting lifestyles and emergence of pathogens journal June 2020
ROC-ing along: Evaluation and interpretation of receiver operating characteristic curves journal June 2016
Pathogenesis, parasitism and mutualism in the trophic space of microbe–plant interactions journal August 2010
Emerging infectious diseases of plants: pathogen pollution, climate change and agrotechnology drivers journal October 2004
Analysis of a Secreted Aspartic Peptidase Disruption Mutant of Glomerella cingulata journal March 2004
The plant immune system journal November 2006
Emerging fungal threats to animal, plant and ecosystem health journal April 2012
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets journal October 2017
Genome sequencing and analysis of the biomass-degrading fungus Trichoderma reesei (syn. Hypocrea jecorina) journal May 2008
Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists journal February 2015
Plant immunity: towards an integrated view of plant–pathogen interactions journal June 2010
Evolution and genome architecture in fungal plant pathogens journal August 2017
Large-scale genome sequencing of mycorrhizal fungi provides insights into the early evolution of symbiotic traits journal October 2020
Fungal lifestyle reflected in serine protease repertoire journal August 2017
Comparative genomics provides insights into the lifestyle and reveals functional heterogeneity of dark septate endophytic fungi journal April 2018
Secret lifestyles of Neurospora crassa journal May 2014
Infection structures of biotrophic and hemibiotrophic fungal plant pathogens journal March 2001
Expression cloning of a fungal proline-rich glycoprotein specific to the biotrophic interface formed in the Colletotrichum-bean interaction journal July 1998
Crystal structure and activity of Bacillus subtilis YoaJ (EXLX1), a bacterial expansin that promotes root colonization journal October 2008
Nonindigenous species introductions: a threat to Canada's forests and forest economy1 journal June 2002
From genomes to forest management – tackling invasivePhytophthoraspecies in the era of genomics journal July 2019
Dating divergences in the Fungal Tree of Life: review and new analyses journal November 2006
Genome-wide annotation, comparison and functional genomics of carbohydrate-active enzymes in legumes infecting Fusarium oxysporum formae speciales journal January 2020
Supervised learning on phylogenetically distributed data journal December 2020
Genus-Wide Comparative Genome Analyses ofColletotrichumSpecies Reveal Specific Gene Family Losses and Gains during Adaptation to Specific Infection Lifestyles journal April 2016
FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix journal April 2009
The carbohydrate-active enzyme database: functions and literature journal November 2021
VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center journal October 2021
MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform journal July 2002
MycoCosm portal: gearing up for 1000 fungal genomes journal December 2013
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation journal November 2015
Ensembl Genomes 2020—enabling non-vertebrate genomic research journal October 2019
Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis journal April 2000
Cloning and Characterization of a Novel Invertase from the Obligate Biotroph Uromyces fabae and Analysis of Expression Patterns of Host and Pathogen Invertases in the Course of Infection journal June 2006
The Endo-β-1,4-glucanase CelA of Clavibacter michiganensis subsp. michiganensis Is a Pathogenicity Determinant Required for Induction of Bacterial Wilt of Tomato journal July 2000
Disease Management in the Genomics Era—Summaries of Focus Issue Papers journal October 2016
Role of Swollenin, an Expansin-Like Protein from Trichoderma , in Plant Root Colonization journal April 2008
The infection cushion of Botrytis cinerea: a fungal ‘weapon’ of plant‐biomass destruction journal February 2021
Genomic biosurveillance of forest invasive alien enemies: A story written in code journal June 2019
Arabidopsis pathology breathes new life into the necrotrophs-vs.-biotrophs classification of fungal pathogens journal July 2004
The biosecurity threat to the UK and global environment from international trade in plants journal October 2008
Terminology for Plant Parasites journal July 1966
Concepts in Fungal Nutrition and the Origin of Biotrophy journal May 1973
Sucrose-metabolizing enzymes from leaves of barley infected with brown rust (Puccinia hordei Otth.) journal April 1992
Glycosyltransferases and their products: cryptococcal variations on fungal themes journal June 2006
A unique invertase is important for sugar absorption of an obligate biotrophic pathogen during infection journal July 2017
Comparative genomics reveals unique wood‐decay strategies and fruiting body development in the Schizophyllaceae journal June 2019
Planted forest health: The need for a global strategy journal August 2015
Forest health and global change journal August 2015
Fungal Effectors and Plant Susceptibility journal April 2015
Plant Cell Wall–Degrading Enzymes and Their Secretion in Plant-Pathogenic Fungi journal August 2014
Friend or foe? Evolutionary history of glycoside hydrolase family 32 genes encoding for sucrolytic activity in fungi and its implications for plant-fungal symbioses journal January 2009
Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi journal January 2013
Arsenal of plant cell wall degrading enzymes reflects host preference among plant pathogenic fungi journal January 2011
Saprophytic and pathogenic fungi in the Ceratocystidaceae differ in their ability to metabolize plant-derived sucrose journal December 2015
Widespread signatures of selection for secreted peptidases in a fungal plant pathogen journal January 2018
Network analysis exposes core functions in major lifestyles of fungal and oomycete plant pathogens journal December 2019
Comparative transcriptomic analysis of races 1, 2, 5 and 6 of Fusarium oxysporum f.sp. pisi in a susceptible pea host identifies differential pathogenicity profiles journal October 2021
Comparative Genomics Reveals Insight into Virulence Strategies of Plant Pathogenic Oomycetes journal October 2013
Diverse Lifestyles and Strategies of Plant Pathogenesis Encoded in the Genomes of Eighteen Dothideomycetes Fungi journal December 2012
Fungal Genomics Challenges the Dogma of Name-Based Biosecurity journal May 2016
A conserved fungal glycosyltransferase facilitates pathogenesis of plants by enabling hyphal growth on solid surfaces journal October 2017
Increasing forest loss worldwide from invasive pests requires new trade regulations journal October 2014
Fungal Infection of Plants journal October 1996
“CATAStrophy,” a Genome-Informed Trophic Classification of Filamentous Plant Pathogens – How Many Different Types of Filamentous Plant Pathogens Are There? journal January 2020
Evolution of virulence in fungal plant pathogens: exploiting fungal genomics to control plant disease journal May 2015
Mycosphere Essays 9: Defining biotrophs and hemibiotrophs journal January 2016