Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Predicting antimicrobial resistance using conserved genes

Journal Article · · PLoS Computational Biology (Online)
 [1];  [1];  [1];  [2];  [3]
  1. Argonne National Lab. (ANL), Argonne, IL (United States); Univ. of Chicago, IL (United States)
  2. Fellowship for Interpretation of Genomes, Burr Ridge, IL (United States)
  3. Argonne National Lab. (ANL), Argonne, IL (United States); Univ. of Chicago, IL (United States); Fellowship for Interpretation of Genomes, Burr Ridge, IL (United States); Northwestern Univ., Evanston, IL (United States). Northwestern Argonne Inst. for Science and Engineering
A growing number of studies are using machine learning models to accurately predict antimicrobial resistance (AMR) phenotypes from bacterial sequence data. Although these studies are showing promise, the models are typically trained using features derived from comprehensive sets of AMR genes or whole genome sequences and may not be suitable for use when genomes are incomplete. In this study, we explore the possibility of predicting AMR phenotypes using incomplete genome sequence data. Models were built from small sets of randomly-selected core genes after removing the AMR genes. For Klebsiella pneumoniae, Mycobacterium tuberculosis, Salmonella enterica, and Staphylococcus aureus, we report that it is possible to classify susceptible and resistant phenotypes with average F1 scores ranging from 0.80–0.89 with as few as 100 conserved non-AMR genes, with very major error rates ranging from 0.11–0.23 and major error rates ranging from 0.10–0.20. Models built from core genes have predictive power in cases where the primary AMR mechanisms result from SNPs or horizontal gene transfer. By randomly sampling non-overlapping sets of core genes, we show that F1 scores and error rates are stable and have little variance between replicates. Although these small core gene models have lower accuracies and higher error rates than models built from the corresponding assembled genomes, the results suggest that sufficient variation exists in the core non-AMR genes of a species for predicting AMR phenotypes.
Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
Defense Advanced Research Projects Agency (DARPA); National Institute of Allergy and Infectious Diseases (NIAID); USDOE
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1757970
Journal Information:
PLoS Computational Biology (Online), Journal Name: PLoS Computational Biology (Online) Journal Issue: 10 Vol. 16; ISSN 1553-7358
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (60)

Machine learning and structural analysis of Mycobacterium tuberculosis pan-genome identifies genetic signatures of antibiotic resistance journal October 2018
SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing journal May 2012
Prediction of Acquired Antimicrobial Resistance for Multiple Bacterial Species Using Neural Networks journal January 2020
Interplay in the Selection of Fluoroquinolone Resistance and Bacterial Fitness journal August 2009
Quinolone-resistant mutations of the gyrA gene of Escherichia coli journal January 1988
A Commensal Strain of Staphylococcus epidermidis Overexpresses Membrane Proteins Associated with Pathogenesis When Grown in Biofilms journal April 2015
Clinical and microbiological implications of time-to-positivity of blood cultures in patients with Gram-negative bacilli bacteremia journal February 2013
Blood culture-based diagnosis of bacteraemia: state of the art journal April 2015
Expanded-spectrum cephalosporin resistance in non-typhoid Salmonella journal June 2004
The MycoBrowser portal: A comprehensive and manually annotated resource for mycobacterial genomes journal January 2011
Updating benchtop sequencing performance comparison journal April 2013
Rapid inference of antibiotic resistance and susceptibility by genomic neighbour typing journal February 2020
Genome-wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis journal January 2018
Developing an in silico minimum inhibitory concentration panel test for Klebsiella pneumoniae journal January 2018
RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes journal February 2015
Antimicrobial Resistance Prediction in PATRIC and RAST journal June 2016
Genes required for mycobacterial growth defined by high density mutagenesis: Genes required for mycobacterial growth journal March 2003
From The Cover: Genome-wide requirements for Mycobacterium tuberculosis adaptation and survival in macrophages journal May 2005
Proteogenomic Analysis of Mycobacterium tuberculosis By High Resolution Mass Spectrometry journal October 2011
Antimicrobial Susceptibility Testing: A Review of General Principles and Contemporary Practices journal December 2009
Comparison of the Proteome of Isoniazid-Resistant and -Susceptible Strains of Mycobacterium tuberculosis journal December 2006
Capsular Polysaccharide Types and Virulence-Related Traits of Epidemic KPC-Producing Klebsiella pneumoniae Isolates in a Chinese University Hospital journal October 2017
KMC 2: fast and resource-frugal k-mer counting journal January 2015
Clinical Implications of Genomic Adaptation and Evolution of Carbapenem-Resistant Klebsiella pneumoniae journal February 2017
Identification of acquired antimicrobial resistance genes journal July 2012
Predicting antimicrobial susceptibilities for Escherichia coli and Klebsiella pneumoniae isolates using whole genomic sequence data journal May 2013
WGS to predict antibiotic MICs for Neisseria gonorrhoeae journal March 2017
MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability journal January 2013
Adaptive Landscapes of Resistance Genes Change as Antibiotic Concentrations Change journal June 2015
PATRIC, the bacterial bioinformatics database and analysis resource journal November 2013
Interactive Tree Of Life (iTOL) v4: recent updates and new developments journal April 2019
Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock* journal January 2006
Machine learning for the prediction of antibacterial susceptibility in Mycobacterium tuberculosis conference June 2014
BapA, a large secreted protein required for biofilm formation and host colonization of Salmonella enterica serovar Enteritidis: Bap-like protein of Salmonella Enteritidis journal October 2005
Identification of a novel antigen from Staphylococcus epidermidis journal November 2000
Roles of two large serine recombinases in mobilizing the methicillin-resistance cassette SCC mec: Roles of the SCCmec recombinases journal May 2013
Genomics of antibiotic-resistance prediction in Pseudomonas aeruginosa : Predicting antibiotic resistance in journal June 2017
Whole-Genome Sequencing for Detecting Antimicrobial Resistance in Nontyphoidal Salmonella journal July 2016
A New Class of Genetic Element, Staphylococcus Cassette Chromosome mec, Encodes Methicillin Resistance in Staphylococcus aureus journal June 2000
Biological Cost of Single and Multiple Norfloxacin Resistance Mutations in Escherichia coli Implicated in Urinary Tract Infections journal June 2005
Differential Gene Expression Profiling of Staphylococcus aureus Cultivated under Biofilm and Planktonic Conditions journal May 2005
Gene Acquisition at the Insertion Site for SCCmec, the Genomic Island Conferring Methicillin Resistance in Staphylococcus aureus journal December 2007
A Novel Outer Membrane Protein, Wzi, Is Involved in Surface Assembly of the Escherichia coli K30 Group 1 Capsule journal October 2003
Using Machine Learning To Predict Antimicrobial MICs and Associated Genomic Features for Nontyphoidal Salmonella journal October 2018
wzi Gene Sequencing, a Rapid Method for Determination of Capsular Type for Klebsiella Strains journal October 2013
Comprehensive Essentiality Analysis of the Mycobacterium tuberculosis Genome via Saturating Transposon Mutagenesis journal January 2017
Prediction of Acquired Antimicrobial Resistance for Multiple Bacterial Species Using Neural Networks journal January 2020
XGBoost: A Scalable Tree Boosting System conference January 2016
Antimicrobial resistance: risk associated with antibiotic overuse and initiatives to reduce the problem journal September 2014
Predictive computational phenotyping and biomarker discovery using reference-free genome comparisons journal September 2016
Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data journal December 2018
Evaluation of parameters affecting performance and reliability of machine learning-based antibiotic susceptibility testing from whole genome sequencing data journal September 2019
Machine learning with random subspace ensembles identifies antimicrobial resistance determinants from pan-genomes of three pathogens journal March 2020
FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments journal March 2010
High-Resolution Phenotypic Profiling Defines Genes Essential for Mycobacterial Growth and Cholesterol Catabolism journal September 2011
Predicting antimicrobial resistance in Pseudomonas aeruginosa with machine learning‐enabled molecular diagnostics journal February 2020
Controlling Antimicrobial Resistance in Hospitals: Infection Control and Use of Antibiotics journal April 2001
PATtyFams: Protein Families for the Microbial Genomes in the PATRIC Database journal February 2016
Evaluation of Machine Learning and Rules-Based Approaches for Predicting Antimicrobial Resistance Profiles in Gram-negative Bacilli from Whole Genome Sequence Data journal November 2016
The Complex Relationship between Virulence and Antibiotic Resistance journal January 2017

Similar Records

Predicting Antimicrobial Resistance Using Partial Genome Alignments
Journal Article · Mon Jun 14 20:00:00 EDT 2021 · mSystems · OSTI ID:1854526

Antimicrobial resistance prediction in PATRIC and RAST
Journal Article · Mon Jun 13 20:00:00 EDT 2016 · Scientific Reports · OSTI ID:1258659

Predicting variable gene content in Escherichia coli using conserved genes
Journal Article · Tue Jun 13 20:00:00 EDT 2023 · mSystems · OSTI ID:2324772

Related Subjects