Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting
Abstract
Background Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. Methods To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. Results We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10more »
- Authors:
-
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America, Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America
- Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
- Center for Brain Immunology and Glia, University of Virginia, Charlottesville, United States of America
- Publication Date:
- Research Org.:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1525490
- Alternate Identifier(s):
- OSTI ID: 1544792
- Report Number(s):
- PNNL-SA-138492
Journal ID: ISSN 2167-8359; e7055
- Grant/Contract Number:
- AC05-76RLO01830; AC05-76RL01830
- Resource Type:
- Published Article
- Journal Name:
- PeerJ
- Additional Journal Information:
- Journal Name: PeerJ Journal Volume: 7; Journal ID: ISSN 2167-8359
- Publisher:
- PeerJ
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING; machine learning; pathogens; effectors; prediction; protein sequence
Citation Formats
McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, and Adkins, Joshua N. Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting. United States: N. p., 2019.
Web. doi:10.7717/peerj.7055.
McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, & Adkins, Joshua N. Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting. United States. https://doi.org/10.7717/peerj.7055
McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, and Adkins, Joshua N. Fri .
"Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting". United States. https://doi.org/10.7717/peerj.7055.
@article{osti_1525490,
title = {Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting},
author = {McDermott, Jason E. and Cort, John R. and Nakayasu, Ernesto S. and Pruneda, Jonathan N. and Overall, Christopher and Adkins, Joshua N.},
abstractNote = {Background Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. Methods To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. Results We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10 RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version.},
doi = {10.7717/peerj.7055},
journal = {PeerJ},
number = ,
volume = 7,
place = {United States},
year = {Fri Jun 07 00:00:00 EDT 2019},
month = {Fri Jun 07 00:00:00 EDT 2019}
}
https://doi.org/10.7717/peerj.7055
Web of Science
Works referenced in this record:
Gene Selection for Cancer Classification using Support Vector Machines
journal, January 2002
- Guyon, Isabelle; Weston, Jason; Barnhill, Stephen
- Machine Learning, Vol. 46, Issue 1/3, p. 389-422
Hijacking the host ubiquitin pathway: structural strategies of bacterial E3 ubiquitin ligases
journal, February 2010
- Hicks, Stuart W.; Galán, Jorge E.
- Current Opinion in Microbiology, Vol. 13, Issue 1
Structural mimicry in bacterial virulence
journal, August 2001
- Stebbins, C. Erec; Galán, Jorge E.
- Nature, Vol. 412, Issue 6848
RavN is a member of a previously unrecognized group of Legionella pneumophila E3 ubiquitin ligases
journal, February 2018
- Lin, Yi-Han; Lucas, María; Evans, Timothy R.
- PLOS Pathogens, Vol. 14, Issue 2
Structure of the Shigella T3SS effector IpaH defines a new class of E3 ubiquitin ligases
journal, November 2008
- Singer, Alexander U.; Rohde, John R.; Lam, Robert
- Nature Structural & Molecular Biology, Vol. 15, Issue 12
A multi-pronged search for a common structural motif in the secretion signal of Salmonella enterica serovar Typhimurium type III effector proteins
journal, January 2010
- Buchko, Garry W.; Niemann, George; Baker, Erin S.
- Molecular BioSystems, Vol. 6, Issue 12
The Spectrum Kernel: a String Kernel for svm Protein Classification
conference, November 2011
- Leslie, Christina; Eskin, Eleazar; Noble, William Stafford
- Proceedings of the Pacific Symposium, Biocomputing 2002
Identification of an unconventional E3 binding surface on the UbcH5 Ub conjugate recognized by a pathogenic bacterial E3 ligase.
journal, February 2010
- Levin, I.; Eakin, C.; Blanc, M. -P.
- Proceedings of the National Academy of Sciences, Vol. 107, Issue 7
Biochemical and Structural Studies of a HECT-like Ubiquitin Ligase from Escherichia coli O157:H7
journal, October 2010
- Lin, David Yin-wei; Diao, Jianbo; Zhou, Daoguo
- Journal of Biological Chemistry, Vol. 286, Issue 1
ElaD, a Deubiquitinating Protease Expressed by E. coli
journal, April 2007
- Catic, André; Misaghi, Shahram; Korbel, Gregory A.
- PLoS ONE, Vol. 2, Issue 4
Biopython: freely available Python tools for computational molecular biology and bioinformatics
journal, March 2009
- Cock, P. J. A.; Antao, T.; Chang, J. T.
- Bioinformatics, Vol. 25, Issue 11
DRREP: deep ridge regressed epitope predictor
journal, October 2017
- Sher, Gene; Zhi, Degui; Zhang, Shaojie
- BMC Genomics, Vol. 18, Issue S6
Glutamine Deamidation and Dysfunction of Ubiquitin/NEDD8 Induced by a Bacterial Effector Family
journal, August 2010
- Cui, J.; Yao, Q.; Li, S.
- Science, Vol. 329, Issue 5996
Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center
journal, November 2016
- Wattam, Alice R.; Davis, James J.; Assaf, Rida
- Nucleic Acids Research, Vol. 45, Issue D1
A family of Salmonella virulence factors functions as a distinct class of autoregulated E3 ubiquitin ligases
journal, March 2009
- Quezada, C. M.; Hicks, S. W.; Galan, J. E.
- Proceedings of the National Academy of Sciences, Vol. 106, Issue 12
MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction
journal, August 2017
- Wang, Duolin; Zeng, Shuai; Xu, Chunhui
- Bioinformatics, Vol. 33, Issue 24
Exploitation of Host Polyubiquitination Machinery through Molecular Mimicry by Eukaryotic-Like Bacterial F-Box Effectors
journal, January 2010
- Price, Christopher T. D.; Kwaik, Yousef Abu
- Frontiers in Microbiology, Vol. 1
Mismatch string kernels for discriminative protein classification
journal, January 2004
- Leslie, C. S.; Eskin, E.; Cohen, A.
- Bioinformatics, Vol. 20, Issue 4
HECT and RING finger families of E3 ubiquitin ligases at a glance
journal, February 2012
- Metzger, M. B.; Hristova, V. A.; Weissman, A. M.
- Journal of Cell Science, Vol. 125, Issue 3
Automated Alphabet Reduction for Protein Datasets
journal, January 2009
- Bacardit, Jaume; Stout, Michael; Hirst, Jonathan D.
- BMC Bioinformatics, Vol. 10, Issue 1
Functional domains and motifs of bacterial type III effector proteins and their roles in infection
journal, November 2011
- Dean, Paul
- FEMS Microbiology Reviews, Vol. 35, Issue 6
pROC: an open-source package for R and S+ to analyze and compare ROC curves
journal, March 2011
- Robin, Xavier; Turck, Natacha; Hainard, Alexandre
- BMC Bioinformatics, Vol. 12, Issue 1
Bacterial Interference of Ubiquitination and Deubiquitination
journal, March 2007
- Rytkönen, Anne; Holden, David W.
- Cell Host & Microbe, Vol. 1, Issue 1
Type III Secretion Effectors of the IpaH Family Are E3 Ubiquitin Ligases
journal, March 2007
- Rohde, John R.; Breitkreutz, Ashton; Chenal, Alexandre
- Cell Host & Microbe, Vol. 1, Issue 1
Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach
journal, July 2009
- Burstein, David; Zusman, Tal; Degtyar, Elena
- PLoS Pathogens, Vol. 5, Issue 7
What is a support vector machine?
journal, December 2006
- Noble, William S.
- Nature Biotechnology, Vol. 24, Issue 12
Prediction of multi-drug resistance transporters using a novel sequence analysis method
journal, January 2015
- McDermott, Jason E.; Bruillard, Paul; Overall, Christopher C.
- F1000Research, Vol. 4
Accurate Prediction of Secreted Substrates and Identification of a Conserved Putative Secretion Signal for Type III Secretion Systems
journal, April 2009
- Samudrala, Ram; Heffron, Fred; McDermott, Jason E.
- PLoS Pathogens, Vol. 5, Issue 4
Sequence-Based Prediction of Type III Secreted Proteins
journal, April 2009
- Arnold, Roland; Brandmaier, Stefan; Kleine, Frederick
- PLoS Pathogens, Vol. 5, Issue 4
CaMELS: In silico prediction of calmodulin binding proteins and their binding sites : ABBASI et al.
journal, July 2017
- Abbasi, Wajid Arshad; Asif, Amina; Andleeb, Saiqa
- Proteins: Structure, Function, and Bioinformatics, Vol. 85, Issue 9
Catch me if you can: bacterial effectors and plant targets
journal, November 2012
- Deslandes, Laurent; Rivas, Susana
- Trends in Plant Science, Vol. 17, Issue 11
Breaking the chains: structure and function of the deubiquitinases
journal, August 2009
- Komander, David; Clague, Michael J.; Urbé, Sylvie
- Nature Reviews Molecular Cell Biology, Vol. 10, Issue 8
Cytosol as battleground: ubiquitin as a weapon for both host and pathogen
journal, April 2010
- Collins, Cathleen A.; Brown, Eric J.
- Trends in Cell Biology, Vol. 20, Issue 4
Ubiquitylation in innate and adaptive immunity
journal, March 2009
- Bhoj, Vijay G.; Chen, Zhijian J.
- Nature, Vol. 458, Issue 7237
Five Mechanisms of Manipulation by Bacterial Effectors: A Ubiquitous Theme
journal, August 2012
- Anderson, David M.; Frank, Dara W.
- PLoS Pathogens, Vol. 8, Issue 8
NleG Type 3 Effectors from Enterohaemorrhagic Escherichia coli Are U-Box E3 Ubiquitin Ligases
journal, June 2010
- Wu, Bin; Skarina, Tatiana; Yee, Adelinda
- PLoS Pathogens, Vol. 6, Issue 6
Computational Prediction of Type III and IV Secreted Effectors in Gram-Negative Bacteria
journal, October 2010
- McDermott, Jason E.; Corrigan, Abigail; Peterson, Elena
- Infection and Immunity, Vol. 79, Issue 1
Identification of DNA-Binding Proteins Using Mixed Feature Representation Methods
journal, September 2017
- Qu, Kaiyang; Han, Ke; Wu, Song
- Molecules, Vol. 22, Issue 10
The Universal Protein Resource (UniProt)
journal, December 2004
- Bairoch, A.
- Nucleic Acids Research, Vol. 33, Issue Database issue
Non-canonical ubiquitin-based signals for proteasomal degradation
journal, February 2012
- Kravtsova-Ivantsiv, Y.; Ciechanover, A.
- Journal of Cell Science, Vol. 125, Issue 3
How microbes utilize host ubiquitination
journal, October 2009
- Spallek, Thomas; Robatzek, Silke; Göhre, Vera
- Cellular Microbiology, Vol. 11, Issue 10
A Class of Edit Kernels for SVMs to Predict Translation Initiation Sites in Eukaryotic mRNAs
journal, July 2005
- Li, Haifeng; Jiang, Tao
- Journal of Computational Biology, Vol. 12, Issue 6
High-order neural networks and kernel methods for peptide-MHC binding prediction
journal, July 2015
- Kuksa, Pavel P.; Min, Martin Renqiang; Dugar, Rishabh
- Bioinformatics
Functional Analysis of NopM, a Novel E3 Ubiquitin Ligase (NEL) Domain Effector of Rhizobium sp. Strain NGR234
journal, May 2012
- Xin, Da-Wei; Liao, Sha; Xie, Zhi-Ping
- PLoS Pathogens, Vol. 8, Issue 5
Predicting human protein subcellular localization by heterogeneous and comprehensive approaches
journal, June 2017
- Tung, Chi-Hua; Chen, Chi-Wei; Sun, Han-Hao
- PLOS ONE, Vol. 12, Issue 6