DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting

Abstract

Background Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. Methods To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. Results We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10more » RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version.« less

Authors:
 [1];  [2];  [2];  [3];  [4];  [2]
  1. Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America, Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
  2. Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America
  3. Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
  4. Center for Brain Immunology and Glia, University of Virginia, Charlottesville, United States of America
Publication Date:
Research Org.:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1525490
Alternate Identifier(s):
OSTI ID: 1544792
Report Number(s):
PNNL-SA-138492
Journal ID: ISSN 2167-8359; e7055
Grant/Contract Number:  
AC05-76RLO01830; AC05-76RL01830
Resource Type:
Published Article
Journal Name:
PeerJ
Additional Journal Information:
Journal Name: PeerJ Journal Volume: 7; Journal ID: ISSN 2167-8359
Publisher:
PeerJ
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING; machine learning; pathogens; effectors; prediction; protein sequence

Citation Formats

McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, and Adkins, Joshua N. Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting. United States: N. p., 2019. Web. doi:10.7717/peerj.7055.
McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, & Adkins, Joshua N. Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting. United States. https://doi.org/10.7717/peerj.7055
McDermott, Jason E., Cort, John R., Nakayasu, Ernesto S., Pruneda, Jonathan N., Overall, Christopher, and Adkins, Joshua N. Fri . "Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting". United States. https://doi.org/10.7717/peerj.7055.
@article{osti_1525490,
title = {Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting},
author = {McDermott, Jason E. and Cort, John R. and Nakayasu, Ernesto S. and Pruneda, Jonathan N. and Overall, Christopher and Adkins, Joshua N.},
abstractNote = {Background Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases. Methods To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences. Results We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10 RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version.},
doi = {10.7717/peerj.7055},
journal = {PeerJ},
number = ,
volume = 7,
place = {United States},
year = {Fri Jun 07 00:00:00 EDT 2019},
month = {Fri Jun 07 00:00:00 EDT 2019}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.7717/peerj.7055

Citation Metrics:
Cited by: 4 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Gene Selection for Cancer Classification using Support Vector Machines
journal, January 2002

  • Guyon, Isabelle; Weston, Jason; Barnhill, Stephen
  • Machine Learning, Vol. 46, Issue 1/3, p. 389-422
  • DOI: 10.1023/A:1012487302797

Hijacking the host ubiquitin pathway: structural strategies of bacterial E3 ubiquitin ligases
journal, February 2010


Structural mimicry in bacterial virulence
journal, August 2001

  • Stebbins, C. Erec; Galán, Jorge E.
  • Nature, Vol. 412, Issue 6848
  • DOI: 10.1038/35089000

RavN is a member of a previously unrecognized group of Legionella pneumophila E3 ubiquitin ligases
journal, February 2018


Structure of the Shigella T3SS effector IpaH defines a new class of E3 ubiquitin ligases
journal, November 2008

  • Singer, Alexander U.; Rohde, John R.; Lam, Robert
  • Nature Structural & Molecular Biology, Vol. 15, Issue 12
  • DOI: 10.1038/nsmb.1511

A multi-pronged search for a common structural motif in the secretion signal of Salmonella enterica serovar Typhimurium type III effector proteins
journal, January 2010

  • Buchko, Garry W.; Niemann, George; Baker, Erin S.
  • Molecular BioSystems, Vol. 6, Issue 12
  • DOI: 10.1039/c0mb00097c

The Spectrum Kernel: a String Kernel for svm Protein Classification
conference, November 2011

  • Leslie, Christina; Eskin, Eleazar; Noble, William Stafford
  • Proceedings of the Pacific Symposium, Biocomputing 2002
  • DOI: 10.1142/9789812799623_0053

Identification of an unconventional E3 binding surface on the UbcH5   Ub conjugate recognized by a pathogenic bacterial E3 ligase.
journal, February 2010

  • Levin, I.; Eakin, C.; Blanc, M. -P.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 7
  • DOI: 10.1073/pnas.0914821107

Biochemical and Structural Studies of a HECT-like Ubiquitin Ligase from Escherichia coli O157:H7
journal, October 2010

  • Lin, David Yin-wei; Diao, Jianbo; Zhou, Daoguo
  • Journal of Biological Chemistry, Vol. 286, Issue 1
  • DOI: 10.1074/jbc.M110.167643

ElaD, a Deubiquitinating Protease Expressed by E. coli
journal, April 2007


Biopython: freely available Python tools for computational molecular biology and bioinformatics
journal, March 2009


DRREP: deep ridge regressed epitope predictor
journal, October 2017


Glutamine Deamidation and Dysfunction of Ubiquitin/NEDD8 Induced by a Bacterial Effector Family
journal, August 2010


Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center
journal, November 2016

  • Wattam, Alice R.; Davis, James J.; Assaf, Rida
  • Nucleic Acids Research, Vol. 45, Issue D1
  • DOI: 10.1093/nar/gkw1017

A family of Salmonella virulence factors functions as a distinct class of autoregulated E3 ubiquitin ligases
journal, March 2009

  • Quezada, C. M.; Hicks, S. W.; Galan, J. E.
  • Proceedings of the National Academy of Sciences, Vol. 106, Issue 12
  • DOI: 10.1073/pnas.0811058106

MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction
journal, August 2017


Exploitation of Host Polyubiquitination Machinery through Molecular Mimicry by Eukaryotic-Like Bacterial F-Box Effectors
journal, January 2010


Mismatch string kernels for discriminative protein classification
journal, January 2004


HECT and RING finger families of E3 ubiquitin ligases at a glance
journal, February 2012

  • Metzger, M. B.; Hristova, V. A.; Weissman, A. M.
  • Journal of Cell Science, Vol. 125, Issue 3
  • DOI: 10.1242/jcs.091777

Automated Alphabet Reduction for Protein Datasets
journal, January 2009

  • Bacardit, Jaume; Stout, Michael; Hirst, Jonathan D.
  • BMC Bioinformatics, Vol. 10, Issue 1
  • DOI: 10.1186/1471-2105-10-6

Functional domains and motifs of bacterial type III effector proteins and their roles in infection
journal, November 2011


pROC: an open-source package for R and S+ to analyze and compare ROC curves
journal, March 2011

  • Robin, Xavier; Turck, Natacha; Hainard, Alexandre
  • BMC Bioinformatics, Vol. 12, Issue 1
  • DOI: 10.1186/1471-2105-12-77

Bacterial Interference of Ubiquitination and Deubiquitination
journal, March 2007


Type III Secretion Effectors of the IpaH Family Are E3 Ubiquitin Ligases
journal, March 2007


Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach
journal, July 2009


What is a support vector machine?
journal, December 2006


Prediction of multi-drug resistance transporters using a novel sequence analysis method
journal, January 2015


Accurate Prediction of Secreted Substrates and Identification of a Conserved Putative Secretion Signal for Type III Secretion Systems
journal, April 2009


Sequence-Based Prediction of Type III Secreted Proteins
journal, April 2009


CaMELS: In silico prediction of calmodulin binding proteins and their binding sites : ABBASI et al.
journal, July 2017

  • Abbasi, Wajid Arshad; Asif, Amina; Andleeb, Saiqa
  • Proteins: Structure, Function, and Bioinformatics, Vol. 85, Issue 9
  • DOI: 10.1002/prot.25330

Catch me if you can: bacterial effectors and plant targets
journal, November 2012


Breaking the chains: structure and function of the deubiquitinases
journal, August 2009

  • Komander, David; Clague, Michael J.; Urbé, Sylvie
  • Nature Reviews Molecular Cell Biology, Vol. 10, Issue 8
  • DOI: 10.1038/nrm2731

Cytosol as battleground: ubiquitin as a weapon for both host and pathogen
journal, April 2010


Ubiquitylation in innate and adaptive immunity
journal, March 2009


Five Mechanisms of Manipulation by Bacterial Effectors: A Ubiquitous Theme
journal, August 2012


NleG Type 3 Effectors from Enterohaemorrhagic Escherichia coli Are U-Box E3 Ubiquitin Ligases
journal, June 2010


Computational Prediction of Type III and IV Secreted Effectors in Gram-Negative Bacteria
journal, October 2010

  • McDermott, Jason E.; Corrigan, Abigail; Peterson, Elena
  • Infection and Immunity, Vol. 79, Issue 1
  • DOI: 10.1128/IAI.00537-10

Identification of DNA-Binding Proteins Using Mixed Feature Representation Methods
journal, September 2017


The Universal Protein Resource (UniProt)
journal, December 2004

  • Bairoch, A.
  • Nucleic Acids Research, Vol. 33, Issue Database issue
  • DOI: 10.1093/nar/gki070

Non-canonical ubiquitin-based signals for proteasomal degradation
journal, February 2012

  • Kravtsova-Ivantsiv, Y.; Ciechanover, A.
  • Journal of Cell Science, Vol. 125, Issue 3
  • DOI: 10.1242/jcs.093567

How microbes utilize host ubiquitination
journal, October 2009


A Class of Edit Kernels for SVMs to Predict Translation Initiation Sites in Eukaryotic mRNAs
journal, July 2005


High-order neural networks and kernel methods for peptide-MHC binding prediction
journal, July 2015


Functional Analysis of NopM, a Novel E3 Ubiquitin Ligase (NEL) Domain Effector of Rhizobium sp. Strain NGR234
journal, May 2012


Predicting human protein subcellular localization by heterogeneous and comprehensive approaches
journal, June 2017