Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting

Journal Article · · PeerJ
DOI:https://doi.org/10.7717/peerj.7055· OSTI ID:1525490
 [1];  [2];  [2];  [3];  [4];  [2]
  1. Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America, Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
  2. Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States of America
  3. Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, OR, United States of America
  4. Center for Brain Immunology and Glia, University of Virginia, Charlottesville, United States of America
Background

Although pathogenic Gram-negative bacteria lack their own ubiquitination machinery, they have evolved or acquired virulence effectors that can manipulate the host ubiquitination process through structural and/or functional mimicry of host machinery. Many such effectors have been identified in a wide variety of bacterial pathogens that share little sequence similarity amongst themselves or with eukaryotic ubiquitin E3 ligases.

Methods

To allow identification of novel bacterial E3 ubiquitin ligase effectors from protein sequences we have developed a machine learning approach, the SVM-based Identification and Evaluation of Virulence Effector Ubiquitin ligases (SIEVE-Ub). We extend the string kernel approach used previously to sequence classification by introducing reduced amino acid (RED) alphabet encoding for protein sequences.

Results

We found that 14mer peptides with amino acids represented as simply either hydrophobic or hydrophilic provided the best models for discrimination of E3 ligases from other effector proteins with a receiver-operator characteristic area under the curve (AUC) of 0.90. When considering a subset of E3 ubiquitin ligase effectors that do not fall into known sequence based families we found that the AUC was 0.82, demonstrating the effectiveness of our method at identifying novel functional family members. Feature selection was used to identify a parsimonious set of 10 RED peptides that provided good discrimination, and these peptides were found to be located in functionally important regions of the proteins involved in E2 and host target protein binding. Our general approach enables construction of models based on other effector functions. We used SIEVE-Ub to predict nine potential novel E3 ligases from a large set of bacterial genomes. SIEVE-Ub is available for download at https://doi.org/10.6084/m9.figshare.7766984.v1 or https://github.com/biodataganache/SIEVE-Ub for the most current version.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
1525490
Alternate ID(s):
OSTI ID: 1544792
Report Number(s):
PNNL-SA--138492; e7055
Journal Information:
PeerJ, Journal Name: PeerJ Vol. 7; ISSN 2167-8359
Publisher:
PeerJCopyright Statement
Country of Publication:
United States
Language:
English

References (49)

CaMELS: In silico prediction of calmodulin binding proteins and their binding sites : ABBASI et al. journal July 2017
Type III Secretion Effectors of the IpaH Family Are E3 Ubiquitin Ligases journal March 2007
Bacterial Interference of Ubiquitination and Deubiquitination journal March 2007
Hijacking the host ubiquitin pathway: structural strategies of bacterial E3 ubiquitin ligases journal February 2010
Cytosol as battleground: ubiquitin as a weapon for both host and pathogen journal April 2010
Catch me if you can: bacterial effectors and plant targets journal November 2012
Gene Selection for Cancer Classification using Support Vector Machines journal January 2002
Structural mimicry in bacterial virulence journal August 2001
Ubiquitylation in innate and adaptive immunity journal March 2009
What is a support vector machine? journal December 2006
Breaking the chains: structure and function of the deubiquitinases journal August 2009
Structure of the Shigella T3SS effector IpaH defines a new class of E3 ubiquitin ligases journal November 2008
A multi-pronged search for a common structural motif in the secretion signal of Salmonella enterica serovar Typhimurium type III effector proteins journal January 2010
A family of Salmonella virulence factors functions as a distinct class of autoregulated E3 ubiquitin ligases journal March 2009
Identification of an unconventional E3 binding surface on the UbcH5   Ub conjugate recognized by a pathogenic bacterial E3 ligase. journal February 2010
Salmonella Type III Secretion Effector SlrP Is an E3 Ubiquitin Ligase for Mammalian Thioredoxin journal August 2009
Biochemical and Structural Studies of a HECT-like Ubiquitin Ligase from Escherichia coli O157:H7 journal October 2010
A Class of Edit Kernels for SVMs to Predict Translation Initiation Sites in Eukaryotic mRNAs journal July 2005
Mismatch string kernels for discriminative protein classification journal January 2004
Biopython: freely available Python tools for computational molecular biology and bioinformatics journal March 2009
High-order neural networks and kernel methods for peptide-MHC binding prediction journal July 2015
MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction journal August 2017
The Universal Protein Resource (UniProt) journal December 2004
Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center journal November 2016
Genus-optimized strategy for the identification of chlamydial type III secretion substrates journal August 2013
How microbes utilize host ubiquitination journal October 2009
Functional domains and motifs of bacterial type III effector proteins and their roles in infection journal November 2011
Glutamine Deamidation and Dysfunction of Ubiquitin/NEDD8 Induced by a Bacterial Effector Family journal August 2010
Computational Prediction of Type III and IV Secreted Effectors in Gram-Negative Bacteria journal October 2010
Discovery of Novel Secreted Virulence Factors from Salmonella enterica Serovar Typhimurium by Proteomic Analysis of Culture Supernatants journal October 2010
A Program of Yersinia enterocolitica Type III Secretion Reactions Is Activated by Specific Signals journal September 2001
The Spectrum Kernel: a String Kernel for svm Protein Classification conference November 2011
Automated Alphabet Reduction for Protein Datasets journal January 2009
pROC: an open-source package for R and S+ to analyze and compare ROC curves journal March 2011
DRREP: deep ridge regressed epitope predictor journal October 2017
HECT and RING finger families of E3 ubiquitin ligases at a glance journal February 2012
Non-canonical ubiquitin-based signals for proteasomal degradation journal February 2012
Prediction of multi-drug resistance transporters using a novel sequence analysis method journal January 2015
ElaD, a Deubiquitinating Protease Expressed by E. coli journal April 2007
Predicting human protein subcellular localization by heterogeneous and comprehensive approaches journal June 2017
Accurate Prediction of Secreted Substrates and Identification of a Conserved Putative Secretion Signal for Type III Secretion Systems journal April 2009
Sequence-Based Prediction of Type III Secreted Proteins journal April 2009
Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach journal July 2009
NleG Type 3 Effectors from Enterohaemorrhagic Escherichia coli Are U-Box E3 Ubiquitin Ligases journal June 2010
Functional Analysis of NopM, a Novel E3 Ubiquitin Ligase (NEL) Domain Effector of Rhizobium sp. Strain NGR234 journal May 2012
Five Mechanisms of Manipulation by Bacterial Effectors: A Ubiquitous Theme journal August 2012
RavN is a member of a previously unrecognized group of Legionella pneumophila E3 ubiquitin ligases journal February 2018
Exploitation of Host Polyubiquitination Machinery through Molecular Mimicry by Eukaryotic-Like Bacterial F-Box Effectors journal January 2010
Identification of DNA-Binding Proteins Using Mixed Feature Representation Methods journal September 2017

Similar Records

Structural Basis for DNA Recognition by the Two-Component Response Regulator RcsB
Journal Article · Tue Mar 06 19:00:00 EST 2018 · mBio · OSTI ID:1493948

Identification of an unconventional E3 binding surface on the UbcH5 Ub conjugate recognized by a pathogenic bacterial E3 ligase.
Journal Article · Sun Jan 31 23:00:00 EST 2010 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:974925