skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides

Journal Article · · Scientific Reports
DOI:https://doi.org/10.1038/srep32115· OSTI ID:1378368
 [1];  [2];  [2];  [2];  [2];  [2]
  1. U.S. Food and Drug Administration, Jefferson, AR (United States); Univ. of Arkansas at Little Rock/Univ. of Arkansas for Medical Sciences Bioinformatics Graduate Program, Little Rock, AR (United States)
  2. U.S. Food and Drug Administration, Jefferson, AR (United States)

Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. Furthermore, this algorithm can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system.

Research Organization:
Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN (United States); U.S. Food and Drug Administration, Jefferson, AR (United States). National Center for Toxicological Research
Sponsoring Organization:
USDOE
OSTI ID:
1378368
Journal Information:
Scientific Reports, Vol. 6, Issue 1; ISSN 2045-2322
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 12 works
Citation information provided by
Web of Science

References (91)

ESSESA: An expert system for structure elucidation from spectra. 3. LNSCS for chemical knowledge representation journal January 1992
Assessing Drug Target Association Using Semantic Linked Data journal July 2012
The MHC, disease and selection journal June 2011
The IMGT/HLA database journal October 2012
NetMHCIIpan-3.0, a common pan-specific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ journal July 2013
Immune self-reactivity triggered by drug-modified HLA-peptide repertoire journal May 2012
A Review on Missing Value Imputation Algorithms for Microarray Gene Expression Data journal January 2014
Estrogenic Activity Data Extraction and in Silico Prediction Show the Endocrine Disruption Potential of Bisphenol A Replacement Compounds journal September 2015
Spec2D:  A Structure Elucidation System Based on 1 H NMR and H−H COSY Spectra in Organic Chemistry journal March 2006
Toward more accurate pan-specific MHC-peptide binding prediction: a review of current methods and tools journal September 2011
Predicting Class II MHC-Peptide binding: a kernel based approach using similarity scores journal November 2006
An in silico ensemble method for lead discovery: decision forest journal August 2005
Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes journal August 2015
A Computational Approach for Designing a Universal Epitope-Based Peptide Vaccine Against Nipah Virus journal June 2015
ESSESA, an expert system for structure elucidation from spectral analysis journal June 1992
Assessing QSAR Limitations - A Regulatory Perspective journal April 2005
ESSESA: An Expert System for Structure Elucidation from Spectra. 4. Canonical Representation of Structures journal July 1994
Molecular Docking to Identify Associations Between Drugs and Class I Human Leukocyte Antigens for Predicting Idiosyncratic Drug Reactions journal April 2015
Adverse Drug Events: Database Construction and in Silico Prediction journal April 2013
Cheetah Paradigm Revisited: MHC Diversity in the World's Largest Free-Ranging Population journal December 2010
Molecular Dynamics Simulation Reveals the Selective Binding of Human Leukocyte Antigen Alleles Associated with Behçet's Disease journal September 2015
Identification of a naturally processed HLA-Cw7-binding peptide that cross-reacts with HLA-A24-restricted ovarian cancer-specific CTLs: Alloreactivity of tumor-restricted CTLs journal July 2015
MHC peptides and the sensory evaluation of genotype journal February 2006
Biomarker-based drug safety assessment in the age of systems pharmacology: from foundational to regulatory science journal November 2015
NetMHCpan, a method for MHC class I binding prediction beyond humans journal November 2008
ESSESA: An Expert System for Structure Elucidation from Spectra. 5. Substructure Constraints from Analysis of First-Order 1H-NMR Spectra journal November 1994
Mold 2 , Molecular Descriptors from 2D Structures for Chemoinformatics and Toxicoinformatics journal June 2008
MissForest--non-parametric missing value imputation for mixed-type data journal October 2011
Human leukocyte antigen-associated drug hypersensitivity journal February 2013
Development and Validation of Decision Forest Model for Estrogen Receptor Binding Prediction of Chemicals Using Large Data Sets journal November 2015
Gapped sequence alignment using artificial neural networks: application to the MHC class I system journal October 2015
Permutation test for incomplete paired data with application to cDNA microarray data journal March 2012
Efficient peptide–MHC-I binding prediction for alleles with few known binders journal December 2007
Missing data imputation using statistical and machine learning methods in a real breast cancer problem journal October 2010
Prediction of human genes and diseases targeted by xenobiotics using predictive toxicogenomic-derived models (PTDMs) journal January 2013
Prediction of Polypharmacological Profiles of Drugs by the Integration of Chemical, Side Effect, and Therapeutic Space journal April 2013
Decision Forest:  Combining the Predictions of Multiple Independent Decision Tree Models journal February 2003
The Accurate Prediction of Protein Family from Amino Acid Sequence by Measuring Features of Sequence Fragments journal December 2009
Missing value imputation for gene expression data: computational techniques to recover missing data from available information journal December 2010
Phenotyping of human complement component C4, a class-III HLA antigen journal November 1986
Prediction of Chemical-Protein Interactions Network with Weighted Network-Based Inference Method journal July 2012
EADB: An Estrogenic Activity Database for Assessing Potential Endocrine Activity journal July 2013
Intrinsic and cooperative antigen-presenting functions of dendritic-cell subsets in vivo journal July 2007
Immunodominance in Major Histocompatibility Complex Class I–Restricted t Lymphocyte Responses journal April 1999
Applying network analysis and Nebula (neighbor-edges based and unbiased leverage algorithm) to ToxCast data journal April 2016
SYFPEITHI: database for MHC ligands and peptide motifs journal November 1999
ESSESA: an expert system for elucidation of structures from spectra. 1. Knowledge base of infrared spectra and analysis and interpretation programs journal August 1990
Prediction of estrogen receptor binding for 58,000 chemicals using an integrated system of a tree-based model with structural alerts. journal January 2002
Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method journal January 2005
Physical association between the CD8 and HLA class I molecules on the surface of activated human T lymphocytes. journal June 1988
PREDICT: a method for inferring novel drug indications with application to personalized medicine journal January 2011
Comparative molecular field analysis (CoMFA) model using a large diverse set of natural, synthetic and environmental chemicals for binding to the androgen receptor journal October 2003
Population Biology of Antigen Presentation by MHC Class I Molecules journal April 1996
Using Decision Forest to Classify Prostate Cancer Samples on the Basis of SELDI-TOF MS Data: Assessing Chance Correlation and Prediction Confidence journal August 2004
The Immune Epitope Database 2.0 journal November 2009
Versatility or Promiscuity: The Estrogen Receptors, Control of Ligand Selectivity and an Update on Subtype Selective Ligands journal August 2014
Consensus analysis of multiple classifiers using non-repetitive variables: Diagnostic application to microarray gene expression data journal February 2007
SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug–target interactions and drug repositioning journal March 2016
Predicting immunogenic tumour mutations by combining mass spectrometry and exome sequencing journal November 2014
Computational prediction of microRNA networks incorporating environmental toxicity and disease etiology journal July 2014
Rat α-Fetoprotein Binding Affinities of a Large Set of Structurally Diverse Chemicals Elucidated the Relationships between Structures and Binding Affinities journal August 2012
MHCBN 4.0: A database of MHC/TAP binding peptides and T-cell epitopes journal January 2009
A double amino-acid change in the HLA-A peptide-binding groove is associated with response to psychotropic treatment in patients with schizophrenia journal July 2015
Amino acid substitution matrices from protein blocks. journal November 1992
Modeling and Optimization for Big Data Analytics: (Statistical) learning tools for our era of data deluge journal September 2014
Improved peptide vaccine strategies, creating synthetic artificial infections to maximize immune efficacy journal October 2006
A roadmap for HLA-A, HLA-B, and HLA-C peptide binding specificities journal November 1996
NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction journal September 2009
Multiclass Decision Forest—A Novel Pattern Recognition Method for Multiclass Classification in Microarray Data Analysis journal October 2004
AntiJen: a quantitative immunology database integrating functional, thermodynamic, kinetic, biophysical, and cellular data journal January 2005
Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers journal April 2008
Crystal structure of the human class II MHC protein HLA-DR1 complexed with an influenza virus peptide journal March 1994
Nomenclature for factors of the HLA system, 2004 journal April 2005
Neoantigens in cancer immunotherapy journal April 2015
Human Sex Hormone-Binding Globulin Binding Affinities of 125 Structurally Diverse Chemicals and Comparison with Their Binding to Androgen Receptor, Estrogen Receptor, and α-Fetoprotein journal October 2014
Predicting Hepatotoxicity Using ToxCast in Vitro Bioactivity and Chemical Structure journal March 2015
Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference journal May 2012
Specificity of T-cell alloreactivity journal December 2007
NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data journal November 2011
HLA Class II peptide-binding and autoimmunity journal February 2002
MHC class II-dependent activation of CD4 + T cell hybridomas by human mast cells through superantigen presentation journal July 1999
Using Decision Forest to Classify Prostate Cancer Samples on the Basis of SELDI-TOF MS Data: Assessing Chance Correlation and Prediction Confidence journal November 2004
The IMGT/HLA database journal November 2010
Nomenclature for factors of the HLA system, 2004 journal April 2005
The Nature of Selection on the Major Histocompatibility Complex journal January 2017
Nomenclature for Factors of the HLA System, 2004 journal May 2005
Refined structure of the human histocompatibility antigen HLA-A2 at 2.6 Å resolution journal May 1991
Minimizing the immunogenicity of protein therapeutics journal January 2004
Longitudinal molecular trajectories of diffuse glioma in adults journal November 2019
Peopling the Americas journal August 1996
The Nature of Selection on the Major Histocompatibility Complex journal January 1997

Cited By (5)

DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction journal January 2019
Machine learning and molecular design of self-assembling -conjugated oligopeptides journal April 2018
DeepSeqPan, a novel deep convolutional neural network model for pan-specific class I HLA-peptide binding affinity prediction journal July 2018
TSNAdb: A Database for Tumor-specific Neoantigens from Immunogenomics Data Analysis journal August 2018
In silico design of MHC class I high binding affinity peptides through motifs activation map journal December 2018