Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Deep graph representations embed network information for robust disease marker identification

Journal Article · · Bioinformatics
 [1];  [2];  [2]
  1. Texas A & M Univ., College Station, TX (United States)
  2. Texas A & M Univ., College Station, TX (United States); Brookhaven National Lab. (BNL), Upton, NY (United States)
We report that the accurate disease diagnosis and prognosis based on omics data rely on the effective identification of robust prognostic and diagnostic markers that reflect the states of the biological processes underlying the disease pathogenesis and progression. In this article, we present GCNCC, a Graph Convolutional Network-based approach for Clustering and Classification, that can identify highly effective and robust network-based disease markers. Based on a geometric deep learning framework, GCNCC learns deep network representations by integrating gene expression data with protein interaction data to identify highly reproducible markers with consistently accurate prediction performance across independent datasets possibly from different platforms. GCNCC identifies these markers by clustering the nodes in the protein interaction network based on latent similarity measures learned by the deep architecture of a graph convolutional network, followed by a supervised feature selection procedure that extracts clusters that are highly predictive of the disease state. By benchmarking GCNCC based on independent datasets from different diseases (psychiatric disorder and cancer) and different platforms (microarray and RNA-seq), we show that GCNCC outperforms other state-of-the-art methods in terms of accuracy and reproducibility.
Research Organization:
Brookhaven National Laboratory (BNL), Upton, NY (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0012704
OSTI ID:
1855099
Report Number(s):
BNL-222809-2022-JAAM
Journal Information:
Bioinformatics, Journal Name: Bioinformatics Journal Issue: 4 Vol. 38; ISSN 1367-4803
Publisher:
International Society for Computational Biology - Oxford University PressCopyright Statement
Country of Publication:
United States
Language:
English

References (31)

Genome-wide expression analysis reveals dyregulation of myelination-related genes in chronic schizophrenia journal May 2002
Techniques for clustering gene expression data journal March 2008
Network‐based classification of breast cancer metastasis journal January 2007
Dynamic regulatory network controlling TH17 cell differentiation journal March 2013
A molecular signature of metastasis in primary solid tumors journal December 2002
Identification of transcriptional regulators in the mouse immune system journal April 2013
Wisdom of crowds for robust gene network inference journal July 2012
Network-based stratification of tumor mutations journal September 2013
Advantages and limitations of current network inference methods journal August 2010
A comprehensive evaluation of module detection methods for gene expression data journal March 2018
Modularity and community structure in networks journal May 2006
A pathway-based classification of human breast cancer journal March 2010
Tissue Classification with Gene Expression Profiles journal August 2000
A comparative analysis of biclustering algorithms for gene expression data journal July 2012
Genesis: cluster analysis of microarray data journal January 2002
STRING v10: protein–protein interaction networks, integrated over the tree of life journal October 2014
Enrichr: a comprehensive gene set enrichment analysis web server 2016 update journal May 2016
DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants journal October 2016
Clustering by Passing Messages Between Data Points journal February 2007
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring journal October 1999
Shared molecular neuropathology across major psychiatric disorders parallels polygenic overlap journal February 2018
Autism as a Paradigmatic Complex Genetic Disorder journal September 2004
Identification of diagnostic subnetwork markers for cancer in human protein-protein interaction network journal October 2010
Incorporating topological information for predicting robust cancer subnetwork markers in human protein-protein interaction network journal October 2016
Inferring Pathway Activity toward Precise Disease Classification journal November 2008
Chapter 5: Network Biology Approach to Complex Diseases journal December 2012
Integrated Module and Gene-Specific Regulatory Inference Implicates Upstream Signaling Networks journal October 2013
Integrating Genome-Wide Genetic Variations and Monocyte Expression Data Reveals Trans-Regulated Gene Modules in Humans journal December 2011
Accurate and Reliable Cancer Classification Based on Probabilistic Inference of Pathway Activity journal December 2009
A General Framework for Weighted Gene Co-Expression Network Analysis journal January 2005
Editorial: Comorbidity and Autism Spectrum Disorder journal November 2020

Similar Records

Scaling Subspace-Driven Approaches Using Information Fusion
Book · Tue Jan 31 23:00:00 EST 2023 · OSTI ID:1972569

Multi-Edge Graph Convolutional Networks for Power Systems
Conference · Wed Jan 31 23:00:00 EST 2024 · OSTI ID:2372991

Related Subjects