Prior knowledge driven Granger causality analysis on gene regulatory network discovery
Abstract
Our study focuses on discovering gene regulatory networks from time series gene expression data using the Granger causality (GC) model. However, the number of available time points (T) usually is much smaller than the number of target genes (n) in biological datasets. The widely applied pairwise GC model (PGC) and other regularization strategies can lead to a significant number of false identifications when n>>T. In this study, we proposed a new method, viz., CGC-2SPR (CGC using two-step prior Ridge regularization) to resolve the problem by incorporating prior biological knowledge about a target gene data set. In our simulation experiments, the propose new methodology CGC-2SPR showed significant performance improvement in terms of accuracy over other widely used GC modeling (PGC, Ridge and Lasso) and MI-based (MRNET and ARACNE) methods. In addition, we applied CGC-2SPR to a real biological dataset, i.e., the yeast metabolic cycle, and discovered more true positive edges with CGC-2SPR than with the other existing methods. In our research, we noticed a “ 1+1>2” effect when we combined prior knowledge and gene expression data to discover regulatory networks. Based on causality networks, we made a functional prediction that the Abm1 gene (its functions previously were unknown) might be relatedmore »
- Authors:
-
- Stony Brook Univ., NY (United States); Brookhaven National Lab. (BNL), Upton, NY (United States)
- Brookhaven National Lab. (BNL), Upton, NY (United States)
- Publication Date:
- Research Org.:
- Brookhaven National Laboratory (BNL), Upton, NY (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1259268
- Resource Type:
- Accepted Manuscript
- Journal Name:
- BMC Bioinformatics
- Additional Journal Information:
- Journal Volume: 16; Journal Issue: 1; Journal ID: ISSN 1471-2105
- Publisher:
- BioMed Central
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 60 APPLIED LIFE SCIENCES; Time series; Gene expression data; Granger causality; Gene regulatory networks
Citation Formats
Yao, Shun, Yoo, Shinjae, and Yu, Dantong. Prior knowledge driven Granger causality analysis on gene regulatory network discovery. United States: N. p., 2015.
Web. doi:10.1186/s12859-015-0710-1.
Yao, Shun, Yoo, Shinjae, & Yu, Dantong. Prior knowledge driven Granger causality analysis on gene regulatory network discovery. United States. https://doi.org/10.1186/s12859-015-0710-1
Yao, Shun, Yoo, Shinjae, and Yu, Dantong. Fri .
"Prior knowledge driven Granger causality analysis on gene regulatory network discovery". United States. https://doi.org/10.1186/s12859-015-0710-1. https://www.osti.gov/servlets/purl/1259268.
@article{osti_1259268,
title = {Prior knowledge driven Granger causality analysis on gene regulatory network discovery},
author = {Yao, Shun and Yoo, Shinjae and Yu, Dantong},
abstractNote = {Our study focuses on discovering gene regulatory networks from time series gene expression data using the Granger causality (GC) model. However, the number of available time points (T) usually is much smaller than the number of target genes (n) in biological datasets. The widely applied pairwise GC model (PGC) and other regularization strategies can lead to a significant number of false identifications when n>>T. In this study, we proposed a new method, viz., CGC-2SPR (CGC using two-step prior Ridge regularization) to resolve the problem by incorporating prior biological knowledge about a target gene data set. In our simulation experiments, the propose new methodology CGC-2SPR showed significant performance improvement in terms of accuracy over other widely used GC modeling (PGC, Ridge and Lasso) and MI-based (MRNET and ARACNE) methods. In addition, we applied CGC-2SPR to a real biological dataset, i.e., the yeast metabolic cycle, and discovered more true positive edges with CGC-2SPR than with the other existing methods. In our research, we noticed a “ 1+1>2” effect when we combined prior knowledge and gene expression data to discover regulatory networks. Based on causality networks, we made a functional prediction that the Abm1 gene (its functions previously were unknown) might be related to the yeast’s responses to different levels of glucose. In conclusion, our research improves causality modeling by combining heterogeneous knowledge, which is well aligned with the future direction in system biology. Furthermore, we proposed a method of Monte Carlo significance estimation (MCSE) to calculate the edge significances which provide statistical meanings to the discovered causality networks. All of our data and source codes will be available under the link https://bitbucket.org/dtyu/granger-causality/wiki/Home.},
doi = {10.1186/s12859-015-0710-1},
journal = {BMC Bioinformatics},
number = 1,
volume = 16,
place = {United States},
year = {Fri Aug 28 00:00:00 EDT 2015},
month = {Fri Aug 28 00:00:00 EDT 2015}
}
Web of Science
Works referenced in this record:
A decade’s perspective on DNA sequencing technology
journal, February 2011
- Mardis, Elaine R.
- Nature, Vol. 470, Issue 7333, p. 198-203
Bioinformatics challenges of new sequencing technology
journal, March 2008
- Pop, Mihai; Salzberg, Steven L.
- Trends in Genetics, Vol. 24, Issue 3
Gene regulatory network inference: Data integration in dynamic models—A review
journal, April 2009
- Hecker, Michael; Lambeck, Sandro; Toepfer, Susanne
- Biosystems, Vol. 96, Issue 1
NCBI GEO: archive for functional genomics data sets—update
journal, November 2012
- Barrett, Tanya; Wilhite, Stephen E.; Ledoux, Pierre
- Nucleic Acids Research, Vol. 41, Issue D1
Inferring gene regulatory networks from time series data using the minimum description length principle
journal, July 2006
- Zhao, W.; Serpedin, E.; Dougherty, E. R.
- Bioinformatics, Vol. 22, Issue 17
Boolean network inference from time series data incorporating prior biological knowledge
journal, January 2012
- Haider, Saad; Pal, Ranadip
- BMC Genomics, Vol. 13, Issue Suppl 6
Dynamic Bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data
journal, July 2004
- Kim, Sunyong; Imoto, Seiya; Miyano, Satoru
- Biosystems, Vol. 75, Issue 1-3
A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data
journal, August 2004
- Zou, M.; Conzen, S. D.
- Bioinformatics, Vol. 21, Issue 1
Characterizing Dynamic Changes in the Human Blood Transcriptional Network
journal, February 2010
- Zhu, Jun; Chen, Yanqing; Leonardson, Amy S.
- PLoS Computational Biology, Vol. 6, Issue 2
Granger causality vs. dynamic Bayesian network inference: a comparative study
journal, April 2009
- Zou, Cunlu; Feng, Jianfeng
- BMC Bioinformatics, Vol. 10, Issue 1
Fast Bayesian inference for gene regulatory networks using ScanBMA
journal, January 2014
- Young, William; Raftery, Adrian E.; Yeung, Ka
- BMC Systems Biology, Vol. 8, Issue 1
Reverse engineering of regulatory networks in human B cells
journal, March 2005
- Basso, Katia; Margolin, Adam A.; Stolovitzky, Gustavo
- Nature Genetics, Vol. 37, Issue 4
TimeDelay-ARACNE: Reverse engineering of gene networks from time-course data by an information theoretic approach
journal, January 2010
- Zoppoli, Pietro; Morganella, Sandro; Ceccarelli, Michele
- BMC Bioinformatics, Vol. 11, Issue 1
Inference of gene regulatory networks from time series by Tsallis entropy
journal, May 2011
- Lopes, Fabrício Martins; de Oliveira, Evaldo A.; Cesar, Roberto M.
- BMC Systems Biology, Vol. 5, Issue 1
Experimental assessment of static and dynamic algorithms for gene regulation inference from time series expression data
journal, January 2013
- Lopes, Miguel; Bontempi, Gianluca
- Frontiers in Genetics, Vol. 4
Investigating Causal Relations by Econometric Models and Cross-spectral Methods
journal, August 1969
- Granger, C. W. J.
- Econometrica, Vol. 37, Issue 3
Testing for causality
journal, January 1980
- Granger, C. W. J.
- Journal of Economic Dynamics and Control, Vol. 2
Causality and pathway search in microarray time series experiment
journal, December 2006
- Mukhopadhyay, N. D.; Chatterjee, S.
- Bioinformatics, Vol. 23, Issue 4
Granger Causality Analysis of Human Cell-Cycle Gene Expression Profiles
journal, January 2010
- Nagarajan, Radhakrishnan; Upreti, Meenakshi
- Statistical Applications in Genetics and Molecular Biology, Vol. 9, Issue 1
Application of Granger causality to gene regulatory network discovery
conference, August 2012
- Tam, Gary Hak Fui; Chang, Chunqi; Hung, Yeung Sam
- 2012 IEEE 6th International Conference on Systems Biology (ISB)
Grouped graphical Granger modeling for gene expression regulatory networks discovery
journal, May 2009
- Lozano, A. C.; Abe, N.; Liu, Y.
- Bioinformatics, Vol. 25, Issue 12
Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
journal, February 2007
- Geier, Florian; Timmer, Jens; Fleck, Christian
- BMC Systems Biology, Vol. 1, Issue 1
Identification of Genes Periodically Expressed in the Human Cell Cycle and Their Expression in Tumors
journal, June 2002
- Whitfield, Michael L.; Sherlock, Gavin; Saldanha, Alok J.
- Molecular Biology of the Cell, Vol. 13, Issue 6
Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes
journal, November 2005
- Tu, B. P.
- Science, Vol. 310, Issue 5751
Ridge Regression: Biased Estimation for Nonorthogonal Problems
journal, February 1970
- Hoerl, Arthur E.; Kennard, Robert W.
- Technometrics, Vol. 12, Issue 1
Regression Shrinkage and Selection Via the Lasso
journal, January 1996
- Tibshirani, Robert
- Journal of the Royal Statistical Society: Series B (Methodological), Vol. 58, Issue 1
The lasso problem and uniqueness
journal, January 2013
- Tibshirani, Ryan J.
- Electronic Journal of Statistics, Vol. 7, Issue 0
Regularization Paths for Generalized Linear Models via Coordinate Descent
journal, January 2010
- Friedman, Jerome; Hastie, Trevor; Tibshirani, Robert
- Journal of Statistical Software, Vol. 33, Issue 1
Comparing genomes to computer operating systems in terms of the topology and evolution of their regulatory control networks
journal, May 2010
- Yan, K. -K.; Fang, G.; Bhardwaj, N.
- Proceedings of the National Academy of Sciences, Vol. 107, Issue 20
Construction and Analysis of an Integrated Regulatory Network Derived from High-Throughput Sequencing Data
journal, November 2011
- Cheng, Chao; Yan, Koon-Kiu; Hwang, Woochang
- PLoS Computational Biology, Vol. 7, Issue 11
Architecture of the human regulatory network derived from ENCODE data
journal, September 2012
- Gerstein, Mark B.; Kundaje, Anshul; Hariharan, Manoj
- Nature, Vol. 489, Issue 7414
Regularization and variable selection via the elastic net
journal, April 2005
- Zou, Hui; Hastie, Trevor
- Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 67, Issue 2
A MATLAB toolbox for Granger causal connectivity analysis
journal, February 2010
- Seth, Anil K.
- Journal of Neuroscience Methods, Vol. 186, Issue 2
minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information
journal, October 2008
- Meyer, Patrick E.; Lafitte, Frédéric; Bontempi, Gianluca
- BMC Bioinformatics, Vol. 9, Issue 1
Reverse engineering gene networks using singular value decomposition and robust regression
journal, April 2002
- Yeung, M. K. S.; Tegner, J.; Collins, J. J.
- Proceedings of the National Academy of Sciences, Vol. 99, Issue 9
Singular value decomposition and least squares solutions
journal, April 1970
- Golub, G. H.; Reinsch, C.
- Numerische Mathematik, Vol. 14, Issue 5
Genetic reconstruction of a functional transcriptional regulatory network
journal, April 2007
- Hu, Zhanzhi; Killion, Patrick J.; Iyer, Vishwanath R.
- Nature Genetics, Vol. 39, Issue 5
An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae
journal, October 2007
- Lee, Insuk; Li, Zhihua; Marcotte, Edward M.
- PLoS ONE, Vol. 2, Issue 10
High-resolution DNA-binding specificity analysis of yeast transcription factors
journal, January 2009
- Zhu, C.; Byers, K. J. R. P.; McCord, R. P.
- Genome Research, Vol. 19, Issue 4
Cytoscape 2.8: new features for data integration and network visualization
journal, December 2010
- Smoot, M. E.; Ono, K.; Ruscheinski, J.
- Bioinformatics, Vol. 27, Issue 3
SGD: Saccharomyces Genome Database
journal, January 1998
- Cherry, J.
- Nucleic Acids Research, Vol. 26, Issue 1
The Shannon sampling theorem—Its various extensions and applications: A tutorial review
journal, January 1977
- Jerri, A. J.
- Proceedings of the IEEE, Vol. 65, Issue 11
The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle
journal, August 2006
- Pramila, T.
- Genes & Development, Vol. 20, Issue 16
Gene regulatory network inference: Data integration in dynamic models—A review
journal, April 2009
- Hecker, Michael; Lambeck, Sandro; Toepfer, Susanne
- Biosystems, Vol. 96, Issue 1
A MATLAB toolbox for Granger causal connectivity analysis
journal, February 2010
- Seth, Anil K.
- Journal of Neuroscience Methods, Vol. 186, Issue 2
Bioinformatics challenges of new sequencing technology
journal, March 2008
- Pop, Mihai; Salzberg, Steven L.
- Trends in Genetics, Vol. 24, Issue 3
A decade’s perspective on DNA sequencing technology
journal, February 2011
- Mardis, Elaine R.
- Nature, Vol. 470, Issue 7333, p. 198-203
Reverse engineering of regulatory networks in human B cells
journal, March 2005
- Basso, Katia; Margolin, Adam A.; Stolovitzky, Gustavo
- Nature Genetics, Vol. 37, Issue 4
Genetic reconstruction of a functional transcriptional regulatory network
journal, April 2007
- Hu, Zhanzhi; Killion, Patrick J.; Iyer, Vishwanath R.
- Nature Genetics, Vol. 39, Issue 5
Ontology-driven integrative analysis of omics data through Onassis
journal, January 2020
- Galeota, Eugenia; Kishore, Kamal; Pelizzola, Mattia
- Scientific Reports, Vol. 10, Issue 1
Challenges and opportunities for strain verification by whole-genome sequencing
journal, April 2020
- Gallegos, Jenna E.; Hayrynen, Sergei; Adames, Neil R.
- Scientific Reports, Vol. 10, Issue 1
Comparing genomes to computer operating systems in terms of the topology and evolution of their regulatory control networks
journal, May 2010
- Yan, K. -K.; Fang, G.; Bhardwaj, N.
- Proceedings of the National Academy of Sciences, Vol. 107, Issue 20
Reverse engineering gene networks using singular value decomposition and robust regression
journal, April 2002
- Yeung, M. K. S.; Tegner, J.; Collins, J. J.
- Proceedings of the National Academy of Sciences, Vol. 99, Issue 9
Ridge Regression: Biased Estimation for Nonorthogonal Problems
journal, February 1970
- Hoerl, Arthur E.; Kennard, Robert W.
- Technometrics, Vol. 12, Issue 1
A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data
journal, August 2004
- Zou, M.; Conzen, S. D.
- Bioinformatics, Vol. 21, Issue 1
Grouped graphical Granger modeling for gene expression regulatory networks discovery
journal, May 2009
- Lozano, A. C.; Abe, N.; Liu, Y.
- Bioinformatics, Vol. 25, Issue 12
Cytoscape 2.8: new features for data integration and network visualization
journal, December 2010
- Smoot, M. E.; Ono, K.; Ruscheinski, J.
- Bioinformatics, Vol. 27, Issue 3
NCBI GEO: archive for functional genomics data sets—update
journal, November 2012
- Barrett, Tanya; Wilhite, Stephen E.; Ledoux, Pierre
- Nucleic Acids Research, Vol. 41, Issue D1
Single-Cell Transcriptomic Atlas of the Human Endometrium During the Menstrual Cycle
journal, February 2022
- Wang, Wanxin; Vilella, Felipe; Alama, Pilar
- Obstetrical & Gynecological Survey, Vol. 77, Issue 2
The Forkhead transcription factor Hcm1 regulates chromosome segregation genes and fills the S-phase gap in the transcriptional circuitry of the cell cycle
journal, August 2006
- Pramila, T.
- Genes & Development, Vol. 20, Issue 16
High-resolution DNA-binding specificity analysis of yeast transcription factors
journal, January 2009
- Zhu, C.; Byers, K. J. R. P.; McCord, R. P.
- Genome Research, Vol. 19, Issue 4
Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes
journal, November 2005
- Tu, B. P.
- Science, Vol. 310, Issue 5751
Granger causality vs. dynamic Bayesian network inference: a comparative study
journal, April 2009
- Zou, Cunlu; Feng, Jianfeng
- BMC Bioinformatics, Vol. 10, Issue 1
TimeDelay-ARACNE: Reverse engineering of gene networks from time-course data by an information theoretic approach
journal, January 2010
- Zoppoli, Pietro; Morganella, Sandro; Ceccarelli, Michele
- BMC Bioinformatics, Vol. 11, Issue 1
minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information
journal, October 2008
- Meyer, Patrick E.; Lafitte, Frédéric; Bontempi, Gianluca
- BMC Bioinformatics, Vol. 9, Issue 1
Rootstock-regulated gene expression patterns associated with fire blight resistance in apple
journal, January 2012
- Jensen, Philip J.; Halbrendt, Noemi; Fazio, Gennaro
- BMC Genomics, Vol. 13, Issue 1
Fast Bayesian inference for gene regulatory networks using ScanBMA
journal, January 2014
- Young, William; Raftery, Adrian E.; Yeung, Ka
- BMC Systems Biology, Vol. 8, Issue 1
Construction and Analysis of an Integrated Regulatory Network Derived from High-Throughput Sequencing Data
journal, November 2011
- Cheng, Chao; Yan, Koon-Kiu; Hwang, Woochang
- PLoS Computational Biology, Vol. 7, Issue 11
An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae
journal, October 2007
- Lee, Insuk; Li, Zhihua; Marcotte, Edward M.
- PLoS ONE, Vol. 2, Issue 10
Regularization Paths for Generalized Linear Models via Coordinate Descent
journal, January 2010
- Friedman, Jerome; Hastie, Trevor; Tibshirani, Robert
- Journal of Statistical Software, Vol. 33, Issue 1
Investigating Causal Relations by Econometric Models and Cross-spectral Methods
journal, August 1969
- Granger, C. W. J.
- Econometrica, Vol. 37, Issue 3
Works referencing / citing this record:
Computational dynamic approaches for temporal omics data with applications to systems medicine
journal, June 2017
- Liang, Yulan; Kelemen, Arpad
- BioData Mining, Vol. 10, Issue 1
Computational dynamic approaches for temporal omics data with applications to systems medicine
journal, June 2017
- Liang, Yulan; Kelemen, Arpad
- BioData Mining, Vol. 10, Issue 1
Prophetic Granger Causality to infer gene regulatory networks
journal, December 2017
- Carlin, Daniel E.; Paull, Evan O.; Graim, Kiley
- PLOS ONE, Vol. 12, Issue 12