Sparse multitask regression for identifying common mechanism of response to therapeutic targets
Abstract
Motivation: Molecular association of phenotypic responses is an important step in hypothesis generation and for initiating design of new experiments. Current practices for associating gene expression data with multidimensional phenotypic data are typically (i) performed one-to-one, i.e. each gene is examined independently with a phenotypic index and (ii) tested with one stress condition at a time, i.e. different perturbations are analyzed separately. As a result, the complex coordination among the genes responsible for a phenotypic profile is potentially lost. More importantly, univariate analysis can potentially hide new insights into common mechanism of response. Results: In this article, we propose a sparse, multitask regression model together with co-clustering analysis to explore the intrinsic grouping in associating the gene expression with phenotypic signatures. The global structure of association is captured by learning an intrinsic template that is shared among experimental conditions, with local perturbations introduced to integrate effects of therapeutic agents. We demonstrate the performance of our approach on both synthetic and experimental data. Synthetic data reveal that the multitask regression has a superior reduction in the regression error when compared with traditional L1-and L2-regularized regression. On the other hand, experiments with cell cycle inhibitors over a panel of 14 breast cancermore »
- Authors:
-
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Life Sciences Division
- Publication Date:
- Research Org.:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division
- OSTI Identifier:
- 1625267
- Grant/Contract Number:
- AC02-05CH11231
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Bioinformatics
- Additional Journal Information:
- Journal Volume: 26; Journal Issue: 12; Journal ID: ISSN 1367-4803
- Publisher:
- Oxford University Press
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY; 97 MATHEMATICS AND COMPUTING; Biochemistry & Molecular Biology; Biotechnology & Applied Microbiology; Computer Science; Mathematical & Computational Biology; Mathematics
Citation Formats
Zhang, K., Gray, J. W., and Parvin, B. Sparse multitask regression for identifying common mechanism of response to therapeutic targets. United States: N. p., 2010.
Web. doi:10.1093/bioinformatics/btq181.
Zhang, K., Gray, J. W., & Parvin, B. Sparse multitask regression for identifying common mechanism of response to therapeutic targets. United States. https://doi.org/10.1093/bioinformatics/btq181
Zhang, K., Gray, J. W., and Parvin, B. Sun .
"Sparse multitask regression for identifying common mechanism of response to therapeutic targets". United States. https://doi.org/10.1093/bioinformatics/btq181. https://www.osti.gov/servlets/purl/1625267.
@article{osti_1625267,
title = {Sparse multitask regression for identifying common mechanism of response to therapeutic targets},
author = {Zhang, K. and Gray, J. W. and Parvin, B.},
abstractNote = {Motivation: Molecular association of phenotypic responses is an important step in hypothesis generation and for initiating design of new experiments. Current practices for associating gene expression data with multidimensional phenotypic data are typically (i) performed one-to-one, i.e. each gene is examined independently with a phenotypic index and (ii) tested with one stress condition at a time, i.e. different perturbations are analyzed separately. As a result, the complex coordination among the genes responsible for a phenotypic profile is potentially lost. More importantly, univariate analysis can potentially hide new insights into common mechanism of response. Results: In this article, we propose a sparse, multitask regression model together with co-clustering analysis to explore the intrinsic grouping in associating the gene expression with phenotypic signatures. The global structure of association is captured by learning an intrinsic template that is shared among experimental conditions, with local perturbations introduced to integrate effects of therapeutic agents. We demonstrate the performance of our approach on both synthetic and experimental data. Synthetic data reveal that the multitask regression has a superior reduction in the regression error when compared with traditional L1-and L2-regularized regression. On the other hand, experiments with cell cycle inhibitors over a panel of 14 breast cancer cell lines demonstrate the relevance of the computed molecular predictors with the cell cycle machinery, as well as the identification of hidden variables that are not captured by the baseline regression analysis. Accordingly, the system has identified CLCA2 as a hidden transcript and as a common mechanism of response for two therapeutic agents of CI-1040 and Iressa, which are currently in clinical use.},
doi = {10.1093/bioinformatics/btq181},
journal = {Bioinformatics},
number = 12,
volume = 26,
place = {United States},
year = {Sun Jun 06 00:00:00 EDT 2010},
month = {Sun Jun 06 00:00:00 EDT 2010}
}
Works referenced in this record:
Unsupervised feature selection via two-way ordering in gene expression analysis
journal, July 2003
- Ding, C. H. Q.
- Bioinformatics, Vol. 19, Issue 10
An Interior-Point Method for Large-Scale -Regularized Least Squares
journal, December 2007
- Kim, Seung-Jean; Koh, K.; Lustig, M.
- IEEE Journal of Selected Topics in Signal Processing, Vol. 1, Issue 4
Discovering statistically significant biclusters in gene expression data
journal, July 2002
- Tanay, A.; Sharan, R.; Shamir, R.
- Bioinformatics, Vol. 18, Issue Suppl 1
Geometric approach to segmentation and protein localization in cell culture assays
journal, January 2007
- Raman, S.; Maxwell, C. A.; Barcellos-Hoff, M. H.
- Journal of Microscopy, Vol. 225, Issue 1
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network
journal, May 2001
- Ideker, T.
- Science, Vol. 292, Issue 5518
Unsupervised feature selection via two-way ordering in gene expression analysis
journal, July 2003
- Ding, C. H. Q.
- Bioinformatics, Vol. 19, Issue 10
Geometric approach to segmentation and protein localization in cell culture assays
journal, January 2007
- Raman, S.; Maxwell, C. A.; Barcellos-Hoff, M. H.
- Journal of Microscopy, Vol. 225, Issue 1
CLCA2 tumour suppressor gene in 1p31 is epigenetically regulated in breast cancer
journal, February 2004
- Li, Xiurong; Cowell, John K.; Sossey-Alaoui, Khalid
- Oncogene, Vol. 23, Issue 7
CLCA2 tumour suppressor gene in 1p31 is epigenetically regulated in breast cancer
journal, February 2004
- Li, Xiurong; Cowell, John K.; Sossey-Alaoui, Khalid
- Oncogene, Vol. 23, Issue 7
Abrogated Response to Cellular Stress Identifies DCIS Associated with Subsequent Tumor Events and Defines Basal-like Breast Tumors
journal, November 2007
- Gauthier, Mona L.; Berman, Hal K.; Miller, Caroline
- Cancer Cell, Vol. 12, Issue 5
Direct Clustering of a Data Matrix
journal, March 1972
- Hartigan, J. A.
- Journal of the American Statistical Association, Vol. 67, Issue 337
Spectral Biclustering of Microarray Data: Coclustering Genes and Conditions
journal, April 2003
- Kluger, Y.
- Genome Research, Vol. 13, Issue 4
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network
journal, May 2001
- Ideker, T.
- Science, Vol. 292, Issue 5518
Compressed sensing
journal, April 2006
- Donoho, D. L.
- IEEE Transactions on Information Theory, Vol. 52, Issue 4
Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data
journal, April 2005
- Sachs, K.
- Science, Vol. 308, Issue 5721
A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes
journal, December 2006
- Neve, Richard M.; Chin, Koei; Fridlyand, Jane
- Cancer Cell, Vol. 10, Issue 6
Structure and Transcriptional Regulation of the Human Cystatin A Gene
journal, July 1998
- Takahashi, Hidetoshi; Asano, Kazuhiro; Kinouchi, Motoshi
- Journal of Biological Chemistry, Vol. 273, Issue 28
PAN1/NALP2/PYPAF2, an Inducible Inflammatory Mediator That Regulates NF-κB and Caspase-1 Activation in Macrophages
journal, December 2004
- Bruey, Jean Marie; Bruey-Sedano, Nathalie; Newman, Ruchi
- Journal of Biological Chemistry, Vol. 279, Issue 50
Atomic Decomposition by Basis Pursuit
journal, January 2001
- Chen, Scott Shaobing; Donoho, David L.; Saunders, Michael A.
- SIAM Review, Vol. 43, Issue 1
Molecular Predictors of 3D Morphogenesis by Breast Cancer Cell Lines in 3D Culture
journal, February 2010
- Han, Ju; Chang, Hang; Giricz, Orsi
- PLoS Computational Biology, Vol. 6, Issue 2
Reverse engineering gene networks: Integrating genetic perturbations with dynamical modeling
journal, May 2003
- Tegner, J.; Yeung, M. K. S.; Hasty, J.
- Proceedings of the National Academy of Sciences, Vol. 100, Issue 10
Direct Clustering of a Data Matrix
journal, March 1972
- Hartigan, J. A.
- Journal of the American Statistical Association, Vol. 67, Issue 337
Response projected clustering for direct association with physiological and clinical response data
journal, January 2008
- Yi, Sung-Gon; Park, Taesung; Lee, Jae K.
- BMC Bioinformatics, Vol. 9, Issue 1
hCLCA2 Is a p53-Inducible Inhibitor of Breast Cancer Cell Proliferation
journal, August 2009
- Walia, Vijay; Ding, Ming; Kumar, Sumit
- Cancer Research, Vol. 69, Issue 16
Response projected clustering for direct association with physiological and clinical response data
journal, January 2008
- Yi, Sung-Gon; Park, Taesung; Lee, Jae K.
- BMC Bioinformatics, Vol. 9, Issue 1
Gene-based approach to human gene-phenotype correlations
journal, October 1997
- Dryja, T. P.
- Proceedings of the National Academy of Sciences, Vol. 94, Issue 22
A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes
journal, December 2006
- Neve, Richard M.; Chin, Koei; Fridlyand, Jane
- Cancer Cell, Vol. 10, Issue 6
Multidimensional Profiling of Cell Surface Proteins and Nuclear Markers
journal, January 2010
- Han, Ju; Chang, Hang; Andarawewa, K.
- IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 7, Issue 1, p. 80-90
Spectral Biclustering of Microarray Data: Coclustering Genes and Conditions
journal, April 2003
- Kluger, Y.
- Genome Research, Vol. 13, Issue 4
Co-clustering documents and words using bipartite spectral graph partitioning
conference, January 2001
- Dhillon, Inderjit S.
- Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '01
Direct Clustering of a Data Matrix
journal, March 1972
- Hartigan, J. A.
- Journal of the American Statistical Association, Vol. 67, Issue 337
Learning a meta-level prior for feature relevance from multiple related tasks
conference, June 2007
- Lee, Su-In; Chatalbashev, Vassil; Vickrey, David
- Proceedings of the 24th international conference on Machine learning
Probabilistic Joint Feature Selection for Multi-task Learning
conference, April 2007
- Xiong, Tao; Bi, Jinbo; Rao, Bharat
- Proceedings of the 2007 SIAM International Conference on Data Mining
Works referencing / citing this record:
Multi-task diagnosis for autism spectrum disorders using multi-modality features: A multi-center study: Multi-Modality Multi-Center Diagnosis for ASD
journal, March 2017
- Wang, Jun; Wang, Qian; Peng, Jialin
- Human Brain Mapping, Vol. 38, Issue 6
Using multitask classification methods to investigate the kinase-specific phosphorylation sites
journal, June 2012
- Gao, Shan; Xu, Shuo; Fang, Yaping
- Proteome Science, Vol. 10, Issue S1
Pattern Classification of Large-Scale Functional Brain Networks: Identification of Informative Neuroimaging Markers for Epilepsy
journal, May 2012
- Zhang, Jie; Cheng, Wei; Wang, ZhengGe
- PLoS ONE, Vol. 7, Issue 5
A Survey on Multi-Task Learning
journal, January 2021
- Zhang, Yu; Yang, Qiang
- IEEE Transactions on Knowledge and Data Engineering
Deep multi-task learning for individuals origin–destination matrices estimation from census data
journal, November 2019
- Katranji, Mehdi; Kraiem, Sami; Moalic, Laurent
- Data Mining and Knowledge Discovery, Vol. 34, Issue 1
Integrative analysis of multiple diverse omics datasets by sparse group multitask regression
journal, October 2014
- Lin, Dongdong; Zhang, Jigang; Li, Jingyao
- Frontiers in Cell and Developmental Biology, Vol. 2
An ensemble method approach to investigate kinase-specific phosphorylation sites
journal, May 2014
- Datta, Sutapa; Mukhopadhyay, Subhasis
- International Journal of Nanomedicine
An overview of multi-task learning
journal, September 2017
- Zhang, Yu; Yang, Qiang
- National Science Review, Vol. 5, Issue 1
Pattern Classification of Large-Scale Functional Brain Networks: Identification of Informative Neuroimaging Markers for Epilepsy
journal, May 2012
- Zhang, Jie; Cheng, Wei; Wang, ZhengGe
- PLoS ONE, Vol. 7, Issue 5