MAVTgsa: An R Package for Gene Set (Enrichment) Analysis
Abstract
Gene set analysis methods aim to determine whether an a priori defined set of genes shows statistically significant difference in expression on either categorical or continuous outcomes. Although many methods for gene set analysis have been proposed, a systematic analysis tool for identification of different types of gene set significance modules has not been developed previously. This work presents an R package, called MAVTgsa, which includes three different methods for integrated gene set enrichment analysis. (1) The one-sided OLS (ordinary least squares) test detects coordinated changes of genes in gene set in one direction, either up- or downregulation. (2) The two-sided MANOVA (multivariate analysis variance) detects changes both up- and downregulation for studying two or more experimental conditions. (3) A random forests-based procedure is to identify gene sets that can accurately predict samples from different experimental conditions or are associated with the continuous phenotypes. MAVTgsa computes the values and FDR (false discovery rate) -value for all gene sets in the study. Furthermore, MAVTgsa provides several visualization outputs to support and interpret the enrichment results. This package is available online.
- Authors:
-
- Community Medicine Research Center, Keelung Chang Gung Memorial Hospital, No. 200, Lane 208, Jijinyi Road, Anle District, Keelung 204, Taiwan
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, 3900 NCTR Road, HFT-20, Jefferson, AR 72079, USA
- Department of Agronomy, National Taiwan University, No. 1, Section 4, Roosevelt Road, Taipei 106, Taiwan
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, 3900 NCTR Road, HFT-20, Jefferson, AR 72079, USA, Graduate Institute of Biostatistics and Biostatistics Center, China Medical University, Taichung, Taiwan
- Publication Date:
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1228276
- Resource Type:
- Published Article
- Journal Name:
- BioMed Research International
- Additional Journal Information:
- Journal Name: BioMed Research International Journal Volume: 2014; Journal ID: ISSN 2314-6133
- Publisher:
- Hindawi Publishing Corporation
- Country of Publication:
- Egypt
- Language:
- English
Citation Formats
Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, and Chen, James J. MAVTgsa: An R Package for Gene Set (Enrichment) Analysis. Egypt: N. p., 2014.
Web. doi:10.1155/2014/346074.
Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, & Chen, James J. MAVTgsa: An R Package for Gene Set (Enrichment) Analysis. Egypt. https://doi.org/10.1155/2014/346074
Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, and Chen, James J. Wed .
"MAVTgsa: An R Package for Gene Set (Enrichment) Analysis". Egypt. https://doi.org/10.1155/2014/346074.
@article{osti_1228276,
title = {MAVTgsa: An R Package for Gene Set (Enrichment) Analysis},
author = {Chien, Chih-Yi and Chang, Ching-Wei and Tsai, Chen-An and Chen, James J.},
abstractNote = {Gene set analysis methods aim to determine whether an a priori defined set of genes shows statistically significant difference in expression on either categorical or continuous outcomes. Although many methods for gene set analysis have been proposed, a systematic analysis tool for identification of different types of gene set significance modules has not been developed previously. This work presents an R package, called MAVTgsa, which includes three different methods for integrated gene set enrichment analysis. (1) The one-sided OLS (ordinary least squares) test detects coordinated changes of genes in gene set in one direction, either up- or downregulation. (2) The two-sided MANOVA (multivariate analysis variance) detects changes both up- and downregulation for studying two or more experimental conditions. (3) A random forests-based procedure is to identify gene sets that can accurately predict samples from different experimental conditions or are associated with the continuous phenotypes. MAVTgsa computes the P values and FDR (false discovery rate) q -value for all gene sets in the study. Furthermore, MAVTgsa provides several visualization outputs to support and interpret the enrichment results. This package is available online.},
doi = {10.1155/2014/346074},
journal = {BioMed Research International},
number = ,
volume = 2014,
place = {Egypt},
year = {Wed Jan 01 00:00:00 EST 2014},
month = {Wed Jan 01 00:00:00 EST 2014}
}
https://doi.org/10.1155/2014/346074
Web of Science
Works referenced in this record:
Gene-set approach for expression pattern analysis
journal, January 2008
- Nam, D.; Kim, S. -Y.
- Briefings in Bioinformatics, Vol. 9, Issue 3
Improving gene set analysis of microarray data by SAM-GS
journal, January 2007
- Dinu, Irina; Potter, John D.; Mueller, Thomas
- BMC Bioinformatics, Vol. 8, Issue 1
Ontological analysis of gene expression data: current tools, limitations, and open problems
journal, June 2005
- Khatri, P.; Draghici, S.
- Bioinformatics, Vol. 21, Issue 18
Enrichment or depletion of a GO category within a class of genes: which test?
journal, December 2006
- Rivals, I.; Personnaz, L.; Taing, L.
- Bioinformatics, Vol. 23, Issue 4
Procedures for Comparing Samples with Multiple Endpoints
journal, December 1984
- O'Brien, Peter C.
- Biometrics, Vol. 40, Issue 4
Subdominant/Cryptic CD8 T Cell Epitopes Contribute to Resistance against Experimental Infection with a Human Protozoan Parasite
journal, July 2011
- Dominguez, Mariana R.; Silveira, Eduardo L. V.; de Vasconcelos, José Ronnie C.
- PLoS ONE, Vol. 6, Issue 7
Pathway analysis using random forests classification and regression
journal, June 2006
- Pang, H.; Lin, A.; Holford, M.
- Bioinformatics, Vol. 22, Issue 16
Significance analysis of groups of genes in expression profiling studies
journal, June 2007
- Chen, J. J.; Lee, T.; Delongchamp, R. R.
- Bioinformatics, Vol. 23, Issue 16
Exact t and F Tests for Analyzing Studies with Multiple Endpoints
journal, September 1996
- Lauter, Jurgen
- Biometrics, Vol. 52, Issue 3
Linear Combination Test for Hierarchical Gene Set Analysis
journal, January 2011
- Wang, Xiaoming; Dinu, Irina; Liu, Wei
- Statistical Applications in Genetics and Molecular Biology, Vol. 10, Issue 1
Global functional profiling of gene expression☆☆This work was funded in part by a Sun Microsystems grant awarded to S.D., NIH Grant HD36512 to S.A.K., a Wayne State University SOM Dean’s Post-Doctoral Fellowship, and an NICHD Contraception and Infertility Loan to G.C.O. Support from the WSU MCBI mode is gratefully appreciated.
journal, February 2003
- Drǎghici, Sorin; Khatri, Purvesh; Martins, Rui P.
- Genomics, Vol. 81, Issue 2
Discovering statistically significant pathways in expression profiling studies
journal, September 2005
- Tian, L.; Greenberg, S. A.; Kong, S. W.
- Proceedings of the National Academy of Sciences, Vol. 102, Issue 38
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
journal, September 2005
- Subramanian, A.; Tamayo, P.; Mootha, V. K.
- Proceedings of the National Academy of Sciences, Vol. 102, Issue 43, p. 15545-15550
On testing the significance of sets of genes
journal, June 2007
- Efron, Bradley; Tibshirani, Robert
- The Annals of Applied Statistics, Vol. 1, Issue 1
Gene expression profiling predicts clinical outcome of breast cancer
journal, January 2002
- van 't Veer, Laura J.; Dai, Hongyue; van de Vijver, Marc J.
- Nature, Vol. 415, Issue 6871
In vitro analysis of ovarian cancer response to cisplatin, carboplatin, and paclitaxel identifies common pathways that are also associated with overall patient survival
journal, May 2012
- Bicaku, E.; Xiong, Y.; Marchion, D. C.
- British Journal of Cancer, Vol. 106, Issue 12
A multivariate approach for integrating genome-wide expression data and biological knowledge
journal, July 2006
- Kong, S. W.; Pu, W. T.; Park, P. J.
- Bioinformatics, Vol. 22, Issue 19
A global test for groups of genes: testing association with a clinical outcome
journal, December 2003
- Goeman, J. J.; van de Geer, S. A.; de Kort, F.
- Bioinformatics, Vol. 20, Issue 1
Linear combination test for gene set analysis of a continuous phenotype
journal, July 2013
- Dinu, Irina; Wang, Xiaoming; Kelemen, Linda E.
- BMC Bioinformatics, Vol. 14, Issue 1
Significance analysis of functional categories in gene expression studies: a structured permutation approach
journal, January 2005
- Barry, W. T.; Nobel, A. B.; Wright, F. A.
- Bioinformatics, Vol. 21, Issue 9
Multivariate analysis of variance test for gene set analysis
journal, March 2009
- Tsai, Chen-An; Chen, James J.
- Bioinformatics, Vol. 25, Issue 7
PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes
journal, June 2003
- Mootha, Vamsi K.; Lindgren, Cecilia M.; Eriksson, Karl-Fredrik
- Nature Genetics, Vol. 34, Issue 3
A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics
journal, January 2005
- Schäfer, Juliane; Strimmer, Korbinian
- Statistical Applications in Genetics and Molecular Biology, Vol. 4, Issue 1
Random forests-based differential analysis of gene sets for gene expression data
journal, April 2013
- Hsueh, Huey-Miin; Zhou, Da-Wei; Tsai, Chen-An
- Gene, Vol. 518, Issue 1
A DNA Repair Pathway–Focused Score for Prediction of Outcomes in Ovarian Cancer Treated With Platinum-Based Chemotherapy
journal, April 2012
- Kang, Josephine; D’Andrea, Alan D.; Kozono, David
- JNCI: Journal of the National Cancer Institute, Vol. 104, Issue 9