DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: MAVTgsa: An R Package for Gene Set (Enrichment) Analysis

Abstract

Gene set analysis methods aim to determine whether an a priori defined set of genes shows statistically significant difference in expression on either categorical or continuous outcomes. Although many methods for gene set analysis have been proposed, a systematic analysis tool for identification of different types of gene set significance modules has not been developed previously. This work presents an R package, called MAVTgsa, which includes three different methods for integrated gene set enrichment analysis. (1) The one-sided OLS (ordinary least squares) test detects coordinated changes of genes in gene set in one direction, either up- or downregulation. (2) The two-sided MANOVA (multivariate analysis variance) detects changes both up- and downregulation for studying two or more experimental conditions. (3) A random forests-based procedure is to identify gene sets that can accurately predict samples from different experimental conditions or are associated with the continuous phenotypes. MAVTgsa computes the P values and FDR (false discovery rate) q -value for all gene sets in the study. Furthermore, MAVTgsa provides several visualization outputs to support and interpret the enrichment results. This package is available online.

Authors:
 [1];  [2]; ORCiD logo [3];  [4]
  1. Community Medicine Research Center, Keelung Chang Gung Memorial Hospital, No. 200, Lane 208, Jijinyi Road, Anle District, Keelung 204, Taiwan
  2. Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, 3900 NCTR Road, HFT-20, Jefferson, AR 72079, USA
  3. Department of Agronomy, National Taiwan University, No. 1, Section 4, Roosevelt Road, Taipei 106, Taiwan
  4. Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA, 3900 NCTR Road, HFT-20, Jefferson, AR 72079, USA, Graduate Institute of Biostatistics and Biostatistics Center, China Medical University, Taichung, Taiwan
Publication Date:
Sponsoring Org.:
USDOE
OSTI Identifier:
1228276
Resource Type:
Published Article
Journal Name:
BioMed Research International
Additional Journal Information:
Journal Name: BioMed Research International Journal Volume: 2014; Journal ID: ISSN 2314-6133
Publisher:
Hindawi Publishing Corporation
Country of Publication:
Egypt
Language:
English

Citation Formats

Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, and Chen, James J. MAVTgsa: An R Package for Gene Set (Enrichment) Analysis. Egypt: N. p., 2014. Web. doi:10.1155/2014/346074.
Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, & Chen, James J. MAVTgsa: An R Package for Gene Set (Enrichment) Analysis. Egypt. https://doi.org/10.1155/2014/346074
Chien, Chih-Yi, Chang, Ching-Wei, Tsai, Chen-An, and Chen, James J. Wed . "MAVTgsa: An R Package for Gene Set (Enrichment) Analysis". Egypt. https://doi.org/10.1155/2014/346074.
@article{osti_1228276,
title = {MAVTgsa: An R Package for Gene Set (Enrichment) Analysis},
author = {Chien, Chih-Yi and Chang, Ching-Wei and Tsai, Chen-An and Chen, James J.},
abstractNote = {Gene set analysis methods aim to determine whether an a priori defined set of genes shows statistically significant difference in expression on either categorical or continuous outcomes. Although many methods for gene set analysis have been proposed, a systematic analysis tool for identification of different types of gene set significance modules has not been developed previously. This work presents an R package, called MAVTgsa, which includes three different methods for integrated gene set enrichment analysis. (1) The one-sided OLS (ordinary least squares) test detects coordinated changes of genes in gene set in one direction, either up- or downregulation. (2) The two-sided MANOVA (multivariate analysis variance) detects changes both up- and downregulation for studying two or more experimental conditions. (3) A random forests-based procedure is to identify gene sets that can accurately predict samples from different experimental conditions or are associated with the continuous phenotypes. MAVTgsa computes the P values and FDR (false discovery rate) q -value for all gene sets in the study. Furthermore, MAVTgsa provides several visualization outputs to support and interpret the enrichment results. This package is available online.},
doi = {10.1155/2014/346074},
journal = {BioMed Research International},
number = ,
volume = 2014,
place = {Egypt},
year = {Wed Jan 01 00:00:00 EST 2014},
month = {Wed Jan 01 00:00:00 EST 2014}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1155/2014/346074

Citation Metrics:
Cited by: 2 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Gene-set approach for expression pattern analysis
journal, January 2008

  • Nam, D.; Kim, S. -Y.
  • Briefings in Bioinformatics, Vol. 9, Issue 3
  • DOI: 10.1093/bib/bbn001

Improving gene set analysis of microarray data by SAM-GS
journal, January 2007


Ontological analysis of gene expression data: current tools, limitations, and open problems
journal, June 2005


Enrichment or depletion of a GO category within a class of genes: which test?
journal, December 2006


Procedures for Comparing Samples with Multiple Endpoints
journal, December 1984


Subdominant/Cryptic CD8 T Cell Epitopes Contribute to Resistance against Experimental Infection with a Human Protozoan Parasite
journal, July 2011

  • Dominguez, Mariana R.; Silveira, Eduardo L. V.; de Vasconcelos, José Ronnie C.
  • PLoS ONE, Vol. 6, Issue 7
  • DOI: 10.1371/journal.pone.0022011

Pathway analysis using random forests classification and regression
journal, June 2006


Significance analysis of groups of genes in expression profiling studies
journal, June 2007


Exact t and F Tests for Analyzing Studies with Multiple Endpoints
journal, September 1996


Linear Combination Test for Hierarchical Gene Set Analysis
journal, January 2011

  • Wang, Xiaoming; Dinu, Irina; Liu, Wei
  • Statistical Applications in Genetics and Molecular Biology, Vol. 10, Issue 1
  • DOI: 10.2202/1544-6115.1641

Discovering statistically significant pathways in expression profiling studies
journal, September 2005

  • Tian, L.; Greenberg, S. A.; Kong, S. W.
  • Proceedings of the National Academy of Sciences, Vol. 102, Issue 38
  • DOI: 10.1073/pnas.0506577102

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
journal, September 2005

  • Subramanian, A.; Tamayo, P.; Mootha, V. K.
  • Proceedings of the National Academy of Sciences, Vol. 102, Issue 43, p. 15545-15550
  • DOI: 10.1073/pnas.0506580102

On testing the significance of sets of genes
journal, June 2007

  • Efron, Bradley; Tibshirani, Robert
  • The Annals of Applied Statistics, Vol. 1, Issue 1
  • DOI: 10.1214/07-AOAS101

Gene expression profiling predicts clinical outcome of breast cancer
journal, January 2002

  • van 't Veer, Laura J.; Dai, Hongyue; van de Vijver, Marc J.
  • Nature, Vol. 415, Issue 6871
  • DOI: 10.1038/415530a

A multivariate approach for integrating genome-wide expression data and biological knowledge
journal, July 2006


A global test for groups of genes: testing association with a clinical outcome
journal, December 2003


Linear combination test for gene set analysis of a continuous phenotype
journal, July 2013


Significance analysis of functional categories in gene expression studies: a structured permutation approach
journal, January 2005


Multivariate analysis of variance test for gene set analysis
journal, March 2009


PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes
journal, June 2003

  • Mootha, Vamsi K.; Lindgren, Cecilia M.; Eriksson, Karl-Fredrik
  • Nature Genetics, Vol. 34, Issue 3
  • DOI: 10.1038/ng1180

Random Forests
journal, January 2001


A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics
journal, January 2005

  • Schäfer, Juliane; Strimmer, Korbinian
  • Statistical Applications in Genetics and Molecular Biology, Vol. 4, Issue 1
  • DOI: 10.2202/1544-6115.1175

Random forests-based differential analysis of gene sets for gene expression data
journal, April 2013


A DNA Repair Pathway–Focused Score for Prediction of Outcomes in Ovarian Cancer Treated With Platinum-Based Chemotherapy
journal, April 2012

  • Kang, Josephine; D’Andrea, Alan D.; Kozono, David
  • JNCI: Journal of the National Cancer Institute, Vol. 104, Issue 9
  • DOI: 10.1093/jnci/djs177