Sequential Projection Pursuit Principal Component Analysis – Dealing with Missing Data Associated with New -Omics Technologies
Journal Article
·
· BioTechniques, 54(3):165-168
We present a new version of sequential projection pursuit Principal Component Analysis (sppPCA) that has the capability to perform PCA on large multivariate datasets that contain non-random missing values. We demonstrate that sppPCA generates more robust and informative low-dimensional representations of the data than imputation-based approaches and improved downstream statistical analyses, such as clustering or classification. A Java program to run sppPCA is freely available at https://www.biopilot.org/docs/Software/sppPCA.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1072899
- Report Number(s):
- PNNL-SA-87092; 400412000
- Journal Information:
- BioTechniques, 54(3):165-168, Vol. 54, Issue 3; ISSN 0736-6205
- Country of Publication:
- United States
- Language:
- English
Similar Records
An improved optimization algorithm and Bayes factor termination criterion for sequential projection pursuit
Effective Dimension Reduction Using Sequential Projection Pursuit On Gene Expression Data for Cancer Classification
Kernel principal component analysis for stochastic input model generation
Journal Article
·
Sat May 28 00:00:00 EDT 2005
· Chemometrics and Intelligent Laboratory Systems
·
OSTI ID:1072899
+2 more
Effective Dimension Reduction Using Sequential Projection Pursuit On Gene Expression Data for Cancer Classification
Conference
·
Wed Jun 23 00:00:00 EDT 2004
·
OSTI ID:1072899
Kernel principal component analysis for stochastic input model generation
Journal Article
·
Wed Aug 10 00:00:00 EDT 2011
· Journal of Computational Physics
·
OSTI ID:1072899