skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Sequential Projection Pursuit Principal Component Analysis – Dealing with Missing Data Associated with New -Omics Technologies

Journal Article · · BioTechniques, 54(3):165-168
DOI:https://doi.org/10.2144/000113978· OSTI ID:1072899

We present a new version of sequential projection pursuit Principal Component Analysis (sppPCA) that has the capability to perform PCA on large multivariate datasets that contain non-random missing values. We demonstrate that sppPCA generates more robust and informative low-dimensional representations of the data than imputation-based approaches and improved downstream statistical analyses, such as clustering or classification. A Java program to run sppPCA is freely available at https://www.biopilot.org/docs/Software/sppPCA.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1072899
Report Number(s):
PNNL-SA-87092; 400412000
Journal Information:
BioTechniques, 54(3):165-168, Vol. 54, Issue 3; ISSN 0736-6205
Country of Publication:
United States
Language:
English

Similar Records

An improved optimization algorithm and Bayes factor termination criterion for sequential projection pursuit
Journal Article · Sat May 28 00:00:00 EDT 2005 · Chemometrics and Intelligent Laboratory Systems · OSTI ID:1072899

Effective Dimension Reduction Using Sequential Projection Pursuit On Gene Expression Data for Cancer Classification
Conference · Wed Jun 23 00:00:00 EDT 2004 · OSTI ID:1072899

Kernel principal component analysis for stochastic input model generation
Journal Article · Wed Aug 10 00:00:00 EDT 2011 · Journal of Computational Physics · OSTI ID:1072899

Related Subjects