An improved optimization algorithm and Bayes factor termination criterion for sequential projection pursuit

Webb-Robertson, Bobbie-Jo M; Jarman, Kristin H; Harvey, Scott D; Posse, Christian; Wright, Bob W

doi:10.1016/j.chemolab.2004.09.014

Title: An improved optimization algorithm and Bayes factor termination criterion for sequential projection pursuit

Journal Article · Sat May 28 00:00:00 EDT 2005 · Chemometrics and Intelligent Laboratory Systems

DOI:https://doi.org/10.1016/j.chemolab.2004.09.014· OSTI ID:15020557

Webb-Robertson, Bobbie-Jo M; Jarman, Kristin H; Harvey, Scott D; Posse, Christian; Wright, Bob W

A fundamental problem in analysis of highly multivariate spectral or chromatographic data is reduction of dimensionality. Principal components analysis (PCA), concerned with explaining the variance-covariance structure of the data, is a commonly used approach to dimension reduction. Recently an attractive alternative to PCA, sequential projection pursuit (SPP), has been introduced. Designed to elicit clustering tendencies in the data, SPP may be more appropriate when performing clustering or classification analysis. However, the existing genetic algorithm (GA) implementation of SPP has two shortcomings, computation time and inability to determine the number of factors necessary to explain the majority of the structure in the data. We address both these shortcomings. First, we introduce a new SPP algorithm, a random scan sampling algorithm (RSSA), that significantly reduces computation time. We compare the computational burden of the RSS and GA implementation for SPP on a dataset containing Raman spectra of twelve organic compounds. Second, we propose a Bayes factor criterion, BFC, as an effective measure for selecting the number of factors needed to explain the majority of the structure in the data. We compare SPP to PCA on two datasets varying in type, size, and difficulty; in both cases SPP achieves a higher accuracy with a lower number of latent variables.

Cite

Export

Save

Research Organization:: Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

Sponsoring Organization:: USDOE

DOE Contract Number:: AC05-76RL01830

OSTI ID:: 15020557

Report Number(s):: PNNL-SA-41907; TRN: US200521%%172

Journal Information:: Chemometrics and Intelligent Laboratory Systems, Vol. 77, Issue 1-2

Country of Publication:: United States

Language:: English

Similar Records

Effective Dimension Reduction Using Sequential Projection Pursuit On Gene Expression Data for Cancer Classification

Conference · Wed Jun 23 00:00:00 EDT 2004 · OSTI ID:15020557

Webb-Robertson, Bobbie-Jo M; Havre, Susan L

Sequential Projection Pursuit Principal Component Analysis – Dealing with Missing Data Associated with New -Omics Technologies

Journal Article · Fri Mar 15 00:00:00 EDT 2013 · BioTechniques, 54(3):165-168 · OSTI ID:15020557

Webb-Robertson, Bobbie-Jo M.; Matzke, Melissa M.; Metz, Thomas O.; +5 more

Dimension Reduction via Unsupervised Learning Yields Significant Computational Improvements for Support Vector Machine Based Protein Family Classification.

Conference · Thu Feb 26 00:00:00 EST 2009 · OSTI ID:15020557

Webb-Robertson, Bobbie-Jo M; Matzke, Melissa M; Oehmen, Christopher S

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
ACCURACY
ALGORITHMS
CLASSIFICATION
DIMENSIONS
GENETICS
IMPLEMENTATION
OPTIMIZATION
ORGANIC COMPOUNDS
RAMAN SPECTRA
SAMPLING
statistics
multivariate
classification
clustering
Bayes

Title: An improved optimization algorithm and Bayes factor termination criterion for sequential projection pursuit

Citation Formats

Similar Records

Related Subjects