Evaluation of normalization methods for cDNA microarray data by k-NN classification

Wu, Wei; Xing, Eric P; Myers, Connie; Mian, Saira; Bissell, Mina J

Title: Evaluation of normalization methods for cDNA microarray data by k-NN classification

Journal Article · Fri Dec 17 00:00:00 EST 2004 · BMC Bioinformatics

OSTI ID:989220

Wu, Wei; Xing, Eric P; Myers, Connie; Mian, Saira; Bissell, Mina J

Non-biological factors give rise to unwanted variations in cDNA microarray data. There are many normalization methods designed to remove such variations. However, to date there have been few published systematic evaluations of these techniques for removing variations arising from dye biases in the context of downstream, higher-order analytical tasks such as classification. Ten location normalization methods that adjust spatial- and/or intensity-dependent dye biases, and three scale methods that adjust scale differences were applied, individually and in combination, to five distinct, published, cancer biology-related cDNA microarray data sets. Leave-one-out cross-validation (LOOCV) classification error was employed as the quantitative end-point for assessing the effectiveness of a normalization method. In particular, a known classifier, k-nearest neighbor (k-NN), was estimated from data normalized using a given technique, and the LOOCV error rate of the ensuing model was computed. We found that k-NN classifiers are sensitive to dye biases in the data. Using NONRM and GMEDIAN as baseline methods, our results show that single-bias-removal techniques which remove either spatial-dependent dye bias (referred later as spatial effect) or intensity-dependent dye bias (referred later as intensity effect) moderately reduce LOOCV classification errors; whereas double-bias-removal techniques which remove both spatial- and intensity effect reduce LOOCV classification errors even further. Of the 41 different strategies examined, three two-step processes, IGLOESS-SLFILTERW7, ISTSPLINE-SLLOESS and IGLOESS-SLLOESS, all of which removed intensity effect globally and spatial effect locally, appear to reduce LOOCV classification errors most consistently and effectively across all data sets. We also found that the investigated scale normalization methods do not reduce LOOCV classification error. Using LOOCV error of k-NNs as the evaluation criterion, three double-bias-removal normalization strategies, IGLOESS-SLFILTERW7, ISTSPLINE-SLLOESS and IGLOESS-SLLOESS, outperform other strategies for removing spatial effect, intensity effect and scale differences from cDNA microarray data. The apparent sensitivity of k-NN LOOCV classification error to dye biases suggests that this criterion provides an informative measure for evaluating normalization methods. All the computational tools used in this study were implemented using the R language for statistical computing and graphics.

View Journal Article

Cite

Export

Save

Research Organization:: Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Sponsoring Organization:: Life Sciences Division

DOE Contract Number:: DE-AC02-05CH11231; DE-AC02-05CH11231, OBER DEAC0376SF00098, CA64786

OSTI ID:: 989220

Report Number(s):: LBNL-3959E; TRN: US201019%%401

Journal Information:: BMC Bioinformatics, Vol. 6, Issue 191; Related Information: Journal Publication Date: 7/26/2005

Country of Publication:: United States

Language:: English

Similar Records

Evaluation of normalization methods for cDNA microarray data by k-NN classification

Journal Article · Tue Jul 26 00:00:00 EDT 2005 · BMC Bioinformatics · OSTI ID:989220

Wu, Wei; Xing, Eric P.; Myers, Connie; +2 more

Assessing probe-specific dye and slide biases in two-color microarray data

Journal Article · Sat Jul 19 00:00:00 EDT 2008 · BMC Bioinformatics · OSTI ID:989220

Lu, Ruixiao; Lee, Geun-Cheol; Shultz, Michael; +9 more

Capturing the spatial variability of noise levels based on a short-term monitoring campaign and comparing noise surfaces against personal exposures collected through a panel study

Journal Article · Thu Nov 15 00:00:00 EST 2018 · Environmental Research · OSTI ID:989220

Fallah-Shorshani, Masoud; Minet, Laura; Liu, Rick; +6 more

Related Subjects

59 BASIC BIOLOGICAL SCIENCES
CLASSIFICATION
DYES
EVALUATION
NEOPLASMS
SENSITIVITY

Title: Evaluation of normalization methods for cDNA microarray data by k-NN classification

Citation Formats

Similar Records

Related Subjects