Clustering high dimensional data using RIA

Aziz, Nazrina

doi:10.1063/1.4915706

Journal Article · 15 May 2015 · AIP Conference Proceedings

DOI: https://doi.org/10.1063/1.4915706 · OSTI ID: 22391657

Aziz, Nazrina [1]

[1] School of Quantitative Sciences, College of Arts and Sciences, Universiti Utara Malaysia, 06010 Sintok, Kedah (Malaysia)

Clustering is a convenient way to organize a large data set so that it can be understood easily and information can be retrieved efficiently. However, identifying clusters in high-dimensional data sets is difficult because of the curse of dimensionality. Another challenge in clustering is that some traditional functions cannot capture the pattern dissimilarity among objects. In this article, we use an alternative dissimilarity measure, the Robust Influence Angle (RIA), in the partitioning method. RIA is developed from the eigenstructure of the covariance matrix and robust principal component scores. We observe that it obtains clusters easily and hence avoids the curse of dimensionality. It also manages to cluster large data sets with mixed numeric and categorical values.
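The abstract does not give the RIA formula, so the following is only a rough sketch of the general idea it describes: plugging an angle-based dissimilarity, computed on principal component scores, into a partitioning method. Everything in it is an assumption made for illustration. The function names (`pc_scores`, `angle_dissimilarity`, `k_medoids`) are hypothetical, the scores come from the classical (non-robust) covariance matrix, the dissimilarity is the plain angle between score vectors rather than RIA itself, and k-medoids stands in for whichever partitioning method the article uses.

```python
import numpy as np

def pc_scores(X, n_components):
    """Scores on the leading eigenvectors of the classical covariance matrix.
    Assumption: the article uses a robust variant; this sketch does not."""
    Xc = X - X.mean(axis=0)
    cov = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return Xc @ eigvecs[:, order]

def angle_dissimilarity(S):
    """Pairwise angle in radians between score vectors.
    A stand-in dissimilarity, NOT the RIA definition."""
    norms = np.linalg.norm(S, axis=1, keepdims=True)
    U = S / np.where(norms == 0, 1.0, norms)        # avoid division by zero
    D = np.arccos(np.clip(U @ U.T, -1.0, 1.0))
    np.fill_diagonal(D, 0.0)
    return D

def k_medoids(D, k, n_iter=100, seed=0):
    """Basic alternating k-medoids on a precomputed dissimilarity matrix."""
    rng = np.random.default_rng(seed)
    medoids = rng.choice(D.shape[0], size=k, replace=False)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)   # assign to nearest medoid
        new_medoids = medoids.copy()
        for j in range(k):
            members = np.flatnonzero(labels == j)
            if members.size == 0:
                continue                            # keep the old medoid
            within = D[np.ix_(members, members)].sum(axis=1)
            new_medoids[j] = members[np.argmin(within)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    labels = np.argmin(D[:, medoids], axis=1)
    return labels, medoids

# Usage on synthetic data: 30 objects, 20 variables, 3 retained components.
X = np.random.default_rng(1).normal(size=(30, 20))
D = angle_dissimilarity(pc_scores(X, n_components=3))
labels, medoids = k_medoids(D, k=3)
```

Because the partitioning step only sees the matrix `D`, any other dissimilarity, including a faithful RIA implementation, could be substituted without changing `k_medoids`.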

OSTI ID: 22391657

Journal Information: AIP Conference Proceedings, Vol. 1660, Issue 1; Conference: ICoMEIA 2014: International Conference on Mathematics, Engineering and Industrial Applications 2014, Penang (Malaysia), 28-30 May 2014; Other Information: (c) 2015 AIP Publishing LLC; Country of input: International Atomic Energy Agency (IAEA); ISSN 0094-243X

Country of Publication: United States

Language: English

Related Subjects

71 CLASSICAL AND QUANTUM MECHANICS
GENERAL PHYSICS
DATA PROCESSING
EIGENVALUES
FUNCTIONS
MATHEMATICAL MODELS
MATHEMATICAL SOLUTIONS
MATRICES
PARTITION