Distance Metrics and Clustering Methods for Mixed-type Data
Journal Article
·
· International Statistical Review
- Univ. at Buffalo, Buffalo, NY (United States)
- Arenadotio, New York, NY (United States)
In spite of the abundance of clustering techniques and algorithms, clustering mixed interval (continuous) and categorical (nominal and/or ordinal) scale data remain a challenging problem. In order to identify the most effective approaches for clustering mixed–type data, we use both theoretical and empirical analyses to present a critical review of the strengths and weaknesses of the methods identified in the literature. Here, the guidelines on approaches to use under different scenarios are provided, along with potential directions for future research.
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1459931
- Report Number(s):
- SAND--2018-7091J; {665360,"Journal ID: ISSN 0306-7734"}
- Journal Information:
- International Statistical Review, Journal Name: International Statistical Review Journal Issue: 1 Vol. 87; ISSN 0306-7734
- Country of Publication:
- United States
- Language:
- English
Distance‐based clustering of mixed data
|
journal | November 2018 |
Similar Records
Evaluating the Effects of Missing Values and Mixed Data Types on Social Sequence Clustering Using t-SNE Visualization
Evaluation of Machine Learning Approaches to Estimate Aerosol Mixing State Metrics in Atmospheric Models
Detection of Anomalies in Gamma Background Radiation Data with K-Means and Self-Organizing Map Clustering Algorithms (Consortium on Nuclear Security Technologies (CONNECT) Q1 Report)
Journal Article
·
2019
· Journal of Data and Information Quality
·
OSTI ID:1526589
+3 more
Evaluation of Machine Learning Approaches to Estimate Aerosol Mixing State Metrics in Atmospheric Models
Technical Report
·
2019
·
OSTI ID:1513380
+1 more
Detection of Anomalies in Gamma Background Radiation Data with K-Means and Self-Organizing Map Clustering Algorithms (Consortium on Nuclear Security Technologies (CONNECT) Q1 Report)
Technical Report
·
2021
·
OSTI ID:1841591