Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A Highly Parallel Implementation of K-Means for Multithreaded Architecture

Conference ·
OSTI ID:1030877

We present a parallel implementation of the popular k-means clustering algorithm for massively multithreaded computer systems, as well as a parallelized version of the KKZ seed selection algorithm. We demonstrate that as system size increases, sequential seed selection can become a bottleneck. We also present an early attempt at parallelizing k-means that highlights critical performance issues when programming massively multithreaded systems. For our case studies, we used data collected from electric power simulations and run on the Cray XMT.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (US), Environmental Molecular Sciences Laboratory (EMSL)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1030877
Report Number(s):
PNNL-SA-76703; 37300; 400470000
Country of Publication:
United States
Language:
English