skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Accelerating k-NN Algorithm with Hybrid MPI and OpenSHMEM

Conference ·

Machine Learning algorithms are benefiting from the continuous improvement of programming models, including MPI, MapReduce and PGAS. k-Nearest Neighbors (k-NN) algorithm is a widely used machine learning algorithm, applied to supervised learning tasks such as classification. Several parallel implementations of k-NN have been proposed in the literature and practice. However, on high-performance computing systems with high-speed interconnects, it is important to further accelerate existing designs of the k-NN algorithm through taking advantage of scalable programming models. To improve the performance of k-NN on large-scale environment with InfiniBand network, this paper proposes several alternative hybrid MPI+OpenSHMEM designs and performs a systemic evaluation and analysis on typical workloads. The hybrid designs leverage the one-sided memory access to better overlap communication with computation than the existing pure MPI design, and propose better schemes for efficient buffer management. The implementation based on k-NN program from MaTEx with MVAPICH2-X (Unified MPI+PGAS Communication Runtime over InfiniBand) shows up to 9.0% time reduction for training KDD Cup 2010 workload over 512 cores, and 27.6% time reduction for small workload with balanced communication and computation. Experiments of running with varied number of cores show that our design can maintain good scalability.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1236322
Report Number(s):
PNNL-SA-113024; 400470000
Resource Relation:
Conference: OpenSHMEM 2015: Second Workshop on OpenSHMEM and Related Technologies, August 4-6, 2015, Annapolis, Maryland, 164-177
Country of Publication:
United States
Language:
English

Similar Records

OpenSHMEM over MPI-3
Software · Wed Feb 26 00:00:00 EST 2014 · OSTI ID:1236322

Final Report for Project DE-FC02-06ER25755 [Pmodels2]
Technical Report · Wed Mar 12 00:00:00 EDT 2014 · OSTI ID:1236322

A Case for Application Oblivious Energy-Efficient MPI Runtime
Conference · Mon Oct 19 00:00:00 EDT 2015 · OSTI ID:1236322

Related Subjects