Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Machine Learning Toolkit for Extreme Scale

Software ·
DOI:https://doi.org/10.11578/dc.20171025.1458· OSTI ID:code-2638 · Code ID:2638

Support Vector Machines (SVM) is a popular machine learning technique, which has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. MaTEx undertakes the challenge of designing a scalable parallel SVM training algorithm for large scale systems, which includes commodity multi-core machines, tightly connected supercomputers and cloud computing systems. Several techniques are proposed for improved speed and memory space usage including adaptive and aggressive elimination of samples for faster convergence , and sparse format representation of data samples. Several heuristics for earliest possible to lazy elimination of non-contributing samples are considered in MaTEx. In many cases, where an early sample elimination might result in a false positive, low overhead mechanisms for reconstruction of key data structures are proposed. The proposed algorithm and heuristics are implemented and evaluated on various publicly available datasets

Short Name / Acronym:
MATEX
Site Accession Number:
5118
Software Type:
Scientific
License(s):
Other (Commercial or Open-Source)
Research Organization:
Pacific Northwest National Laboratory
Sponsoring Organization:
USDOE

Primary Award/Contract Number:
AC05-76RL01830
DOE Contract Number:
AC05-76RL01830
Code ID:
2638
OSTI ID:
code-2638
Country of Origin:
United States

Similar Records

Machine Learning Toolkit for Extreme Scale
Software · Mon Mar 31 00:00:00 EDT 2014 · OSTI ID:1231737

Fast and Accurate Support Vector Machines on Large Scale Systems
Conference · Tue Sep 08 00:00:00 EDT 2015 · OSTI ID:1236333

Adaptive Neuron Apoptosis for Accelerating Deep Learning on Large Scale Systems
Conference · Sun Feb 05 23:00:00 EST 2017 · OSTI ID:1440683

Related Subjects