Machine Learning Toolkit for Extreme Scale
Support Vector Machines (SVM) is a popular machine learning technique, which has been applied to a wide range of domains such as science, finance, and social networks for supervised learning. MaTEx undertakes the challenge of designing a scalable parallel SVM training algorithm for large scale systems, which includes commodity multi-core machines, tightly connected supercomputers and cloud computing systems. Several techniques are proposed for improved speed and memory space usage including adaptive and aggressive elimination of samples for faster convergence , and sparse format representation of data samples. Several heuristics for earliest possible to lazy elimination of non-contributing samples are considered in MaTEx. In many cases, where an early sample elimination might result in a false positive, low overhead mechanisms for reconstruction of key data structures are proposed. The proposed algorithm and heuristics are implemented and evaluated on various publicly available datasets
- Short Name / Acronym:
- MATEX
- Site Accession Number:
- 5118
- Software Type:
- Scientific
- License(s):
- Other (Commercial or Open-Source)
- Research Organization:
- Pacific Northwest National Laboratory
- Sponsoring Organization:
- USDOEPrimary Award/Contract Number:AC05-76RL01830
- DOE Contract Number:
- AC05-76RL01830
- Code ID:
- 2638
- OSTI ID:
- code-2638
- Country of Origin:
- United States
Similar Records
Fast and Accurate Support Vector Machines on Large Scale Systems
Adaptive Neuron Apoptosis for Accelerating Deep Learning on Large Scale Systems