Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Scaling Up Data-Parallel Analytics Platforms: Linear Algebraic Operation Cases

Conference ·

Linear algebraic operations such as matrix manipulations form the kernel of many machine learning and other crucial algorithms. Scaling up as well as scaling out such algorithms are key to supporting large scale data analysis that require efficient processing over millions of data samples. To this end, we present, ARION, a hardware acceleration based approach for scaling-up individual tasks of Spark, a popular data-parallel analytics platform. We support both linear algebraic operations of between two dense matrices, and between sparse and dense matrices in distributed environments. ARION provides a flexible control of acceleration according to matrix density, along with efficient scheduling based on runtime resource utilization. We demonstrate the benefit of our approach for general matrix multiplication operations over large matrices with up to four billion elements by using Gramian matrix computation that is commonly used in machine learning. Experiments show that our approach achieves more than 2× and 1.5× end-to-end performance speedups for dense and sparse matrices, respectively, and up to 57.04× faster computation compared to MLlib, a state of the art Spark-based implementation. This work is sponsored in part by the NSF under the grants: CNS-1565314, CNS-1405697, and CNS-1615411. The manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1422792
Country of Publication:
United States
Language:
English

References (24)

Big data analytics with small footprint conference August 2013
Elemental: A New Framework for Distributed Memory Dense Matrix Computations journal February 2013
Comparing the performance of different x86 SIMD instruction sets for a medical imaging application on modern multi- and manycore chips conference February 2014
A BSP-Based Parallel Iterative Processing System with Multiple Partition Strategies for Big Graphs conference June 2013
Pregel: a system for large-scale graph processing conference January 2010
Plapack conference January 1997
Design, Synthesis and Dipeptidyl Peptidase 4 Inhibition of Novel Aminomethyl Biaryl Derivatives journal September 2017
Rethinking Data-Intensive Science Using Scalable Analytics Systems
  • Nothaft, Frank Austin; Linderman, Michael; Franklin, Michael J.
  • Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data - SIGMOD '15 https://doi.org/10.1145/2723372.2742787
conference January 2015
sPCA conference May 2015
Sensitivity of PCA for traffic anomaly detection journal June 2007
Autotuning GEMM Kernels for the Fermi GPU journal November 2012
Spark-Based Large-Scale Matrix Inversion for Big Data Processing journal January 2016
Singular Value Decomposition and Principal Component Analysis book January 2005
Scalable matrix inversion using MapReduce conference June 2014
A fast GEMM implementation on the cypress GPU journal March 2011
A Multi-Platform Evaluation of the Randomized CX Low-Rank Matrix Factorization in Spark conference May 2016
An Improved Magma Gemm For Fermi Graphics Processing Units journal September 2010
Exploiting accelerators for efficient high dimensional similarity search
  • Agrawal, Sandeep R.; Dee, Christopher M.; Lebeck, Alvin R.
  • Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming - PPoPP '16 https://doi.org/10.1145/2851141.2851144
conference January 2016
Level 3 basic linear algebra subprograms for sparse matrices journal September 1997
PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations conference December 2009
FatMan vs. LittleBoy: Scaling Up Linear Algebraic Operations in Scale-Out Data Platforms conference November 2016
Caffe: Convolutional Architecture for Fast Feature Embedding conference January 2014
ScaLAPACK conference November 1996
HAMA: An Efficient Matrix Computation with the MapReduce Framework conference November 2010

Similar Records

Sample changers for direct geometry neutron chopper spectrometers
Journal Article · Fri Aug 29 00:00:00 EDT 2025 · Scientific Reports · OSTI ID:3002892

Data Analysis Approach for Large Data Volumes in a Connected Community
Conference · Sun Jan 31 23:00:00 EST 2021 · OSTI ID:1783001

A study of stress-induced phase transformation and micromechanical behavior of CuZr-based alloy by in-situ neutron diffraction
Journal Article · Tue Feb 28 23:00:00 EST 2017 · Journal of Alloys and Compounds · OSTI ID:1335306

Related Subjects