skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR)

Abstract

Many disciplines such as biology, economics, engineering, physics, and the social sciences represent their data as graphs to capture patterns, trends, and associations. There are are many commercially available graph libraries in different programming languages to analyze these complex graphs. But there is no distributed graph library package in R - the popular statistical programming language to analyze graphs that bigger than a single machine's memory. Many domain experts prefer R over the numerous other alternatives. Towards this, we present a distributed graph analytics framework for R called programming with big graph using R (pBGR.) Our proposed framework leverages the Programming with Big Data in R (pbdR) ecosystem that provides scalable R packages for distributed computing in data science. We present an early prototype implementation of this framework using the distributed-memory parallel graph library CombBLAS and evaluate the framework's performance on leadership class computing platforms. Our experimental results demonstrate that the proposed framework is capable of performing large-scale parallel graph mining through the easyto-use R language. This enhanced graph processing capability coupled with other statistical tools already available in R, should be valuable to many domain experts.

Authors:
 [1];  [1]; ORCiD logo [1]; ORCiD logo [1]
  1. ORNL
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1649576
DOE Contract Number:  
AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: IEEE Big Data 2019 - Sixth International Workshop on High Performance Big Graph Data Management, Analysis, and Mining (BigGraphs 2019) - Los Angeles, California, United States of America - 12/9/2019 6:00:00 PM-12/12/2019 6:00:00 PM
Country of Publication:
United States
Language:
English

Citation Formats

Hasan, S M Shamimul, Schmidt, Drew, Kannan, Ramakrishnan, and Imam, Neena. A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR). United States: N. p., 2019. Web.
Hasan, S M Shamimul, Schmidt, Drew, Kannan, Ramakrishnan, & Imam, Neena. A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR). United States.
Hasan, S M Shamimul, Schmidt, Drew, Kannan, Ramakrishnan, and Imam, Neena. Sun . "A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR)". United States. https://www.osti.gov/servlets/purl/1649576.
@article{osti_1649576,
title = {A Scalable Graph Analytics Framework for Programming with Big Data in R (pbdR)},
author = {Hasan, S M Shamimul and Schmidt, Drew and Kannan, Ramakrishnan and Imam, Neena},
abstractNote = {Many disciplines such as biology, economics, engineering, physics, and the social sciences represent their data as graphs to capture patterns, trends, and associations. There are are many commercially available graph libraries in different programming languages to analyze these complex graphs. But there is no distributed graph library package in R - the popular statistical programming language to analyze graphs that bigger than a single machine's memory. Many domain experts prefer R over the numerous other alternatives. Towards this, we present a distributed graph analytics framework for R called programming with big graph using R (pBGR.) Our proposed framework leverages the Programming with Big Data in R (pbdR) ecosystem that provides scalable R packages for distributed computing in data science. We present an early prototype implementation of this framework using the distributed-memory parallel graph library CombBLAS and evaluate the framework's performance on leadership class computing platforms. Our experimental results demonstrate that the proposed framework is capable of performing large-scale parallel graph mining through the easyto-use R language. This enhanced graph processing capability coupled with other statistical tools already available in R, should be valuable to many domain experts.},
doi = {},
url = {https://www.osti.gov/biblio/1649576}, journal = {},
number = ,
volume = ,
place = {United States},
year = {2019},
month = {12}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: