skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures

Abstract

This paper introduces PapyrusKV, a parallel embedded key-value store (KVS) for distributed high-performance computing (HPC) architectures that offer potentially massive pools of nonvolatile memory (NVM). PapyrusKV stores keys with their values in arbitrary byte arrays across multiple NVMs in a distributed system. PapyrusKV provides standard KVS operations such as put, get, and delete. More importantly, PapyrusKV provides advanced features for HPC such as dynamic consistency control, zero-copy workflow, and asynchronous checkpoint/restart. Beyond filesystems, PapyrusKV provides HPC programmers with a high-level interface to exploit distributed NVM in the system, and it transparently organizes data to achieve high performance. Also, it allows HPC applications to specialize PapyrusKV to meet their specific requirements. We empirically evaluate PapyrusKV on three HPC systems with real NVM devices: OLCF's Summitdev, TACC's Stampede, and NERSC's Cori. Our results show that PapyrusKV can offer high performance, scalability, and portability across these various distributed NVM architectures.

Authors:
 [1];  [1];  [1]
  1. Oak Ridge National Laboratory
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1567466
DOE Contract Number:  
AC02-05CH11231; AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: SC '17 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; Computer Science

Citation Formats

Kim, Jungwon, Lee, Seyong, and Vetter, Jeffrey S. PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures. United States: N. p., 2017. Web. doi:10.1145/3126908.3126943.
Kim, Jungwon, Lee, Seyong, & Vetter, Jeffrey S. PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures. United States. doi:10.1145/3126908.3126943.
Kim, Jungwon, Lee, Seyong, and Vetter, Jeffrey S. Sun . "PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures". United States. doi:10.1145/3126908.3126943.
@article{osti_1567466,
title = {PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures},
author = {Kim, Jungwon and Lee, Seyong and Vetter, Jeffrey S.},
abstractNote = {This paper introduces PapyrusKV, a parallel embedded key-value store (KVS) for distributed high-performance computing (HPC) architectures that offer potentially massive pools of nonvolatile memory (NVM). PapyrusKV stores keys with their values in arbitrary byte arrays across multiple NVMs in a distributed system. PapyrusKV provides standard KVS operations such as put, get, and delete. More importantly, PapyrusKV provides advanced features for HPC such as dynamic consistency control, zero-copy workflow, and asynchronous checkpoint/restart. Beyond filesystems, PapyrusKV provides HPC programmers with a high-level interface to exploit distributed NVM in the system, and it transparently organizes data to achieve high performance. Also, it allows HPC applications to specialize PapyrusKV to meet their specific requirements. We empirically evaluate PapyrusKV on three HPC systems with real NVM devices: OLCF's Summitdev, TACC's Stampede, and NERSC's Cori. Our results show that PapyrusKV can offer high performance, scalability, and portability across these various distributed NVM architectures.},
doi = {10.1145/3126908.3126943},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2017},
month = {11}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share:

Works referenced in this record:

Space/time trade-offs in hash coding with allowable errors
journal, July 1970


Distributed snapshots: determining global states of distributed systems
journal, February 1985

  • Chandy, K. Mani; Lamport, Leslie
  • ACM Transactions on Computer Systems, Vol. 3, Issue 1
  • DOI: 10.1145/214451.214456

Cassandra: a decentralized structured storage system
journal, April 2010

  • Lakshman, Avinash; Malik, Prashant
  • ACM SIGOPS Operating Systems Review, Vol. 44, Issue 2
  • DOI: 10.1145/1773912.1773922

A Survey of Software Techniques for Using Non-Volatile Memories for Storage and Main Memory Systems
journal, May 2016

  • Mittal, Sparsh; Vetter, Jeffrey S.
  • IEEE Transactions on Parallel and Distributed Systems, Vol. 27, Issue 5
  • DOI: 10.1109/TPDS.2015.2442980

The log-structured merge-tree (LSM-tree)
journal, June 1996

  • O’Neil, Patrick; Cheng, Edward; Gawlick, Dieter
  • Acta Informatica, Vol. 33, Issue 4
  • DOI: 10.1007/s002360050048

Opportunities for Nonvolatile Memory Systems in Extreme-Scale High-Performance Computing
journal, March 2015

  • Vetter, Jeffrey S.; Mittal, Sparsh
  • Computing in Science & Engineering, Vol. 17, Issue 2
  • DOI: 10.1109/MCSE.2015.4

In-Memory Big Data Management and Processing: A Survey
journal, July 2015

  • Zhang, Hao; Chen, Gang; Ooi, Beng Chin
  • IEEE Transactions on Knowledge and Data Engineering, Vol. 27, Issue 7
  • DOI: 10.1109/TKDE.2015.2427795