DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Data shuffling with hierarchical tuple spaces

Abstract

Methods and systems for shuffling data are described. A processor may generate pair data from source data. The processor may insert the pair data into local tuple spaces. In response to a request for a particular key, the processor may determine a presence of the requested key in a global tuple space. The processor may, in response to a presence of the requested key in the global tuple space, update the global tuple space. The update may be based on the pair data among the local tuple spaces including the existing key. The processor may, in response to an absence of the requested key in the global tuple space, insert pair data including the missing key from the local tuple spaces into the global tuple space. The processor may fetch the requested pair data, and may shuffle the fetched data to generate a dataset.

Inventors:
; ; ;
Issue Date:
Research Org.:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1805660
Patent Number(s):
10956125
Application Number:
15/851,480
Assignee:
International Business Machines Corporation (Armonk, NY)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
AC02-05CH11231
Resource Type:
Patent
Resource Relation:
Patent File Date: 12/21/2017
Country of Publication:
United States
Language:
English

Citation Formats

Andrade Costa, Carlos Henrique, Kayi, Abdullah, Park, Yoonho, and Johns, Charles Ray. Data shuffling with hierarchical tuple spaces. United States: N. p., 2021. Web.
Andrade Costa, Carlos Henrique, Kayi, Abdullah, Park, Yoonho, & Johns, Charles Ray. Data shuffling with hierarchical tuple spaces. United States.
Andrade Costa, Carlos Henrique, Kayi, Abdullah, Park, Yoonho, and Johns, Charles Ray. Tue . "Data shuffling with hierarchical tuple spaces". United States. https://www.osti.gov/servlets/purl/1805660.
@article{osti_1805660,
title = {Data shuffling with hierarchical tuple spaces},
author = {Andrade Costa, Carlos Henrique and Kayi, Abdullah and Park, Yoonho and Johns, Charles Ray},
abstractNote = {Methods and systems for shuffling data are described. A processor may generate pair data from source data. The processor may insert the pair data into local tuple spaces. In response to a request for a particular key, the processor may determine a presence of the requested key in a global tuple space. The processor may, in response to a presence of the requested key in the global tuple space, update the global tuple space. The update may be based on the pair data among the local tuple spaces including the existing key. The processor may, in response to an absence of the requested key in the global tuple space, insert pair data including the missing key from the local tuple spaces into the global tuple space. The processor may fetch the requested pair data, and may shuffle the fetched data to generate a dataset.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2021},
month = {3}
}

Works referenced in this record:

Timeline Index for Partitioned Temporal Database Tables
patent-application, February 2016


Methods and Apparatus of Analyzing Electrical Power Grid Data
patent-application, December 2013


Method and apparatus for shuffling data
patent, December 2018


Organizing, Joining, and Performing Statistical Calculations on Massive Sets of Data
patent-application, September 2015


Method and system for facilitating data retrieval from a plurality of data sources
patent-application, November 2006


Data Shuffling with Hierarchical Tuple Spaces
patent-application, June 2019


Transparent Efficiency for in-Memory Execution of Map Reduce Job Sequences
patent-application, February 2014


Data-Parallel Computation Management
patent-application, March 2014


Network Server Systems, Architectures, Components and Related Methods
patent-application, April 2019


Software architecture for control of highly parallel computer systems
patent, November 1997


Workload Balancing to Handle Skews for Big Data Analytics
patent-application, November 2015


Method and Apparatus for Shuffling Data
patent-application, February 2011


Deterministic Progressive Big Data Analytics
patent-application, December 2014