DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Storing files in a parallel computing system using list-based index to identify replica files

Abstract

Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.

Inventors:
; ; ; ;
Issue Date:
Research Org.:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1195932
Patent Number(s):
9087075
Application Number:
13/536,331
Assignee:
EMC Corporation (Hopkinton, MA)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
AC52-06NA25396
Resource Type:
Patent
Resource Relation:
Patent File Date: 2012 Jun 28
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States: N. p., 2015. Web.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, & Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Tue . "Storing files in a parallel computing system using list-based index to identify replica files". United States. https://www.osti.gov/servlets/purl/1195932.
@article{osti_1195932,
title = {Storing files in a parallel computing system using list-based index to identify replica files},
author = {Faibish, Sorin and Bent, John M. and Tzelnic, Percy and Zhang, Zhenhua and Grider, Gary},
abstractNote = {Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {7}
}

Works referenced in this record:

A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications
conference, January 2004


PLFS: a checkpoint filesystem for parallel applications
conference, January 2009


Storage management system with file aggregation
patent, August 2000


Leasing scheme for data-modifying operations
patent, June 2006


Filesystem-aware block storage system, apparatus, and method
patent, January 2011


Method and system of providing replica files within a fileset
patent, August 2011


Systems and methods for managing portions of files in multi-tier storage systems
patent, January 2013


Distributed file system and method of operating a distributed file system
patent-application, May 2006


Maintaining active-only copy storage pools
patent-application, February 2007


    Works referencing / citing this record: