DOE Patents title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Storing files in a parallel computing system using list-based index to identify replica files

Abstract

Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.

Inventors:
; ; ; ;
Issue Date:
Research Org.:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1195932
Patent Number(s):
9087075
Application Number:
13/536,331
Assignee:
EMC Corporation (Hopkinton, MA)
Patent Classifications (CPCs):
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
DOE Contract Number:  
AC52-06NA25396
Resource Type:
Patent
Resource Relation:
Patent File Date: 2012 Jun 28
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING

Citation Formats

Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States: N. p., 2015. Web.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, & Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Tue . "Storing files in a parallel computing system using list-based index to identify replica files". United States. https://www.osti.gov/servlets/purl/1195932.
@article{osti_1195932,
title = {Storing files in a parallel computing system using list-based index to identify replica files},
author = {Faibish, Sorin and Bent, John M. and Tzelnic, Percy and Zhang, Zhenhua and Grider, Gary},
abstractNote = {Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {7}
}

Works referenced in this record:

A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications
conference, January 2004


PLFS: a checkpoint filesystem for parallel applications
conference, January 2009