Storing files in a parallel computing system using list-based index to identify replica files
Abstract
Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.
- Inventors:
- Issue Date:
- Research Org.:
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1195932
- Patent Number(s):
- 9087075
- Application Number:
- 13/536,331
- Assignee:
- EMC Corporation (Hopkinton, MA)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC52-06NA25396
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2012 Jun 28
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 97 MATHEMATICS AND COMPUTING
Citation Formats
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States: N. p., 2015.
Web.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, & Grider, Gary. Storing files in a parallel computing system using list-based index to identify replica files. United States.
Faibish, Sorin, Bent, John M., Tzelnic, Percy, Zhang, Zhenhua, and Grider, Gary. Tue .
"Storing files in a parallel computing system using list-based index to identify replica files". United States. https://www.osti.gov/servlets/purl/1195932.
@article{osti_1195932,
title = {Storing files in a parallel computing system using list-based index to identify replica files},
author = {Faibish, Sorin and Bent, John M. and Tzelnic, Percy and Zhang, Zhenhua and Grider, Gary},
abstractNote = {Improved techniques are provided for storing files in a parallel computing system using a list-based index to identify file replicas. A file and at least one replica of the file are stored in one or more storage nodes of the parallel computing system. An index for the file comprises at least one list comprising a pointer to a storage location of the file and a storage location of the at least one replica of the file. The file comprises one or more of a complete file and one or more sub-files. The index may also comprise a checksum value for one or more of the file and the replica(s) of the file. The checksum value can be evaluated to validate the file and/or the file replica(s). A query can be processed using the list.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2015},
month = {7}
}
Works referenced in this record:
A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications
conference, January 2004
- Hong Tang, ; Gulbeden, A.
- Proceedings of the ACM/IEEE SC2004 Conference
PLFS: a checkpoint filesystem for parallel applications
conference, January 2009
- Bent, John; Gibson, Garth; Grider, Gary