Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Efficient algorithms for multi-file caching

Conference ·
OSTI ID:824285
Multi-File Caching issues arise in applications where a set of jobs are processed and each job requests one or more input files. A given job can only be started if all its input files are preloaded into a disk cache. Examples of applications where Multi-File caching may be required are scientific data mining, bit-sliced indexes, and analysis of sets of vertically partitioned files. The difference between this type of caching and traditional file caching systems is that in this environment, caching and replacement decisions are made based on ''combinations of files (file bundles),'' rather than single files. In this work we propose new algorithms for Multi-File caching and analyze their performance. Extensive simulations are presented to establish the effectiveness of the Multi-File caching algorithm in terms of job response time and job queue length.
Research Organization:
Ernest Orlando Lawrence Berkeley National Laboratory, Berkeley, CA (US)
Sponsoring Organization:
USDOE Director. Office of Science. Computational and Advanced Scientific Computing Research, Office of Laboratory Policy and Infrastructure Management (US)
DOE Contract Number:
AC03-76SF00098
OSTI ID:
824285
Report Number(s):
LBNL--54880
Country of Publication:
United States
Language:
English

Similar Records

Optimal file-bundle caching algorithms for data-grids
Conference · Sat Apr 24 00:00:00 EDT 2004 · OSTI ID:824286

File caching in data intensive scientific applications
Conference · Sun Jul 18 00:00:00 EDT 2004 · OSTI ID:882745

Impact of admission and cache replacement policies on response times of jobs on data grids
Conference · Mon Apr 21 00:00:00 EDT 2003 · OSTI ID:813393