Impact of admission and cache replacement policies on response times of jobs on data grids
Caching techniques have been used widely to improve the performance gaps of storage hierarchies in computing systems. Little is known about the impact of policies on the response times of jobs that access and process very large files in data grids particularly when data and computations on the data have to be co-located on the same host. In data intensive applications that access large data files over wide area network environment, such as data-grids, the combination of policies for job servicing (or scheduling), caching and cache replacement can significantly impact the performance of grid jobs. We present some preliminary results of a simulation study that combines an admission policy with a cache replacement policy when servicing jobs submitted to a storage resource manager. The results show that, in comparison to a first come first serve policy, the response times of jobs are significantly improved, for practical limits of disk cache sizes, when the jobs that are back-logged to access the same files are taken into consideration in scheduling the next file to be retrieved into the disk cache. Not only are the response times of jobs improved, but also the metric measures for caching policies, such as the hit ratio and the average cost per retrieval, are improved irrespective of the cache replacement policy.
- Research Organization:
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- U.S. DOE. Director, Office of Science. Computational and Technology Research (US)
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 813393
- Report Number(s):
- LBNL-53022; R&D Project: KS3310; TRN: US200316%%146
- Resource Relation:
- Conference: Challenges of Large Applications in Distributed Environments (CLADE'2003), Seattle, WA (US), 06/21/2003; Other Information: PBD: 21 Apr 2003
- Country of Publication:
- United States
- Language:
- English
Similar Records
Optimal file-bundle caching algorithms for data-grids
The D0 Experiment Data Grid: SAM