Declustering databases on heterogeneous disk systems
Conference
·
OSTI ID:129221
- Lawrence Berkeley Lab., CA (United States)
- New York Univ., New york City, NY (United States). Leonard N. Stern School of Business
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many declustering methods have been proposed for symmetrical disk systems, i.e., multi-disk systems in which all disks have the same speed and capacity. This work deals with the problem of adapting such declustering methods to work in heterogeneous environments. In such environments these are many types of disks and servers with a large range of speeds and capacities. We deal first with the case of perfectly declustered queries, i.e., queries which retrieve a fixed proportion of the answer from each disk. We show that the fraction of the dataset which must be allocated to each disk is affected by both the relative speed and capacity of the disk. Furthermore, the hierarchical structure of most distributed systems, where groups of disks are placed in servers, imposes further complications due to variations . in server and network bandwidths which may affect the actual achievable transfer rates. We propose an algorithm which determines the fraction of the dataset which must be loaded on each disk. The algorithm may be tailored to find disk loading for minimal response time for a given database size, or to compute a system profile showing the optimal loading of the disks for all possible ranges of database sizes. Next we look at the probabilistic aspects of this problem and show how to optimize the expected retrieval time when the Proportions of the data retrieved from each disk axe random variables. We show the rather surprising result that in this case to achieve optimality, the fraction of the data loaded on each disk must not simply be proportional to its speed but rather some compensation must be made with bias towards the faster disks. The methods proposed here are general and can be used in conjunction with most known symmetric declustering methods.
- Research Organization:
- Lawrence Berkeley Lab., CA (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 129221
- Report Number(s):
- LBL--37215; CONF-950960--1; ON: DE96001312
- Country of Publication:
- United States
- Language:
- English
Similar Records
Distributed data access in the sequential access model at the D0 experiment at Fermilab
Methods of travel-time residual declustering for the knowledge base calibration and integration tool (KBCIT)
Parallel evaluation of the transitive closure of a database relation
Conference
·
Wed Jul 05 00:00:00 EDT 2000
·
OSTI ID:757586
Methods of travel-time residual declustering for the knowledge base calibration and integration tool (KBCIT)
Technical Report
·
Sun Feb 04 23:00:00 EST 2001
·
OSTI ID:15006177
Parallel evaluation of the transitive closure of a database relation
Journal Article
·
Sun Jan 31 23:00:00 EST 1988
· Int. J. Parallel Program.; (United States)
·
OSTI ID:6062706