Precision-time tradeoffs: a paradigm for processing statistical queries on databases
Conference
·
OSTI ID:6929968
Conventional query processing techniques are aimed at queries which access small amounts of data, and require each data item for the answer. In case the database is used for statistical analysis as well as operational purposes, for some types of queries a large part of the database may be required to compute the answer. This may lead to a data access bottleneck, caused by the excessive number of disk assesses needed to get the data into primary memory. An example is computation of statistical parameters, such as count, average, median, and standard deviation, which are useful for statistical analysis of the database. Yet another example that faces this bottleneck is the verification of the truth of a set of predicates (goals), based on the current database state, for the purposes of intelligent decision making. A solution to this problem is to maintain a set of precomputed information about the database in a view or a snapshot. Statistical queries can be processed using the view rather than the real database. A crucial issue is that the precision of the precomputed information in the view deteriorates with time, because of the dynamic nature of the underlying database. Thus the answer provided is approximate, which is acceptable under many circumstances, especially when the error is bounded. The tradeoff is that the processing of queries is made faster at the expense of the precision in the answer. The concept of precision in the context of database queries is formalized, and a data model to incorporate it is developed. Algorithms are designed to maintain materialized views of data to specified degrees of precision.
- Research Organization:
- California Univ., Berkeley (USA). Computer Science Div.; Lawrence Berkeley Lab., CA (USA)
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 6929968
- Report Number(s):
- LBL-24767; CONF-8806166-1; ON: DE88012024
- Country of Publication:
- United States
- Language:
- English
Similar Records
A framework for expressing and controlling impressions in databases
Temporal aggregates for time-constrained queries
Query optimization in distributed databases
Technical Report
·
Mon Feb 29 23:00:00 EST 1988
·
OSTI ID:7153537
Temporal aggregates for time-constrained queries
Conference
·
Mon Dec 30 23:00:00 EST 1996
·
OSTI ID:457115
Query optimization in distributed databases
Technical Report
·
Fri Oct 01 00:00:00 EDT 1982
·
OSTI ID:6744813