Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Precision-time tradeoffs: a paradigm for processing statistical queries on databases

Conference ·
OSTI ID:6929968
Conventional query processing techniques are aimed at queries which access small amounts of data, and require each data item for the answer. In case the database is used for statistical analysis as well as operational purposes, for some types of queries a large part of the database may be required to compute the answer. This may lead to a data access bottleneck, caused by the excessive number of disk assesses needed to get the data into primary memory. An example is computation of statistical parameters, such as count, average, median, and standard deviation, which are useful for statistical analysis of the database. Yet another example that faces this bottleneck is the verification of the truth of a set of predicates (goals), based on the current database state, for the purposes of intelligent decision making. A solution to this problem is to maintain a set of precomputed information about the database in a view or a snapshot. Statistical queries can be processed using the view rather than the real database. A crucial issue is that the precision of the precomputed information in the view deteriorates with time, because of the dynamic nature of the underlying database. Thus the answer provided is approximate, which is acceptable under many circumstances, especially when the error is bounded. The tradeoff is that the processing of queries is made faster at the expense of the precision in the answer. The concept of precision in the context of database queries is formalized, and a data model to incorporate it is developed. Algorithms are designed to maintain materialized views of data to specified degrees of precision.
Research Organization:
California Univ., Berkeley (USA). Computer Science Div.; Lawrence Berkeley Lab., CA (USA)
DOE Contract Number:
AC03-76SF00098
OSTI ID:
6929968
Report Number(s):
LBL-24767; CONF-8806166-1; ON: DE88012024
Country of Publication:
United States
Language:
English

Similar Records

A framework for expressing and controlling impressions in databases
Technical Report · Mon Feb 29 23:00:00 EST 1988 · OSTI ID:7153537

Temporal aggregates for time-constrained queries
Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:457115

Query optimization in distributed databases
Technical Report · Fri Oct 01 00:00:00 EDT 1982 · OSTI ID:6744813