A probabilistic approach to information retrieval in heterogeneous databases
During the past decade, organizations have expanded their scope and operations beyond their traditional geographic boundaries. At the same time, they have adopted heterogeneous and incompatible information systems independently of one another, without considering that these systems might one day need to be integrated. As a result of this diversity, many important business applications today require access to data stored in multiple autonomous databases. This paper examines the problem of inter-database information retrieval in a heterogeneous environment, where conventional techniques are no longer efficient. To solve the problem, broader definitions of the join, union, intersection, and selection operators are proposed. A probabilistic method for specifying the selectivity of these operators is also discussed, and an algorithm to compute these probabilities is provided in pseudocode.
- Research Organization:
- Lawrence Berkeley Lab., CA (United States)
- Sponsoring Organization:
- USDOE; USDOE, Washington, DC (United States)
- DOE Contract Number:
- AC03-76SF00098
- OSTI ID:
- 7275289
- Report Number(s):
- LBL-31117; CONF-9112126-1; ON: DE92016993
- Resource Relation:
- Conference: Workshop on information technologies and systems, Cambridge, MA (United States), 14-15 Dec 1991
- Country of Publication:
- United States
- Language:
- English
Similar Records
Data manipulation in heterogeneous databases