Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

An efficient approach to discovering knowledge from large databases

Conference ·
OSTI ID:535538
;  [1]
  1. National Tsing Hua Univ., Taiwan (China)
In this paper, we study two problems: mining association rules and mining sequential patterns in a large database of customer transactions. The problem of mining association rules focuses on discovering large itemsets where a large itemset is a group of items which appear together in a sufficient number of transactions; while the problem of mining sequential patterns focuses on discovering large sequences where a large sequence is an ordered list of sets of items which appear in a sufficient number of transactions. We present efficient graph-based algorithms to solve these problems. The algorithms construct an association graph to indicate the associations between items and then traverse the graph to generate large itemsets and large sequences, respectively. Our algorithms need to scan the database only once. Empirical evaluations show that our algorithms outperform other algorithms which need to make multiple passes over the database.
OSTI ID:
535538
Report Number(s):
CONF-961209--; CNN: Contract NSC 86-2213-E-007-009
Country of Publication:
United States
Language:
English

Similar Records

A fast distributed algorithm for mining association rules
Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:535540

DisClose: Discovering Colossal Closed Itemsets via a Memory Efficient Compact Row-Tree
Conference · Thu Jan 31 23:00:00 EST 2013 · OSTI ID:1076707

Hash based parallel algorithms for mining association rules
Conference · Mon Dec 30 23:00:00 EST 1996 · OSTI ID:535539