U.S. Department of Energy
Office of Scientific and Technical Information

Improving Naive Bayes with Online Feature Selection for Quick Adaptation to Evolving Feature Usefulness

Conference · OSTI ID:929189
The definition of what makes an article interesting varies from user to user and continually evolves even for a single user. As a result, in news recommendation systems, useless document features cannot be determined a priori, and all features are usually considered for interestingness classification. Consequently, the presence of currently useless features degrades classification performance [1], particularly over the initial set of news articles being classified. This initial set of documents is critical for a user deciding which news recommendation system to adopt. To address these problems, we introduce an improved version of the naive Bayes classifier with online feature selection. We use correlation to determine the utility of each feature, and we take advantage of the conditional independence assumption used by naive Bayes to perform feature selection and classification online. The augmented naive Bayes classifier performs 28% better than the traditional naive Bayes classifier in recommending news articles from the Yahoo! RSS feeds.
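The abstract does not specify the correlation measure, the selection rule, or the event model, so the following is only a minimal sketch of the general idea and not the authors' implementation. It assumes binary (present/absent) features, the Pearson (phi) correlation between a feature and the label, and a fixed cutoff `min_abs_corr`; the class name `OnlineSelectiveNB` and all parameter names are hypothetical. Because naive Bayes treats features as conditionally independent given the class, each feature's counts are stored separately, so the set of active features can be re-chosen at every prediction without retraining.

```python
import math


class OnlineSelectiveNB:
    """Bernoulli naive Bayes that re-selects features at prediction time.

    Illustrative sketch only: the threshold rule and the phi correlation
    are assumptions, not details taken from the paper.
    """

    def __init__(self, min_abs_corr=0.1, alpha=1.0):
        self.min_abs_corr = min_abs_corr  # assumed selection cutoff
        self.alpha = alpha                # Laplace smoothing constant
        self.n = 0                        # documents seen so far
        self.n_pos = 0                    # documents labeled interesting (1)
        self.count = {}                   # feature -> documents containing it
        self.count_pos = {}               # feature -> interesting documents containing it

    def update(self, features, label):
        """Fold one labeled document (label is 0 or 1) into the counts."""
        self.n += 1
        self.n_pos += label
        for f in set(features):
            self.count[f] = self.count.get(f, 0) + 1
            self.count_pos[f] = self.count_pos.get(f, 0) + label

    def correlation(self, f):
        """Phi coefficient between presence of feature f and the label."""
        n, n1 = self.n, self.n_pos
        nf, nf1 = self.count.get(f, 0), self.count_pos.get(f, 0)
        denom = math.sqrt(nf * (n - nf) * n1 * (n - n1))
        if denom == 0:
            return 0.0  # constant feature or constant label: no signal
        return (n * nf1 - nf * n1) / denom

    def selected(self):
        """Features whose current correlation with the label passes the cutoff."""
        return {f for f in self.count
                if abs(self.correlation(f)) >= self.min_abs_corr}

    def log_odds(self, features):
        """Log-odds that a document is interesting, using selected features only."""
        present = set(features)
        a = self.alpha
        n_neg = self.n - self.n_pos
        score = math.log((self.n_pos + a) / (n_neg + a))  # smoothed prior
        for f in self.selected():
            p_pos = (self.count_pos[f] + a) / (self.n_pos + 2 * a)
            p_neg = (self.count[f] - self.count_pos[f] + a) / (n_neg + 2 * a)
            if f in present:
                score += math.log(p_pos / p_neg)
            else:
                score += math.log((1 - p_pos) / (1 - p_neg))
        return score
```

In this sketch a feature that appears in every document (for example a stop word) has zero correlation with the label and is skipped, while a feature that becomes useful later is picked up as soon as its correlation crosses the cutoff, since its counts were being maintained all along.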
Research Organization: Lawrence Livermore National Laboratory (LLNL), Livermore, CA
Sponsoring Organization: USDOE
DOE Contract Number: W-7405-ENG-48
OSTI ID: 929189
Report Number(s): UCRL-CONF-235295
Country of Publication: United States
Language: English

Similar Records

Tracking Multiple Topics for Finding Interesting Articles
Conference · Wed Feb 14 23:00:00 EST 2007 · OSTI ID:913552

Measuring the Interestingness of Articles in a Limited User Environment
Thesis/Dissertation · Mon Dec 31 23:00:00 EST 2007 · OSTI ID:945553

iScore: news filtering software
Software · Thu Dec 06 00:00:00 EST 2007 · OSTI ID:1304622