Method and system to discover and recommend interesting documents
Disclosed are several examples of systems that can read millions of news feeds per day about topics (e.g., your customers, competitors, markets, and partners), and provide a small set of the most relevant items to read to keep current with the overwhelming amount of information currently available. Topics of interest can be chosen by the user of the system for use as seeds. The seeds can be vectorized and compared with the target documents to determine their similarity. The similarities can be sorted from highest to lowest so that the most similar seed and target documents are at the top of the list. This output can be produced in XML format so that an RSS Reader can format the XML. This allows for easy Internet access to these recommendations.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-00OR22725
- Assignee:
- UT-Battelle LLC (Oak Ridge, TN)
- Patent Number(s):
- 9,558,185
- Application Number:
- 13/737,652
- OSTI ID:
- 1341872
- Country of Publication:
- United States
- Language:
- English
TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams
|
conference | December 2006 |
A geometric view on bilingual lexicon extraction from comparable corpora
|
conference | January 2004 |
Similar Records
Recommended documentation for computer users at ANL
Distribute The Highest Selected Textual Recommendation (DTHSTR)