skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Method and system to discover and recommend interesting documents

Patent ·
OSTI ID:1341872

Disclosed are several examples of systems that can read millions of news feeds per day about topics (e.g., your customers, competitors, markets, and partners), and provide a small set of the most relevant items to read to keep current with the overwhelming amount of information currently available. Topics of interest can be chosen by the user of the system for use as seeds. The seeds can be vectorized and compared with the target documents to determine their similarity. The similarities can be sorted from highest to lowest so that the most similar seed and target documents are at the top of the list. This output can be produced in XML format so that an RSS Reader can format the XML. This allows for easy Internet access to these recommendations.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
Assignee:
UT-Battelle LLC (Oak Ridge, TN)
Patent Number(s):
9,558,185
Application Number:
13/737,652
OSTI ID:
1341872
Resource Relation:
Patent File Date: 2013 Jan 09
Country of Publication:
United States
Language:
English

References (18)

Retrieval system and method patent September 1999
Methods and apparatus for similarity text search based on conceptual indexing patent April 2003
Method and apparatus for measuring similarity among electronic documents patent January 2006
Ontology-based information management system and method patent May 2007
Method for gathering and summarizing internet information patent April 2010
Agent-based method for distributed clustering of textual information patent September 2010
Latent semantic clustering patent November 2010
Dynamic reduction of dimensions of a document vector in a document search and retrieval system patent May 2011
Systems and methods for identifying similar documents patent June 2011
Process for the Document Management and Computer-Assisted Translation of Documents Utilizing Document Corpora Constructed by Intelligent Agents patent-application August 2003
Method and System for Linking Documents with Multiple Topics to Related Documents patent-application June 2007
Document Similarity Scoring and Ranking Method, Device and Computer Program Product patent-application August 2007
Identifying Information Related to a Particular Entity from Electronic Sources patent-application March 2009
Document Processing Device and Document Processing Method patent-application May 2009
Data Classification Using Machine Learning Techniques patent-application August 2011
Similarity Score Lookup and Representation patent-application November 2014
TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams conference December 2006
A geometric view on bilingual lexicon extraction from comparable corpora conference January 2004

Similar Records

Related Subjects