Method and system to discover and recommend interesting documents
Abstract
Disclosed are several examples of systems that can read millions of news feeds per day about topics (e.g., your customers, competitors, markets, and partners) and return a small set of the most relevant items, so that a reader can keep current despite the overwhelming amount of information available. The user of the system chooses topics of interest, which serve as seeds. The seeds can be vectorized and compared with the target documents to determine their similarity. The similarity scores can be sorted from highest to lowest so that the most similar seed and target document pairs appear at the top of the list. The output can be produced in XML so that an RSS reader can format and display it, which allows easy Internet access to the recommendations.
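The pipeline described in the abstract (vectorize seeds, compare them with target documents, sort by similarity, emit XML for an RSS reader) can be illustrated with a minimal sketch. This is not the patented implementation: plain term-frequency vectors and cosine similarity are stand-ins chosen for brevity, and the function names (`vectorize`, `cosine`, `recommend`, `to_rss`) and the sample texts are hypothetical. The XML is a bare RSS 2.0 skeleton; a feed meant for real readers would also need the channel `link` and `description` elements.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter


def vectorize(text):
    """Turn text into a term-frequency vector (assumption: simple word counts)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(seeds, targets, top_n=10):
    """Score every (seed, target) pair and return the top_n, highest first.

    seeds and targets are dicts mapping a name or title to its text.
    """
    seed_vecs = {name: vectorize(text) for name, text in seeds.items()}
    scored = []
    for title, text in targets.items():
        target_vec = vectorize(text)
        for name, seed_vec in seed_vecs.items():
            scored.append((cosine(seed_vec, target_vec), name, title))
    scored.sort(key=lambda row: row[0], reverse=True)
    return scored[:top_n]


def to_rss(results, channel_title="Recommended documents"):
    """Serialize ranked results as a minimal RSS 2.0 skeleton."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = channel_title
    for score, seed, title in results:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "description").text = (
            f"seed={seed}; similarity={score:.3f}"
        )
    return ET.tostring(rss, encoding="unicode")


# Hypothetical sample data, for illustration only.
seeds = {"competitors": "competitor product launch pricing"}
targets = {
    "Launch news": "competitor announces product launch",
    "Weather": "weather forecast rain",
    "Price note": "pricing",
}
ranked = recommend(seeds, targets, top_n=3)
feed_xml = to_rss(ranked)
```

Here `ranked` lists the seed and target pairs from most to least similar, and `feed_xml` holds the corresponding XML string. A production system would likely replace the raw counts with a corpus-aware term weighting and an index that avoids scoring every pair.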
- Inventors:
- Potok, Thomas Eugene; Steed, Chad Allen; Patton, Robert Matthew
- Issue Date:
- 2017 Jan 31
- Research Org.:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1341872
- Patent Number(s):
- 9558185
- Application Number:
- 13/737,652
- Assignee:
- UT-Battelle LLC (Oak Ridge, TN)
- Patent Classifications (CPCs):
- G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
- DOE Contract Number:
- AC05-00OR22725
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 2013 Jan 09
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 99 GENERAL AND MISCELLANEOUS
Citation Formats
Potok, Thomas Eugene, Steed, Chad Allen, and Patton, Robert Matthew. Method and system to discover and recommend interesting documents. United States: N. p., 2017. Web.
Potok, Thomas Eugene, Steed, Chad Allen, & Patton, Robert Matthew. Method and system to discover and recommend interesting documents. United States.
Potok, Thomas Eugene, Steed, Chad Allen, and Patton, Robert Matthew. 2017. "Method and system to discover and recommend interesting documents". United States. https://www.osti.gov/servlets/purl/1341872.
@article{osti_1341872,
title = {Method and system to discover and recommend interesting documents},
author = {Potok, Thomas Eugene and Steed, Chad Allen and Patton, Robert Matthew},
abstractNote = {Disclosed are several examples of systems that can read millions of news feeds per day about topics (e.g., your customers, competitors, markets, and partners), and provide a small set of the most relevant items to read to keep current with the overwhelming amount of information currently available. Topics of interest can be chosen by the user of the system for use as seeds. The seeds can be vectorized and compared with the target documents to determine their similarity. The similarities can be sorted from highest to lowest so that the most similar seed and target documents are at the top of the list. This output can be produced in XML format so that an RSS Reader can format the XML. This allows for easy Internet access to these recommendations.},
place = {United States},
year = {2017},
month = jan
}
Works referenced in this record:
Retrieval system and method
patent, September 1999
- Cohen, Edith; Lewis, David Dolan
- US Patent Document 5,950,189
Methods and apparatus for similarity text search based on conceptual indexing
patent, April 2003
- Aggarwal, Charu C.; Yu, Philip Shi-Lung
- US Patent Document 6,542,889
Method and apparatus for measuring similarity among electronic documents
patent, January 2006
- Palmer, Michael E.; Sun, Gordon; Zha, Hongyuan
- US Patent Document 6,990,628
Ontology-based information management system and method
patent, May 2007
- Gardner, Steve
- US Patent Document 7,225,183
Method for gathering and summarizing internet information
patent, April 2010
- Potok, Thomas E.; Elmore, Mark Thomas; Reed, Joel Wesley
- US Patent Document 7,693,903
Agent-based method for distributed clustering of textual information
patent, September 2010
- Potok, Thomas E.; Reed, Joel Wesley; Elmore, Mark Thomas
- US Patent Document 7,805,446
Dynamic reduction of dimensions of a document vector in a document search and retrieval system
patent, May 2011
- Jiao, Yu; Potok, Thomas E.
- US Patent Document 7,937,389
Systems and methods for identifying similar documents
patent, June 2011
- Curtis, Taylor; Heafield, Kenneth
- US Patent Document 7,958,136
Process for the Document Management and Computer-Assisted Translation of Documents Utilizing Document Corpora Constructed by Intelligent Agents
patent-application, August 2003
- Shreve, Gregory M.
- US Patent Application 10/073516; 20030154071
Method and System for Linking Documents with Multiple Topics to Related Documents
patent-application, June 2007
- Miller, David James
- US Patent Application 11/295531; 20070130100
Document Similarity Scoring and Ranking Method, Device and Computer Program Product
patent-application, August 2007
- Canright, Geoffrey; Engo-Monsen, Kenth
- US Patent Application 11/349235; 20070185871
Identifying Information Related to a Particular Entity from Electronic Sources
patent-application, March 2009
- Gabriel, Raefer Christopher; Fertik, Michael Benjamin; Tripp, Owen Wheble
- US Patent Application 12/209169; 20090070325
Document Processing Device and Document Processing Method
patent-application, May 2009
- Ochi, Shingo; Hino, Takanori
- US Patent Application 12/294135; 20090132566
Data Classification Using Machine Learning Techniques
patent-application, August 2011
- Schmidtler, Mauritius A. R.; Borrey, Roland; Sarah, Anthony
- US Patent Application 13/090216; 20110196870
Similarity Score Lookup and Representation
patent-application, November 2014
- Chen, Lijiang; Hou, Hui-Man; Chen, Shimin
- US Patent Application 14/372712; 20140337337
TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams
conference, December 2006
- Reed, Joel; Jiao, Yu; Potok, Thomas
- 2006 5th International Conference on Machine Learning and Applications (ICMLA'06)
A geometric view on bilingual lexicon extraction from comparable corpora
conference, January 2004
- Gaussier, E.; Renders, J.-M.; Matveeva, I.
- Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics - ACL '04