Method and system to discover and recommend interesting documents

Potok, Thomas Eugene; Steed, Chad Allen; Patton, Robert Matthew


Title: Method and system to discover and recommend interesting documents

Abstract

Disclosed are several examples of systems that can read millions of news feeds per day about topics (e.g., your customers, competitors, markets, and partners), and provide a small set of the most relevant items to read to keep current with the overwhelming amount of information currently available. Topics of interest can be chosen by the user of the system for use as seeds. The seeds can be vectorized and compared with the target documents to determine their similarity. The similarities can be sorted from highest to lowest so that the most similar seed and target documents are at the top of the list. This output can be produced in XML format so that an RSS Reader can format the XML. This allows for easy Internet access to these recommendations.
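The pipeline outlined in the abstract (vectorize seeds, compare with target documents, sort by similarity, emit XML for an RSS reader) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration rather than the patented method: plain term-frequency vectors stand in for whatever term weighting the system actually uses, cosine similarity is assumed as the comparison, and the function names (`vectorize`, `cosine`, `recommend`, `to_rss`) and the placeholder channel link are hypothetical.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter


def vectorize(text):
    """Sparse term-frequency vector (term -> count). Plain TF is an assumption."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(seeds, targets, top_n=10):
    """Score every (seed, target) pair and return the top_n, most similar first.

    seeds and targets are dicts mapping a title to its text.
    Each result row is (score, seed_title, target_title).
    """
    seed_vecs = {title: vectorize(text) for title, text in seeds.items()}
    target_vecs = {title: vectorize(text) for title, text in targets.items()}
    scored = [
        (cosine(seed_vec, target_vec), seed_title, target_title)
        for seed_title, seed_vec in seed_vecs.items()
        for target_title, target_vec in target_vecs.items()
    ]
    scored.sort(key=lambda row: row[0], reverse=True)  # highest similarity first
    return scored[:top_n]


def to_rss(scored, channel_title="Recommended documents"):
    """Serialize ranked rows as a small RSS 2.0 document (returned as a string)."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = channel_title
    # Placeholder link, not a real endpoint.
    ET.SubElement(channel, "link").text = "https://example.invalid/recommendations"
    ET.SubElement(channel, "description").text = "Targets ranked by similarity to seeds"
    for score, seed_title, target_title in scored:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = target_title
        ET.SubElement(item, "description").text = (
            f"similarity {score:.3f} to seed '{seed_title}'"
        )
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    seeds = {"energy": "solar energy storage batteries"}
    targets = {
        "a": "new batteries improve solar energy storage",
        "b": "football season opens",
    }
    print(to_rss(recommend(seeds, targets)))
```

A production system reading millions of feeds per day would need an index or a pre-filtering step rather than this all-pairs comparison; the sketch only shows the shape of the data flow.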

Inventors: Potok, Thomas Eugene; Steed, Chad Allen; Patton, Robert Matthew

Issue Date: January 31, 2017

Research Org.: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring Org.: USDOE

OSTI Identifier: 1341872

Patent Number(s): 9558185

Application Number: 13/737,652

Assignee: UT-Battelle LLC (Oak Ridge, TN)

Patent Classifications (CPCs):
G - PHYSICS; G06 - COMPUTING; G06F - ELECTRIC DIGITAL DATA PROCESSING
G06F16/334 - Query execution
G06F16/93 - Document management systems
G06F16/951 - Indexing

DOE Contract Number: AC05-00OR22725

Resource Type: Patent

Resource Relation: Patent File Date: January 9, 2013

Country of Publication: United States

Language: English

Subject: 99 GENERAL AND MISCELLANEOUS


Works referenced in this record:

Retrieval system and method
patent, September 1999

Cohen, Edith; Lewis, David Dolan
US Patent Document 5,950,189
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/5950189

Methods and apparatus for similarity text search based on conceptual indexing
patent, April 2003

Aggarwal, Charu C.; Yu, Philip Shi-Lung
US Patent Document 6,542,889
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/6542889

Method and apparatus for measuring similarity among electronic documents
patent, January 2006

Palmer, Michael E.; Sun, Gordon; Zha, Hongyuan
US Patent Document 6,990,628
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/6990628

Ontology-based information management system and method
patent, May 2007

Gardner, Steve
US Patent Document 7,225,183
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7225183

Method for gathering and summarizing internet information
patent, April 2010

Potok, Thomas E.; Elmore, Mark Thomas; Reed, Joel Wesley
US Patent Document 7,693,903
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7693903

Agent-based method for distributed clustering of textual information
patent, September 2010

Potok, Thomas E.; Reed, Joel Wesley; Elmore, Mark Thomas
US Patent Document 7,805,446
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7805446

Latent semantic clustering
patent, November 2010

Wnek, Janusz
US Patent Document 7,844,566
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7844566

Dynamic reduction of dimensions of a document vector in a document search and retrieval system
patent, May 2011

Jiao, Yu; Potok, Thomas E.
US Patent Document 7,937,389
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7937389

Systems and methods for identifying similar documents
patent, June 2011

Curtis, Taylor; Heafield, Kenneth
US Patent Document 7,958,136
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/7958136

Process for the Document Management and Computer-Assisted Translation of Documents Utilizing Document Corpora Constructed by Intelligent Agents
patent-application, August 2003

Shreve, Gregory M.
US Patent Application 10/073516; 20030154071
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20030154071

Method and System for Linking Documents with Multiple Topics to Related Documents
patent-application, June 2007

Miller, David James
US Patent Application 11/295531; 20070130100
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20070130100

Document Similarity Scoring and Ranking Method, Device and Computer Program Product
patent-application, August 2007

Canright, Geoffrey; Engo-Monsen, Kenth
US Patent Application 11/349235; 20070185871
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20070185871

Identifying Information Related to a Particular Entity from Electronic Sources
patent-application, March 2009

Gabriel, Raefer Christopher; Fertik, Michael Benjamin; Tripp, Owen Wheble
US Patent Application 12/209169; 20090070325
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20090070325

Document Processing Device and Document Processing Method
patent-application, May 2009

Ochi, Shingo; Hino, Takanori
US Patent Application 12/294135; 20090132566
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20090132566

Data Classification Using Machine Learning Techniques
patent-application, August 2011

Schmidtler, Mauritius A. R.; Borrey, Roland; Sarah, Anthony
US Patent Application 13/090216; 20110196870
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20110196870

Similarity Score Lookup and Representation
patent-application, November 2014

Chen, Lijiang; Hou, Hui-Man; Chen, Shimin
US Patent Application 14/372712; 20140337337
URL: https://image-ppubs.uspto.gov/dirsearch-public/print/downloadPdf/20140337337

TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams
conference, December 2006

Reed, Joel; Jiao, Yu; Potok, Thomas
2006 5th International Conference on Machine Learning and Applications (ICMLA'06)
https://doi.org/10.1109/ICMLA.2006.50

A geometric view on bilingual lexicon extraction from comparable corpora
conference, January 2004

Gaussier, E.; Renders, J. -M.; Matveeva, I.
Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics - ACL '04
https://doi.org/10.3115/1218955.1219022

Similar Records in DOE Patents and OSTI.GOV collections:

Method and system of filtering and recommending documents

Patent Patton, Robert M.; Potok, Thomas E.

Disclosed is a method and system for discovering documents using a computer and providing a small set of the most relevant documents to the attention of a human observer. Using the method, the computer obtains a seed document from the user and generates a seed document vector using term frequency-inverse corpus frequency weighting. A keyword index for a plurality of source documents can be compared with the weighted terms of the seed document vector. The comparison is then filtered to reduce the number of documents, which define an initial subset of the source documents. Initial subset vectors are generated and ...

Method of recommending items to a user based on user interest

Patent Bollen, John; Van De Sompel, Herbert

Although recording of usage data is common in scholarly information services, its exploitation for the creation of value-added services remains limited due to concerns regarding, among others, user privacy, data validity, and the lack of accepted standards for the representation, sharing and aggregation of usage data. A technical, standards-based architecture for sharing usage information is presented. In this architecture, OpenURL-compliant linking servers aggregate usage information of a specific user community as it navigates the distributed information environment that it has access to. This usage information is made OAI-PMH harvestable so that usage information exposed by many linking servers can be ...

Recommending personally interested contents by text mining, filtering, and interfaces

Patent Xu, Songhua

A personalized content recommendation system includes a client interface device configured to monitor a user's information data stream. A collaborative filter remote from the client interface device generates automated predictions about the interests of the user. A database server stores personal behavioral profiles and user's preferences based on a plurality of monitored past behaviors and an output of the collaborative user personal interest inference engine. A programmed personal content recommendation server filters items in an incoming information stream with the personal behavioral profile and identifies only those items of the incoming information stream that substantially match the personal behavioral profile.

System and method for identifying objects of interest in images based on likelihood map decluttering

Patent Paglieroni, David W.; Martz, Jr., Harry E.

An automatic threat recognition system and method is disclosed for scanning the x-ray CT image of an article to identify the objects of interest (OOIs) contained within the article, which are otherwise not always quickly apparent or discernable to an individual. The system uses a computer to receive information from two-dimensional (2D) image slices from a reconstructed computed tomography (CT) scan image and to produce a plurality of voxels for each slice of the 2D image. The computer analyzes the voxels to create a likelihood map (LM) representing likelihoods that voxels making up the CT image are associated with a ...

System and methods for automated detection, reasoning and recommendations for resilient cyber systems

Patent Choudhury, Sutanay; Agarwal, Khushbu; Chen, Pin-Yu; ...

A method for securing an IT (information technology) system using a set of methods for knowledge extraction, event detection, risk estimation and explanation for ranking cyber-alerts which includes a method to explain the relationship (or an attack pathway) from an entity (user or host) and an event context to another entity (a high-value resource) and an event context (attack or service failure).
