Rapid automatic keyword extraction for information retrieval and analysis
Patent
·
OSTI ID:1039881
- Richland, WA
Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- Assignee:
- Battelle Memorial Institute (Richland, WA)
- Patent Number(s):
- 8,131,735
- Application Number:
- 12/555,916
- OSTI ID:
- 1039881
- Country of Publication:
- United States
- Language:
- English
Similar Records
Automatic Keyword Extraction from Individual Documents
Automatic generation of stop word lists for information retrieval and analysis
Experiments in automatic word class and word sense identification for information retrieval
Book
·
Mon May 03 00:00:00 EDT 2010
·
OSTI ID:1039881
+1 more
Automatic generation of stop word lists for information retrieval and analysis
Patent
·
Tue Jan 08 00:00:00 EST 2013
·
OSTI ID:1039881
Experiments in automatic word class and word sense identification for information retrieval
Technical Report
·
Sat Dec 31 00:00:00 EST 1994
·
OSTI ID:1039881