Document Retrieval and Ranking using Similarity Graph Mean Hitting Times
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Galisteo Consulting Group, Inc., Albuquerque, NM (United States)
We present a novel approach to information retrieval and document analysis based on graph analytic methods. Traditional information retrieval methods use a set of terms to define a query that is applied against a document corpus to identify the documents most related to those terms. In contrast, we define a query as a set of documents of interest and apply the query by computing mean hitting times between this set and all other documents on a document similarity graph abstraction of the semantic relationships between all pairs of documents. We present the steps of our approach along with a simple example application illustrating how this approach can be used to find documents related to two or more documents or topics of interest.
- Research Organization:
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA); NMSBA Program
- DOE Contract Number:
- NA0003525
- OSTI ID:
- 1835671
- Report Number(s):
- SAND2021-15731; 702227
- Country of Publication:
- United States
- Language:
- English
Similar Records
VisIRR: A Visual Analytics System for Information Retrieval and Recommendation for Large-Scale Document Data
Metric graph structure for information retrieval. [Considers semantic distance between documents to enhance retrieval relevancy]