ROPE: Recoverable Order-Preserving Embedding of Natural Language

Widemann, David P.; Wang, Eric X.; Thiagarajan, Jayaraman J.

doi:10.2172/1239214


Technical Report · February 11, 2016

DOI: https://doi.org/10.2172/1239214 · OSTI ID: 1239214

Widemann, David P. [1]; Wang, Eric X. [1]; Thiagarajan, Jayaraman J. [1]

[1] Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

We present a novel Recoverable Order-Preserving Embedding (ROPE) of natural language. ROPE maps natural language passages from sparse concatenated one-hot representations to distributed vector representations of predetermined fixed length. We use Euclidean distance to return search results that are both grammatically and semantically similar. ROPE is based on a series of random projections of distributed word embeddings. We show that our technique typically forms a dictionary with sufficient incoherence such that sparse recovery of the original text is possible. We then show how our embedding allows for efficient and meaningful natural search and retrieval on Microsoft’s COCO dataset and the IMDB Movie Review dataset.
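The abstract does not spell out the construction, so the following is only a minimal sketch of one possible reading, not the authors' implementation. It assumes that each word position gets its own random Gaussian projection and that a passage embedding is the sum of the projected word vectors. The vocabulary, the dimensions, and every function name (`embed`, `search`, `mutual_coherence`) are hypothetical, and the word vectors are random stand-ins for trained embeddings.

```python
import math
import random

# All sizes and names here are illustrative assumptions, not values from the report.
VOCAB = ["a", "dog", "cat", "chases", "sees", "the"]
WORD_DIM = 8      # length of each distributed word vector
EMBED_DIM = 64    # predetermined fixed length of the passage embedding
MAX_WORDS = 4     # longest passage this sketch supports

rng = random.Random(0)

# Stand-in word embeddings: random vectors (a real system would use trained ones).
word_vecs = {w: [rng.gauss(0.0, 1.0) for _ in range(WORD_DIM)] for w in VOCAB}

# One independent random projection per word position, so word order changes the result.
projections = [
    [[rng.gauss(0.0, 1.0 / math.sqrt(EMBED_DIM)) for _ in range(WORD_DIM)]
     for _ in range(EMBED_DIM)]
    for _ in range(MAX_WORDS)
]

def atom(position, word):
    """Dictionary column for `word` at `position`: projection matrix times word vector."""
    vec = word_vecs[word]
    return [sum(m * v for m, v in zip(row, vec)) for row in projections[position]]

def embed(passage):
    """Sum the position-specific projections of the words into one fixed-length vector."""
    words = passage.split()
    if len(words) > MAX_WORDS:
        raise ValueError("passage longer than MAX_WORDS")
    total = [0.0] * EMBED_DIM
    for position, word in enumerate(words):
        total = [t + a for t, a in zip(total, atom(position, word))]
    return total

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def search(query, corpus):
    """Return corpus passages ordered by Euclidean distance to the query embedding."""
    q = embed(query)
    return sorted(corpus, key=lambda p: euclidean(q, embed(p)))

def mutual_coherence():
    """Largest absolute cosine between two distinct dictionary columns."""
    atoms = [atom(t, w) for t in range(MAX_WORDS) for w in VOCAB]
    norms = [math.sqrt(sum(x * x for x in a)) for a in atoms]
    best = 0.0
    for i in range(len(atoms)):
        for j in range(i + 1, len(atoms)):
            dot = sum(x * y for x, y in zip(atoms[i], atoms[j]))
            best = max(best, abs(dot) / (norms[i] * norms[j]))
    return best
```

In this sketch, `embed("dog chases cat")` and `embed("cat chases dog")` differ because each position uses a different projection, and `mutual_coherence()` reports how close the worst pair of dictionary columns is to parallel. Sparse recovery of the original words generally becomes more reliable as that value shrinks, which is the kind of incoherence the abstract refers to.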


Research Organization: Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Sponsoring Organization: USDOE

DOE Contract Number: AC52-07NA27344

OSTI ID: 1239214

Report Number(s): LLNL-TR-682663

Country of Publication: United States

Language: English

Similar Records

A Computational Theory for the Emergence of Grammatical Categories in Cortical Dynamics

Journal Article · April 16, 2020 · Frontiers in Neural Circuits · OSTI ID:1239214

Dematties, Dario; Rizzi, Silvio; Thiruvathukal, George K.; +3 more

Computationally Efficient Learning of Quality Controlled Word Embeddings for Natural Language Processing

Conference · July 1, 2019 · OSTI ID:1239214

Alawad, Mohammed; Tourassi, Georgia

Left-corner unification-based natural language processing

Conference · December 31, 1996 · OSTI ID:1239214

Lytinen, S L; Tomuro, N

Related Subjects

97 MATHEMATICS, COMPUTING, AND INFORMATION SCIENCE
