Toward a multi-sensor neural net approach to automatic text classification
- Sacred Heart Univ., Fairfield, CT (United States). Department of Computer Science and Information Technology
- Oak Ridge National Laboratory, TN (United States)
Many automatic text indexing and retrieval methods use a term-document matrix that is automatically derived from the text in question. Latent Semantic Indexing, a recent method for approximating large term-document matrices, appears to be quite useful in the problem of text information retrieval, rather than text classification. Here we outline a method that attempts to combine the strength of the LSI method with that of neural networks, in addressing the problem of text classification. In doing so, we also indicate ways to improve performance by adding additional {open_quotes}logical sensors{close_quotes} to the neural network, something that is hard to do with the LSI method when employed by itself. Preliminary results are summarized, but much work remains to be done.
- Research Organization:
- Oak Ridge National Lab., TN (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- AC05-96OR22464
- OSTI ID:
- 266901
- Report Number(s):
- CONF-9609113--1; ON: DE96008633
- Country of Publication:
- United States
- Language:
- English
Similar Records
Information fusion for automatic text classification
Neural net learning issues in classification of free text documents