U.S. Department of Energy
Office of Scientific and Technical Information

Information fusion for automatic text classification

Conference
OSTI ID:378178
 [1]; ;  [2]
  1. Department of Computer Science and Information Technology, Sacred Heart University, Fairfield, CT (United States)
  2. Computer and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN (United States)

Analysis and classification of free text documents encompass decision-making processes that rely on several clues derived from text and other contextual information. When using multiple clues, it is generally not known a priori how these should be integrated into a decision. An algorithmic sensor based on Latent Semantic Indexing (LSI) (a recent successful method for text retrieval rather than classification) is the primary sensor used in our work, but its utility is limited by the reference library of documents. Thus, there is an important need to complement or at least supplement this sensor. We have developed a system that uses a neural network to integrate the LSI-based sensor with other clues derived from the text. This approach allows for systematic fusion of several information sources in order to determine a combined best decision about the category to which a document belongs.
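
The fusion idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It makes several assumptions that do not come from the record: a toy six-document reference library, a two-dimensional LSI space built with a truncated SVD, a hypothetical keyword-cue sensor standing in for the "other clues", and a single softmax layer standing in for the neural network, whose actual architecture is not described here. All names (`lsi_sensor`, `keyword_sensor`, `train_fusion`, `classify`) are illustrative.

```python
import numpy as np

# Hypothetical toy reference library: one row per document, one column per term.
VOCAB = ["reactor", "neutron", "fuel", "protein", "gene", "cell"]
LIBRARY = np.array([
    [2, 1, 1, 0, 0, 0],
    [1, 2, 0, 0, 0, 0],
    [1, 0, 2, 0, 0, 0],
    [0, 0, 0, 2, 1, 1],
    [0, 0, 0, 1, 2, 0],
    [0, 0, 0, 1, 0, 3],
], dtype=float)
LABELS = np.array([0, 0, 0, 1, 1, 1])  # assumed categories: 0 = nuclear, 1 = biology
N_CLASSES = 2
K = 2  # number of latent dimensions kept (an arbitrary choice for this toy data)

# LSI: truncated SVD of the term-by-document matrix.
U, S, Vt = np.linalg.svd(LIBRARY.T, full_matrices=False)
U_k, S_k, DOCS_k = U[:, :K], S[:K], Vt[:K].T  # DOCS_k: one latent row per document

def lsi_sensor(query):
    """Best cosine similarity to the reference documents of each category."""
    q_k = (query @ U_k) / S_k  # fold the query into the latent space
    norms = np.linalg.norm(DOCS_k, axis=1) * np.linalg.norm(q_k) + 1e-12
    sims = (DOCS_k @ q_k) / norms
    return np.array([sims[LABELS == c].max() for c in range(N_CLASSES)])

# Hypothetical second clue: share of the term counts that hit per-category cue words.
CUES = {0: ["reactor", "fuel"], 1: ["gene", "cell"]}

def keyword_sensor(query):
    total = query.sum() + 1e-12
    return np.array([
        sum(query[VOCAB.index(t)] for t in CUES[c]) / total
        for c in range(N_CLASSES)
    ])

def features(query):
    """Concatenate the sensor outputs into one input vector for the fusion layer."""
    return np.concatenate([lsi_sensor(query), keyword_sensor(query)])

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract the row maximum for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_fusion(X, y, steps=500, lr=0.5):
    """Fit a single softmax layer by batch gradient descent on cross-entropy."""
    W = np.zeros((X.shape[1], N_CLASSES))
    b = np.zeros(N_CLASSES)
    Y = np.eye(N_CLASSES)[y]  # one-hot targets
    for _ in range(steps):
        G = (softmax(X @ W + b) - Y) / len(X)
        W -= lr * (X.T @ G)
        b -= lr * G.sum(axis=0)
    return W, b

def classify(query, W, b):
    """Return the fused category decision and the class probabilities."""
    probs = softmax(features(query)[None, :] @ W + b)[0]
    return int(probs.argmax()), probs

# Train the fusion layer on the sensor outputs for the reference library itself.
X = np.array([features(doc) for doc in LIBRARY])
W, b = train_fusion(X, LABELS)
```

Calling `classify(np.array([1., 1., 1., 0., 0., 0.]), W, b)` returns the fused category together with the class probabilities. In this sketch the fusion layer learns how much weight each sensor output deserves, which reflects the point that the right way to combine clues is not known a priori.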

Research Organization:
Oak Ridge National Lab., TN (United States)
Sponsoring Organization:
USDOE, Washington, DC (United States)
DOE Contract Number:
AC05-96OR22464
OSTI ID:
378178
Report Number(s):
CONF-9608120--1; ON: DE96013781
Country of Publication:
United States
Language:
English

Similar Records

Toward a multi-sensor neural net approach to automatic text classification
Conference · Thu Jan 25 23:00:00 EST 1996 · OSTI ID:266901

Toward a multi-sensor-based approach to automatic text classification
Technical Report · Sun Oct 01 00:00:00 EDT 1995 · OSTI ID:130610

Neural net learning issues in classification of free text documents
Conference · Thu Feb 29 23:00:00 EST 1996 · OSTI ID:212422