Text encoding for retrieving scientific and technical information
The authors have recently proposed a method to manage and retrieve scientific and technical information based on the encoding of information within the document using Standard Generalized Markup Language (SGML). The encoding will include the scientific and technical information, as well as associated and ancillary scientific data, management data, and other metadata. The work proposed at the Oak Ridge National Laboratory will include tagset selection, document type definition writing, test document encoding, and programming that will permit various views of a document to be made. This paper describes the proposed method and work. Retrieval of information contained in documents is conventionally done by means of Boolean searches that return a set of documents with a high probability of containing the desired information. Standardized terminologies have long been a key element in constructing efficient search plans for these searches. Combined with additional techniques such as corpus selection and restriction by author or institution, effective searches can often be constructed. As corpus size increases, however, the standard methods become progressively less effective. The planned method described here takes a radically different approach to the retrieval problem: its objective is the direct retrieval of information rather than of documents likely to contain it. The work of this study will be carried out by a multidisciplinary team of engineering, scientific, publication, and information specialists. Computer programs to retrieve information from encoded sample documents will be either acquired or written and then tested. At the final stage of the work, after evaluation of the programs, plans will be developed to apply the method to a wide range of specialized fields.
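The contrast between document-level Boolean searching and direct retrieval of encoded information can be illustrated with a minimal sketch. It is not the authors' implementation: the tag names (`report`, `quantity`), the attributes, and the sample values are hypothetical, and a well-formed XML fragment parsed with Python's standard library stands in for SGML, whose tagset and document type definition the abstract does not specify.

```python
import xml.etree.ElementTree as ET

# Hypothetical encoded document. The tagset is an assumption made for
# illustration; the abstract does not list the tags or the DTD.
SAMPLE = """
<report id="r1">
  <title>Thermal conductivity of a sample alloy</title>
  <body>
    <para>The measured value was
      <quantity name="thermal conductivity" value="12.5" unit="W/(m K)"/>
      at <quantity name="temperature" value="300" unit="K"/>.</para>
  </body>
</report>
"""


def boolean_search(docs, terms):
    """Conventional approach: return the IDs of documents that contain
    every search term (Boolean AND over the raw text)."""
    hits = []
    for doc in docs:
        text = doc.lower()
        if all(term.lower() in text for term in terms):
            hits.append(ET.fromstring(doc.strip()).get("id"))
    return hits


def direct_retrieve(docs, quantity_name):
    """Direct approach: return (document ID, value, unit) for each
    encoded quantity with the requested name."""
    results = []
    for doc in docs:
        root = ET.fromstring(doc.strip())
        for q in root.iter("quantity"):
            if q.get("name") == quantity_name:
                results.append((root.get("id"), float(q.get("value")), q.get("unit")))
    return results


if __name__ == "__main__":
    corpus = [SAMPLE]
    print(boolean_search(corpus, ["thermal", "conductivity"]))
    print(direct_retrieve(corpus, "thermal conductivity"))
```

In this sketch the Boolean search only identifies which documents are candidates, leaving the reader to locate the value, whereas the query over the markup returns the encoded datum itself together with its unit.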
- Research Organization: Oak Ridge National Lab., TN (United States)
- Sponsoring Organization: DOE; USDOE, Washington, DC (United States)
- DOE Contract Number: AC05-84OR21400
- OSTI ID: 5970336
- Report Number(s): CONF-9110249-1; ON: DE92004403
- Country of Publication: United States
- Language: English
Similar Records
SGML encoding for technical reports
SGML encoding for technical reports. [Standard Generalized Markup Language (SGML)]
Related Subjects
990200 -- Mathematics & Computers
990301* -- Information Handling-- Data Handling-- (1992-)
COOPERATION
DATA
DATA BASE MANAGEMENT
DATA COMPILATION
ENGINEERING PERSONNEL
INFORMATION
INFORMATION RETRIEVAL
MANAGEMENT
MATHEMATICAL LOGIC
PERSONNEL
PROFESSIONAL PERSONNEL
RESEARCH PROGRAMS
SCIENTIFIC PERSONNEL
STANDARDIZED TERMINOLOGY