skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: An automated system for numerically rating document image quality

Conference ·
OSTI ID:463673
;  [1]; ;  [2]
  1. Los Alamos National Lab., NM (United States)
  2. Louisiana State Univ., Baton Rouge, LA (United States)

As part of the Department of Energy document declassification program, the authors have developed a numerical rating system to predict the OCR error rate that they expect to encounter when processing a particular document. The rating algorithm produces a vector containing scores for different document image attributes such as speckle and touching characters. The OCR error rate for a document is computed from a weighted sum of the elements of the corresponding quality vector. The predicted OCR error rate will be used to screen documents that would not be handled properly with existing document processing products.

Research Organization:
Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE, Washington, DC (United States)
DOE Contract Number:
W-7405-ENG-36
OSTI ID:
463673
Report Number(s):
LA-UR-97-214; CONF-970231-24; ON: DE97004756; TRN: AHC29709%%110
Resource Relation:
Conference: SPIE international symposium, San Jose, CA (United States), 8-14 Feb 1997; Other Information: PBD: [1997]
Country of Publication:
United States
Language:
English