An automated system for numerically rating document image quality
Conference · OSTI ID:463673
- Los Alamos National Lab., NM (United States)
- Louisiana State Univ., Baton Rouge, LA (United States)
As part of the Department of Energy document declassification program, the authors have developed a numerical rating system that predicts the OCR error rate to expect when processing a particular document. The rating algorithm produces a quality vector containing scores for document image attributes such as speckle and touching characters. The predicted OCR error rate for a document is computed as a weighted sum of the elements of its quality vector. This predicted rate will be used to screen out documents that existing document processing products would not handle properly.
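The weighted-sum prediction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the weights, the example scores, the cutoff `max_error_rate`, and the function names are all hypothetical, and only the two attribute names (speckle, touching characters) come from the abstract.

```python
from typing import Mapping

# Hypothetical weights, one per quality attribute. The record does not give
# the actual weights or the full list of attributes.
EXAMPLE_WEIGHTS = {"speckle": 0.5, "touching_characters": 0.25}


def predict_error_rate(quality: Mapping[str, float],
                       weights: Mapping[str, float]) -> float:
    """Return the weighted sum of the quality-vector elements."""
    missing = set(weights) - set(quality)
    if missing:
        raise KeyError(f"quality vector lacks scores for: {sorted(missing)}")
    return sum(weights[name] * quality[name] for name in weights)


def should_screen_out(quality: Mapping[str, float],
                      weights: Mapping[str, float],
                      max_error_rate: float) -> bool:
    """Flag a document whose predicted error rate exceeds an assumed cutoff."""
    return predict_error_rate(quality, weights) > max_error_rate


if __name__ == "__main__":
    # Made-up attribute scores for a single document image.
    page = {"speckle": 0.2, "touching_characters": 0.4}
    rate = predict_error_rate(page, EXAMPLE_WEIGHTS)
    print(f"predicted OCR error rate: {rate:.3f}")
    print("screen out:", should_screen_out(page, EXAMPLE_WEIGHTS, max_error_rate=0.15))
```

In practice the weights would presumably be fitted against measured OCR error rates on a set of rated documents; the abstract does not say how they were obtained.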
- Research Organization:
- Los Alamos National Lab., NM (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States)
- DOE Contract Number:
- W-7405-ENG-36
- OSTI ID:
- 463673
- Report Number(s):
- LA-UR--97-214; CONF-970231--24; ON: DE97004756
- Country of Publication:
- United States
- Language:
- English
Similar Records
Validation of document image defect models for optical character recognition
Technical Report · Fri Dec 30 23:00:00 EST 1994 · OSTI ID:68571
UNLV Information Science Research Institute 1995 annual report
Technical Report · Tue Aug 01 00:00:00 EDT 1995 · OSTI ID:93939
Adaptive image enhancement of text images that contain touching or broken characters
Technical Report · Mon Nov 28 23:00:00 EST 1994 · OSTI ID:42491