Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Adaptive image enhancement of text images that contain touching or broken characters

Technical Report ·
OSTI ID:42491

Text images that contain touching or broken characters can significantly degrade the accuracy of optical character recognition (OCR) systems. This paper proposes an adaptive image restoration technique that can improve OCR accuracy by enhancing touching or broken character images. The technique begins by processing a distorted text image with an OCR system. Using the distorted text image and output information from the OCR system, an inverse model of the distortion that caused the touching or broken character problem is generated. After generating the inverse model, the unrecognized distorted characters are filtered by the inverse model and then processes by the OCR system. To demonstrate its feasibility, six distorted text images were processed using this technique. Four of the text images, two with touching characters and two with broken characters, were synthesized using mathematical distortion models. The remaining two distorted text images, one with touching characters and one with broken characters, were distorted using a photocopier. The performance of the adaptive image restoration technique was measured using pixel accuracy and OCR improvement. The examples demonstrate that this technique can improve both the pixel and OCR accuracy of text images containing touching or broken characters.

Research Organization:
Nevada Univ., Las Vegas, NV (United States)
Sponsoring Organization:
USDOE, Washington, DC (United States)
DOE Contract Number:
FC08-90NV10872
OSTI ID:
42491
Report Number(s):
DOE/NV/10872--T176; ON: DE95010173
Country of Publication:
United States
Language:
English

Similar Records

UNLV Information Science Research Institute 1995 annual report
Technical Report · Tue Aug 01 00:00:00 EDT 1995 · OSTI ID:93939

Prediction of OCR accuracy using simple image features
Technical Report · Fri Mar 31 23:00:00 EST 1995 · OSTI ID:46719

Information Science Research Institute: 1994 annual report
Technical Report · Thu Sep 01 00:00:00 EDT 1994 · OSTI ID:10184080