Document and Content Analysis
Lecture 06 - Document Image Analysis
�Faisal Shafait
26.05.2011
How to convert a document image into editable text?
Optical Character Recognition (OCR)
We will learn how OCR works in the next four lectures!
How to convert a document image into editable text?
Optical Character Recognition (OCR)
We will learn how OCR works in the next four lectures!
How to convert a document image into editable text?
Optical Character Recognition (OCR)
We will learn how OCR works in the next four lectures!
A typical scanned book page
Character Recognition
Isolated character recognition can be done as a standard pattern recognition problem, but a lot more needs to be done for a complete OCR system
Character Recognition
Flow chart for OCR
Binarization
Effect of binarization on OCR
Binarization algorithms
Global Binarization
Global Binarization
Otsu Global Thresholding
Let be the normalized histogram of the image
Otsu Global Thresholding
Local Adaptive Thresholding
White (1983):
Niblack (1986):
Sauvola (2000):
Sauvola Local Thresholding
Local Vs Global Thresholding
Shafait Binarization (2008)
Connected Component Analysis
Connected Component Analysis
Connected Component Analysis Example
Other Pre-processing Tasks
Orientation Detection
Marginal Noise Removal
Skew Correction
Page Segmentation
Incorrect Page Segmentation
Incorrect Page Segmentation
Page Segmentation Algorithms
Run-Length Smearing Algorithm
Run-Length Smearing Algorithm
Run-Length Smearing Algorithm
(a) Original Image
(b) Horizontally Smeared Image
(c) Vertically Smeared Image
Run-Length Smearing Algorithm
(d) Final Image after Smoothing
(e) Identified text regions
Recursive X-Y Cut Algorithm
Horizontal Projection
Recursive X-Y Cut Algorithm
Things to remember