TY - GEN
T1 - An efficient method for document image geometric layout analysis
AU - Chi, Suyoung
AU - Chung, Yunkoo
AU - Jang, Dae Geun
AU - Oh, Weongeun
AU - Lee, Jaeyeon
AU - Changhun, Kim
PY - 2003
Y1 - 2003
N2 - Document image analysis is necessary for optical character recognition (OCR) and also very useful for many other document image manipulations. In this paper, we propose a document image geometric layout analysis system which has less region segmentation and classification error than that of the commercial software and previous works. The proposed method segments the document image into small regions to the size of a character using fast connected components generation method, so that it prevents the different types of connected components from combining. We also propose new criterion for clustering the connected components and some new techniques to deal with noise and reduce computation time. Experiment shows classification error rate of text and picture regions is decreased.
AB - Document image analysis is necessary for optical character recognition (OCR) and also very useful for many other document image manipulations. In this paper, we propose a document image geometric layout analysis system which has less region segmentation and classification error than that of the commercial software and previous works. The proposed method segments the document image into small regions to the size of a character using fast connected components generation method, so that it prevents the different types of connected components from combining. We also propose new criterion for clustering the connected components and some new techniques to deal with noise and reduce computation time. Experiment shows classification error rate of text and picture regions is decreased.
KW - Connected Component Analysis
KW - Document Image Analysis
KW - Optical Character Recognition(OCR)
UR - http://www.scopus.com/inward/record.url?scp=1542359455&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1542359455&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:1542359455
SN - 0889863768
T3 - IASTED International Conference on Computer Graphics and Imaging
SP - 238
EP - 243
BT - IASTED International Conference on Computer Graphics and Imaging
A2 - Hamza, M.H.
A2 - Hamza, M.H.
T2 - Sixth IASTED International Conference on Computer Graphics and Imaging
Y2 - 13 August 2003 through 15 August 2003
ER -