Automatic document processing

A survey

Yuan Y. Tang, Seong Whan Lee, Ching Y. Suen

Research output: Contribution to journalArticle

91 Citations (Scopus)

Abstract

Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysts and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.

Original languageEnglish
Pages (from-to)1931-1952
Number of pages22
JournalPattern Recognition
Volume29
Issue number12
DOIs
Publication statusPublished - 1996 Dec 1

Fingerprint

Processing
Gabor filters
Hough transforms
Fractals

Keywords

  • Description languages
  • Document analysis and understanding
  • Document processing
  • Formatting knowledge
  • Geometric and logical structures
  • Hierarchical and no-hierarchical methods
  • Texture analysis
  • Tree transform

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Automatic document processing : A survey. / Tang, Yuan Y.; Lee, Seong Whan; Suen, Ching Y.

In: Pattern Recognition, Vol. 29, No. 12, 01.12.1996, p. 1931-1952.

Research output: Contribution to journalArticle

Tang, Yuan Y. ; Lee, Seong Whan ; Suen, Ching Y. / Automatic document processing : A survey. In: Pattern Recognition. 1996 ; Vol. 29, No. 12. pp. 1931-1952.
@article{818518c94f78406c95131e725d832a91,
title = "Automatic document processing: A survey",
abstract = "Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysts and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.",
keywords = "Description languages, Document analysis and understanding, Document processing, Formatting knowledge, Geometric and logical structures, Hierarchical and no-hierarchical methods, Texture analysis, Tree transform",
author = "Tang, {Yuan Y.} and Lee, {Seong Whan} and Suen, {Ching Y.}",
year = "1996",
month = "12",
day = "1",
doi = "10.1016/S0031-3203(96)00044-1",
language = "English",
volume = "29",
pages = "1931--1952",
journal = "Pattern Recognition",
issn = "0031-3203",
publisher = "Elsevier Limited",
number = "12",

}

TY - JOUR

T1 - Automatic document processing

T2 - A survey

AU - Tang, Yuan Y.

AU - Lee, Seong Whan

AU - Suen, Ching Y.

PY - 1996/12/1

Y1 - 1996/12/1

N2 - Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysts and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.

AB - Surveys of the basic concepts and underlying techniques are presented in this paper. A basic model for document processing is described. In this model, document processing can be divided into two phases: document analysts and document understanding. A document has two structures: geometric (layout) structure and logical structure. Extraction of the geometric structure from a document refers to document analysis; mapping the geometric structure into logical structure deals with document understanding. Both types of document structures and the two areas of document processing are discussed. Two categories of methods have been used in document analysis, namely, (1) hierarchical methods including top-down and bottom-up approaches, (2) no-hierarchical methods including modified fractal signature. Tree transform, formatting knowledge and description language approaches have been used in document understanding. A particular case of form document processing is discussed. Form description and form registration approaches are presented. A form processing system is also introduced. Finally, many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.

KW - Description languages

KW - Document analysis and understanding

KW - Document processing

KW - Formatting knowledge

KW - Geometric and logical structures

KW - Hierarchical and no-hierarchical methods

KW - Texture analysis

KW - Tree transform

UR - http://www.scopus.com/inward/record.url?scp=0030384386&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030384386&partnerID=8YFLogxK

U2 - 10.1016/S0031-3203(96)00044-1

DO - 10.1016/S0031-3203(96)00044-1

M3 - Article

VL - 29

SP - 1931

EP - 1952

JO - Pattern Recognition

JF - Pattern Recognition

SN - 0031-3203

IS - 12

ER -