Table structure extraction from form documents based on gradient-wavelet scheme

Dihua Xi, Seong Whan Lee

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

Based on gradient and wavelet analyses, a novel scheme has been developed to extract table structures from skewed form document images. First, the skewed form document image is rotated according to the angle obtained from the gradient algorithm. The deskewed image is then decomposed into four sub-images by divisible Multiresolution Analysis (MRA) wavelets. Next, the table structure image, which represents the geometric structure of the form, is obtained from the sub-images by a modified wavelet reconstruction algorithm. In addition, a second document image without table lines, referred to as a table-free image, is produced by a Minkowski operation. Experiments indicate that the scheme can process skewed form document images with promising results.
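The decomposition step can be illustrated with a small sketch. It assumes a one-level Haar wavelet, chosen only because it is the simplest separable wavelet; the abstract does not name the wavelet the authors use, and the function names `haar_decompose` and `haar_reconstruct` are hypothetical. The inverse shown is the standard one, not the modified reconstruction algorithm of the paper.

```python
from typing import List, Tuple

Image = List[List[float]]


def haar_decompose(img: Image) -> Tuple[Image, Image, Image, Image]:
    """One level of a separable 2-D Haar transform (assumed wavelet).

    Returns four half-size sub-images:
    (average, horizontal-edge detail, vertical-edge detail, diagonal detail).
    """
    rows, cols = len(img), len(img[0])
    if rows % 2 or cols % 2:
        raise ValueError("image sides must be even")
    avg, horiz, vert, diag = [], [], [], []
    for r in range(0, rows, 2):
        avg_row, horiz_row, vert_row, diag_row = [], [], [], []
        for c in range(0, cols, 2):
            # The 2x2 block: a b on the top row, d e on the bottom row.
            a, b = img[r][c], img[r][c + 1]
            d, e = img[r + 1][c], img[r + 1][c + 1]
            avg_row.append((a + b + d + e) / 4)    # local mean
            horiz_row.append((a + b - d - e) / 4)  # top minus bottom
            vert_row.append((a - b + d - e) / 4)   # left minus right
            diag_row.append((a - b - d + e) / 4)   # diagonal difference
        avg.append(avg_row)
        horiz.append(horiz_row)
        vert.append(vert_row)
        diag.append(diag_row)
    return avg, horiz, vert, diag


def haar_reconstruct(avg: Image, horiz: Image, vert: Image, diag: Image) -> Image:
    """Standard inverse of haar_decompose (not the paper's modified version)."""
    out: Image = []
    for r in range(len(avg)):
        top, bottom = [], []
        for c in range(len(avg[0])):
            s, h, v, g = avg[r][c], horiz[r][c], vert[r][c], diag[r][c]
            top += [s + h + v + g, s + h - v - g]
            bottom += [s - h + v - g, s - h - v + g]
        out += [top, bottom]
    return out


# A 4x4 toy "form" containing a single horizontal table line on row 1.
form = [
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
avg, horiz, vert, diag = haar_decompose(form)
print(horiz)  # non-zero only where the horizontal line lies
print(vert)   # all zeros: the image has no vertical edges
```

In this toy case the horizontal line appears only in the average and horizontal-edge sub-images, which is the kind of directional separation that makes wavelet sub-images useful for isolating table lines.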

Original language: English
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher: Springer Verlag
Pages: 240-254
Number of pages: 15
Volume: 1655
ISBN (Print): 3540665072, 9783540665076
DOIs: https://doi.org/10.1007/3-540-48172-9_20
Publication status: Published - 1999
Event: 3rd IAPR Workshop on Document Analysis Systems, DAS 1998 - Nagano, Japan
Duration: 1998 Nov 4 - 1998 Nov 6

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1655
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 3rd IAPR Workshop on Document Analysis Systems, DAS 1998
Country: Japan
City: Nagano
Period: 98/11/4 - 98/11/6

Fingerprint

Table
Wavelets
Gradient
Multiresolution analysis
Form
Gradient Algorithm
Reconstruction Algorithm
Geometric Structure
Divisible
Angle
Line
Experimental Results

ASJC Scopus subject areas

  • Computer Science (all)
  • Theoretical Computer Science

Cite this

Xi, D., & Lee, S. W. (1999). Table structure extraction from form documents based on gradient-wavelet scheme. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1655, pp. 240-254). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1655). Springer Verlag. https://doi.org/10.1007/3-540-48172-9_20

Table structure extraction from form documents based on gradient-wavelet scheme. / Xi, Dihua; Lee, Seong Whan.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 1655 Springer Verlag, 1999. p. 240-254 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1655).

Xi, D & Lee, SW 1999, Table structure extraction from form documents based on gradient-wavelet scheme. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 1655, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 1655, Springer Verlag, pp. 240-254, 3rd IAPR Workshop on Document Analysis Systems, DAS 1998, Nagano, Japan, 98/11/4. https://doi.org/10.1007/3-540-48172-9_20
Xi D, Lee SW. Table structure extraction from form documents based on gradient-wavelet scheme. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 1655. Springer Verlag. 1999. p. 240-254. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/3-540-48172-9_20
Xi, Dihua ; Lee, Seong Whan. / Table structure extraction from form documents based on gradient-wavelet scheme. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 1655 Springer Verlag, 1999. pp. 240-254 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{6378deaeaa1a416c8474a61de448671e,
title = "Table structure extraction from form documents based on gradient-wavelet scheme",
abstract = "Based on gradient and wavelet analyses, a novel scheme has been developed to extract table structures from skewed form document images. First, the skewed form document image is rotated according to the angle obtained from the gradient algorithm. The deskewed image is then decomposed into four sub-images by divisible Multiresolution Analysis (MRA) wavelets. Next, the table structure image, which represents the geometric structure of the form, is obtained from the sub-images by a modified wavelet reconstruction algorithm. In addition, a second document image without table lines, referred to as a table-free image, is produced by a Minkowski operation. Experiments indicate that the scheme can process skewed form document images with promising results.",
author = "Xi, Dihua and Lee, {Seong Whan}",
year = "1999",
doi = "10.1007/3-540-48172-9_20",
language = "English",
isbn = "3540665072",
volume = "1655",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "240--254",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY  - GEN
T1  - Table structure extraction from form documents based on gradient-wavelet scheme
AU  - Xi, Dihua
AU  - Lee, Seong Whan
PY  - 1999
Y1  - 1999
AB  - Based on gradient and wavelet analyses, a novel scheme has been developed to extract table structures from skewed form document images. First, the skewed form document image is rotated according to the angle obtained from the gradient algorithm. The deskewed image is then decomposed into four sub-images by divisible Multiresolution Analysis (MRA) wavelets. Next, the table structure image, which represents the geometric structure of the form, is obtained from the sub-images by a modified wavelet reconstruction algorithm. In addition, a second document image without table lines, referred to as a table-free image, is produced by a Minkowski operation. Experiments indicate that the scheme can process skewed form document images with promising results.
UR  - http://www.scopus.com/inward/record.url?scp=84957616320&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84957616320&partnerID=8YFLogxK
U2  - 10.1007/3-540-48172-9_20
DO  - 10.1007/3-540-48172-9_20
M3  - Conference contribution
SN  - 3540665072
SN  - 9783540665076
VL  - 1655
T3  - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP  - 240
EP  - 254
BT  - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PB  - Springer Verlag
ER  -