A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches

Jae Hee Lee, Seung Wook Lee, Gumwon Hong, Young Sook Hwang, Sang Bum Kim, Hae-Chang Rim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Statistical word alignment often suffers from data sparseness. Part-of-speeches are often incorporated in NLP tasks to reduce data sparseness. In this paper, we attempt to mitigate such problem by reflecting alignment tendency between part-of-speeches to statistical word alignment. Because our approach does not rely on any language-dependent knowledge, it is very simple and purely statistic to be applied to any language pairs. End-to-end evaluation shows that the proposed method can improve not only the quality of statistical word alignment but the performance of statistical machine translation.

Original languageEnglish
Title of host publicationColing 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference
Pages623-629
Number of pages7
Volume2
Publication statusPublished - 2010 Dec 1
Event23rd International Conference on Computational Linguistics, Coling 2010 - Beijing, China
Duration: 2010 Aug 232010 Aug 27

Other

Other23rd International Conference on Computational Linguistics, Coling 2010
CountryChina
CityBeijing
Period10/8/2310/8/27

Fingerprint

Processing
language
statistics
evaluation
performance
Statistics
Part of Speech
Alignment
Language
Evaluation
Statistical Machine Translation
Natural Language Processing

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Linguistics and Language

Cite this

Lee, J. H., Lee, S. W., Hong, G., Hwang, Y. S., Kim, S. B., & Rim, H-C. (2010). A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. In Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference (Vol. 2, pp. 623-629)

A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. / Lee, Jae Hee; Lee, Seung Wook; Hong, Gumwon; Hwang, Young Sook; Kim, Sang Bum; Rim, Hae-Chang.

Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference. Vol. 2 2010. p. 623-629.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lee, JH, Lee, SW, Hong, G, Hwang, YS, Kim, SB & Rim, H-C 2010, A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. in Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference. vol. 2, pp. 623-629, 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China, 10/8/23.
Lee JH, Lee SW, Hong G, Hwang YS, Kim SB, Rim H-C. A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. In Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference. Vol. 2. 2010. p. 623-629
Lee, Jae Hee ; Lee, Seung Wook ; Hong, Gumwon ; Hwang, Young Sook ; Kim, Sang Bum ; Rim, Hae-Chang. / A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference. Vol. 2 2010. pp. 623-629
@inproceedings{c101a3345dd640549eb6be6f8ba31176,
title = "A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches",
abstract = "Statistical word alignment often suffers from data sparseness. Part-of-speeches are often incorporated in NLP tasks to reduce data sparseness. In this paper, we attempt to mitigate such problem by reflecting alignment tendency between part-of-speeches to statistical word alignment. Because our approach does not rely on any language-dependent knowledge, it is very simple and purely statistic to be applied to any language pairs. End-to-end evaluation shows that the proposed method can improve not only the quality of statistical word alignment but the performance of statistical machine translation.",
author = "Lee, {Jae Hee} and Lee, {Seung Wook} and Gumwon Hong and Hwang, {Young Sook} and Kim, {Sang Bum} and Hae-Chang Rim",
year = "2010",
month = "12",
day = "1",
language = "English",
volume = "2",
pages = "623--629",
booktitle = "Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference",

}

TY - GEN

T1 - A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches

AU - Lee, Jae Hee

AU - Lee, Seung Wook

AU - Hong, Gumwon

AU - Hwang, Young Sook

AU - Kim, Sang Bum

AU - Rim, Hae-Chang

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Statistical word alignment often suffers from data sparseness. Part-of-speeches are often incorporated in NLP tasks to reduce data sparseness. In this paper, we attempt to mitigate such problem by reflecting alignment tendency between part-of-speeches to statistical word alignment. Because our approach does not rely on any language-dependent knowledge, it is very simple and purely statistic to be applied to any language pairs. End-to-end evaluation shows that the proposed method can improve not only the quality of statistical word alignment but the performance of statistical machine translation.

AB - Statistical word alignment often suffers from data sparseness. Part-of-speeches are often incorporated in NLP tasks to reduce data sparseness. In this paper, we attempt to mitigate such problem by reflecting alignment tendency between part-of-speeches to statistical word alignment. Because our approach does not rely on any language-dependent knowledge, it is very simple and purely statistic to be applied to any language pairs. End-to-end evaluation shows that the proposed method can improve not only the quality of statistical word alignment but the performance of statistical machine translation.

UR - http://www.scopus.com/inward/record.url?scp=80053434256&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053434256&partnerID=8YFLogxK

M3 - Conference contribution

VL - 2

SP - 623

EP - 629

BT - Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference

ER -