Translation model size reduction for hierarchical phrase-based statistical machine translation

Seung Wook Lee, Dongdong Zhang, Mu Li, Ming Zhou, Hae-Chang Rim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)

Abstract

In this paper, we propose a novel method of reducing the size of the translation model for hierarchical phrase-based machine translation systems. Previous approaches prune infrequent or unreliable entries based on statistics, but doing so reduces translation coverage. In contrast, the proposed method prunes only ineffective entries, based on an estimate of the information redundancy encoded in phrase pairs and hierarchical rules, and thus preserves the search space of SMT decoders as much as possible. Experimental results on Chinese-to-English machine translation tasks show that our method cuts the size of the translation model almost in half with very little degradation in translation performance.

Original language: English
Title of host publication: 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference
Pages: 291-295
Number of pages: 5
Volume: 2
Publication status: Published - 2012 Dec 1
Event: 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Jeju Island, Korea, Republic of
Duration: 2012 Jul 8 → 2012 Jul 14

Other

Other: 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012
Country: Korea, Republic of
City: Jeju Island
Period: 12/7/8 → 12/7/14

Fingerprint

Redundancy
Statistics
Degradation

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Software

Cite this

Lee, S. W., Zhang, D., Li, M., Zhou, M., & Rim, H-C. (2012). Translation model size reduction for hierarchical phrase-based statistical machine translation. In 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference (Vol. 2, pp. 291-295)

@inproceedings{0a9b25fc0b3f498fb1ee6b84564ba285,
title = "Translation model size reduction for hierarchical phrase-based statistical machine translation",
abstract = "In this paper, we propose a novel method of reducing the size of the translation model for hierarchical phrase-based machine translation systems. Previous approaches prune infrequent or unreliable entries based on statistics, but doing so reduces translation coverage. In contrast, the proposed method prunes only ineffective entries, based on an estimate of the information redundancy encoded in phrase pairs and hierarchical rules, and thus preserves the search space of SMT decoders as much as possible. Experimental results on Chinese-to-English machine translation tasks show that our method cuts the size of the translation model almost in half with very little degradation in translation performance.",
author = "Lee, {Seung Wook} and Dongdong Zhang and Mu Li and Ming Zhou and Hae-Chang Rim",
year = "2012",
month = "12",
day = "1",
language = "English",
isbn = "9781937284251",
volume = "2",
pages = "291--295",
booktitle = "50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference",

}

TY - GEN

T1 - Translation model size reduction for hierarchical phrase-based statistical machine translation

AU - Lee, Seung Wook

AU - Zhang, Dongdong

AU - Li, Mu

AU - Zhou, Ming

AU - Rim, Hae-Chang

PY - 2012/12/1

Y1 - 2012/12/1

AB - In this paper, we propose a novel method of reducing the size of the translation model for hierarchical phrase-based machine translation systems. Previous approaches prune infrequent or unreliable entries based on statistics, but doing so reduces translation coverage. In contrast, the proposed method prunes only ineffective entries, based on an estimate of the information redundancy encoded in phrase pairs and hierarchical rules, and thus preserves the search space of SMT decoders as much as possible. Experimental results on Chinese-to-English machine translation tasks show that our method cuts the size of the translation model almost in half with very little degradation in translation performance.

UR - http://www.scopus.com/inward/record.url?scp=84878209275&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878209275&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84878209275

SN - 9781937284251

VL - 2

SP - 291

EP - 295

BT - 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference

ER -