Translation model size reduction for hierarchical phrase-based statistical machine translation

Seung Wook Lee, Dongdong Zhang, Mu Li, Ming Zhou, Hae Chang Rim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

In this paper, we propose a novel method of reducing the size of translation model for hierarchical phrase-basedmachine translation systems. Previous approaches try to prune infrequent entries or unreliable entries based on statistics, but cause a problem of reducing the translation coverage. On the contrary, the proposed method try to prune only ineffective entries based on the estimation of the information redundancy encoded in phrase pairs and hierarchical rules, and thus preserve the search space of SMT decoders as much as possible. Experimental results on Chinese-to- English machine translation tasks show that our method is able to reduce almost the half size of the translation model with very tiny degradation of translation performance.

Original languageEnglish
Title of host publication50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference
Pages291-295
Number of pages5
Publication statusPublished - 2012
Event50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Jeju Island, Korea, Republic of
Duration: 2012 Jul 82012 Jul 14

Publication series

Name50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference
Volume2

Other

Other50th Annual Meeting of the Association for Computational Linguistics, ACL 2012
Country/TerritoryKorea, Republic of
CityJeju Island
Period12/7/812/7/14

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Software

Fingerprint

Dive into the research topics of 'Translation model size reduction for hierarchical phrase-based statistical machine translation'. Together they form a unique fingerprint.

Cite this