A Content-Based Neural Reordering Model for Statistical Machine Translation

Pan, Yirong; Li, Xiao; Yang, Yating; Mi, Chenggang; Dong, Rui; Zeng, Wenxiao

doi:10.1007/978-981-10-7134-8_11

Yirong Pan^11,12,
Xiao Li^11,13,
Yating Yang^11,13,
Chenggang Mi^11,13,
Rui Dong^11,13 &
…
Wenxiao Zeng^11,13

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 787))

Included in the following conference series:

China Workshop on Machine Translation

510 Accesses
4 Citations

Abstract

Phrase-based lexicalized reordering models have attracted extensive interest in statistical machine translation (SMT) due to their capacity for dealing with swap between consecutive phrases. However, translations between two languages that with significant differences in syntactic structure have made it challenging to generate a semantically and syntactically correct word sequence. In an effort to alleviate this problem, we propose a novel content-based neural reordering model that estimates reordering probabilities based on the words of its surrounding contexts. We first utilize a simple convolutional neural network (CNN) to capture semantic contents conditioned on various sizes of context. And then we employ a softmax layer to predict the reordering orientations and probability distributions. Experimental results show that our model provides statistically obvious improvements for both Chinese-Uyghur (+0.48 on CWMT2015) and Chinese-English (+0.27 on CWMT2013) translation tasks over conventional lexicalized reordering models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Koehn, P., Och, F.J., Marcu, D.: Statistical phrase-based translation. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 48–54. Association for Computational Linguistics (2003)
Google Scholar
Och, F.J., Ney, H.: The alignment template approach to statistical machine translation. Comput. Linguist. 30(4), 417–449 (2004)
Article MATH Google Scholar
Och, F.J., Gildea, D., Khudanpur, S., et al.: A smorgasbord of features for statistical machine translation. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004 (2004)
Google Scholar
Koehn, P., Hoang, H., Birch, A., et al.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, pp. 177–180. Association for Computational Linguistics (2007)
Google Scholar
Li, P., Liu, Y., Sun, M., et al.: A neural reordering model for phrase-based translation. In: COLING, pp. 1897–1907 (2014)
Google Scholar
Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649. IEEE (2013)
Google Scholar
Liu, X., Gao, J., He, X., et al.: Representation learning using multi-task deep neural networks for semantic classification and information retrieval. In: HLT-NAACL, pp. 912–921 (2015)
Google Scholar
Cho, K., Merrienboer, B.V., Gulcehre, C., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. Comput. Sci. (2014)
Google Scholar
Barone, A.V.M., Attardi, G.: Non-projective dependency-based pre-reordering with recurrent neural network for machine translation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Long Papers, vol. 1, pp. 846–856 (2015)
Google Scholar
Hadiwinoto, C., Ng, H.T.: A dependency-based neural reordering model for statistical machine translation. In: AAAI, pp. 109–115 (2017)
Google Scholar
Schwenk, H., Dchelotte, D., Gauvain, J.L.: Continuous space language models for statistical machine translation. In: COLING/ACL on Main Conference Poster Sessions, pp. 723–730. Association for Computational Linguistics (2006)
Google Scholar
Xiong, D., Liu, Q., Lin, S.: Maximum entropy based phrase reordering model for statistical machine translation. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pp. 521–528. Association for Computational Linguistics (2006)
Google Scholar
Socher, R., Pennington, J., Huang, E.H., et al.: Semi-supervised recursive autoencoders for predicting sentiment distributions. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 151–161. Association for Computational Linguistics (2011)
Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
Bojanowski, P., Grave, E., Joulin, A., et al.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)
Hensman, P., Masko, D.: The impact of imbalanced training data for convolutional neural networks. Degree Project in Computer Science, KTH Royal Institute of Technology (2015)
Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation (1985)
Google Scholar
Zeiler, M.D.: ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 (2012)
Loper, E., Bird, S.: NLTK: the natural language toolkit. In: ACL 2002 Workshop on Effective TOOLS and Methodologies for Teaching Natural Language Processing and Computational Linguistics, pp. 63–70. Association for Computational Linguistics (2002)
Google Scholar
Och, F.J., Ney, H.: GIZA++: training of statistical translation models (2000)
Google Scholar
Stolcke, A.: SRILM-an extensible language modeling toolkit. In: Interspeech 2002, vol. 2002 (2002)
Google Scholar
Papineni, K., Roukos, S., Ward, T., et al.: BLEU: a method for automatic evaluation of machine translation. In: Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics (2002)
Google Scholar

Download references

Acknowledgements

This research is supported by the West Light Foundation of Chinese Academy of Sciences (No. YBXM-2014-04), the Important Project on Science and Technology of Xinjiang Province (No. 2016A03007-3), the Xinjiang Province Open Project of Key Laboratory (No. 2015KL031) and the Natural Science Foundation of Xinjiang Province (No. 2015211B034).

Author information

Authors and Affiliations

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, China
Yirong Pan, Xiao Li, Yating Yang, Chenggang Mi, Rui Dong & Wenxiao Zeng
University of Chinese Academy of Sciences, Beijing, China
Yirong Pan
Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
Xiao Li, Yating Yang, Chenggang Mi, Rui Dong & Wenxiao Zeng

Authors

Yirong Pan
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Li
View author publications
You can also search for this author in PubMed Google Scholar
Yating Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chenggang Mi
View author publications
You can also search for this author in PubMed Google Scholar
Rui Dong
View author publications
You can also search for this author in PubMed Google Scholar
Wenxiao Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yating Yang .

Editor information

Editors and Affiliations

University of Macau, Macau SAR, China
Derek F. Wong
Soochow University, Suzhou, China
Deyi Xiong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pan, Y., Li, X., Yang, Y., Mi, C., Dong, R., Zeng, W. (2017). A Content-Based Neural Reordering Model for Statistical Machine Translation. In: Wong, D., Xiong, D. (eds) Machine Translation. CWMT 2017. Communications in Computer and Information Science, vol 787. Springer, Singapore. https://doi.org/10.1007/978-981-10-7134-8_11

Download citation

DOI: https://doi.org/10.1007/978-981-10-7134-8_11
Published: 14 November 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7133-1
Online ISBN: 978-981-10-7134-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics