Skip to main content

Combining Phrase and Neural-Based Machine Translation: What Worked and Did Not

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2016)

Abstract

Phrase-based machine translation assumes that all words are at the same distance and translates them using feature functions that approximate the probability at different levels. On the other hand, neural machine translation infers a word embedding and translates these word vectors using a neural model. At the moment, both approaches co-exist and are being intensively investigated.

This paper to the best of our knowledge is the first work that both compares and combines these two systems by: using the phrase-based output to solve unknown words in the neural machine translation output; using the neural alignment in the phrase-based system; comparing how the popular strategy of pre-reordering affects both systems; and combining both translation outputs. Improvements are achieved in Catalan-to-Spanish and German-to-English.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://www.statmt.org/wmt15/translation-task.html.

  2. 2.

    http://dl4mt.computing.dcu.ie/.

References

  1. Sundermeyer, M., Alkhouli, T., Wuebker, J., Ney, H.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Doha (2014)

    Google Scholar 

  2. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR abs/1409.0473 (2015)

    Google Scholar 

  3. Bisazza, A., Federico, M.: A survey of word reordering in statistical machine translation: computational models and language phenomena. arXiv preprint arXiv:1502.04938 (2015)

  4. Castaño, M.A., Casacuberta, F.: A connectionist approach to MT. In: Proceedings of the EUROSPEECH Conference (1997)

    Google Scholar 

  5. Chiang, D.: Hierarchical phrase-based translation. Comput. Linguist. 33(2), 201–228 (2007)

    Article  MATH  Google Scholar 

  6. Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder-decoder approaches. In: Proceedings of the 8th Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha (2014)

    Google Scholar 

  7. Collins, M., Koehn, P., Kucerova, I.: Clause restructuring for statistical machine translation. In: Annual Conference of the Association for Computational Lingusitics (ACL 2005), Michigan (2005)

    Google Scholar 

  8. Costa-Jussà, M.R., Fonollosa, J.A.R.: Statistical machine reordering. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 70–76 (2006)

    Google Scholar 

  9. Costa-Jussà, M., Poch, M., Fonollosa, J., Farrús, M., Marinno, J.: A large Spanish-Catalan parallel corpus release for machine translation. Comput. Inf. 33(4), 907–920 (2014)

    Google Scholar 

  10. Graves, A.: Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013)

  11. Jean, S., Firat, O., Cho, K., Memisevic, R., Bengio, Y.: Montreal neural machine translation systems for WMT15. In: Proceedings of the 10th Workshop on Statistical Machine Translation, Lisbon (2015)

    Google Scholar 

  12. Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Seattle (2013)

    Google Scholar 

  13. Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pp. 177–180 (2007)

    Google Scholar 

  14. Koehn, P., Och, F., Marcu, D.: Statistical phrase-based translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics (2003)

    Google Scholar 

  15. Kumar, S., Byrne, W.J.: Minimum Bayes-risk decoding for statistical machine translation. In: HLT-NAACL, pp. 169–176 (2004)

    Google Scholar 

  16. Luong, T., Sutskever, I., Le, Q.V., Vinyals, O., Zaremba, W.: Addressing the rare word problem in neural machine translation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Beijing, pp. 11–19 (2015)

    Google Scholar 

  17. Och, F.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics, pp. 160–167 (2003)

    Google Scholar 

  18. Quirk, C., Menezes, A., Cherry, C.: Dependency treelet translation: syntactically informed phrasal SMT. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 271–279 (2005)

    Google Scholar 

  19. Schwenk, H., Costa-Jussà, M.R., Fonollosa, J.A.R.: Smooth bilingual n-gram translation. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 430–438 (2007)

    Google Scholar 

  20. Stolcke, A.: SRILM - an extensible language modeling toolkit. In: 7th International Conference on Spoken Language Processing, Denver (2002)

    Google Scholar 

  21. Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 3104–3112. Curran Associates, Inc. (2014)

    Google Scholar 

  22. Toutanova, K., Suzuki, H., Ruopp, A.: Applying morphology generation models to machine translation. In: Proceedings of the Joint Conference of the Association for Computational Linguistics and Human Language Technology, Columbus, pp. 514–522 (2008)

    Google Scholar 

Download references

Acknowledgements

This work has been supported by Spanish Ministerio de Economía y Competitividad, contract TEC2015-69266-P and the Seventh Framework Program of the European Commission through the International Outgoing Fellowship Marie Curie Action (IMTraP-2011-29951).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marta R. Costa-jussà .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

R. Costa-jussà, M., Fonollosa, J.A.R. (2018). Combining Phrase and Neural-Based Machine Translation: What Worked and Did Not. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science(), vol 9624. Springer, Cham. https://doi.org/10.1007/978-3-319-75487-1_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-75487-1_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-75486-4

  • Online ISBN: 978-3-319-75487-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics