Abstract
In this paper, to improve the translation quality of a sentence, Convolutional sequence to sequence architecture has been applied to English-Punjabi, Punjabi-English, Hindi-Punjabi, Punjabi-Hindi language pairs. The Convolution architecture consists of Gated Linear Unit (GLU) and a Multi-Hop attention mechanism. The GLU mechanism controls the information flow through hidden units to produce good translations. The Multi-Hop attention is an enhanced version of the attention mechanism that allows the model to make repeated gaze on the sentence than looking at the sentence only once. As compared to the Recurrent Neural Network (RNN), the Convolution based technique provides better accuracy, captures long-range dependencies between the words, capture local context, and allows parallelization over every element in a sequence without depending on the computations of the previous time steps. In this work, we have applied the architecture on four Indian language pairs and compared the performance of our model with conventional models using BLEU, METEOR and WER as evaluation metrics. Our model outperforms recurrent neural networks by 2 BLEU points on English-Punjabi, 2.5 BLEU points on Punjabi-English, 3.5 BLEU points on Hindi-Punjabi translations, and 1.5 BLEU points on Punjabi-Hindi.
Similar content being viewed by others
References
Agrawal R (2017) Towards efficient Neural Machine Translation for Indian Languages. Diss. International Institute of Information Technology, Hyderabad
Chen SF, Beeferman D, Rosenfeld R (1998) Evaluation metrics for language models. DARPA Broadcast News Transcription and Understanding Workshop
Dauphin YN, Fan A, Auli M, Grangier D (2017) Language modeling with gated convolutional networks. In International Conference on Machine Learning pp. 933–941
Forcada ML et al (2011) Apertium: a free/open-source platform for rule-based machine translation. Mach Transl 25.2:127–144
Gehring J et al (2017) Convolutional sequence to sequence learning. Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org
Goyal V, Lehal GS (2011) Hindi to Punjabi machine translation system. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations. Association for Computational Linguistics
Goyal V, Lehal GS (2019) Hindi-Punjabi Machine Transliteration System (For Machine Translation System). George Ronchi Foundation Journal, Italy 64.1:2009
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition pp. 770–778
Jindal S, Goyal V, Bhullar JS (2018) English to Punjabi statistical machine translation using moses (Corpus Based). J Stat Manag Syst 21.4:553–560
Josan GS, Lehal GS (2008) A Punjabi to Hindi machine translation system. 22nd International Conference on on Computational Linguistics: Demonstration Papers. Association for Computational Linguistics
Josan GS, Lehal GS (2012) Direct Approach for Machine Translation from Punjabi to Hindi. CSI J Comput 1.1:3–19
Josan GS, Kaur J (2011) Punjabi to Hindi statistical machine transliteration. Int J Inf Technol Knowl Manag 4.2:459–463
Kaur K (2011) A Punjabi to English machine translation system for legal documents
Kaur J, Josan G (2011) Statistical approach to transliteration from english to punjabi. Int J Comput Sci Eng 3.4:1518–1527
Kaur D, Kaur ER (2015) English To Punjabi Script Converter System For Proper Nouns Using Hybrid Approach. International Journal of Scientific Research and Management 3.3
Kumar A, Goyal V (2018) Hindi to Punjabi machine translation system based on statistical approach. J Stat Manag Syst 21(4):547–552
Kaur A, Rani J (2015) A web based Punjabi to Hindi Statistical Machine Translation System. 2015 2nd International Conference on Recent Advances in Engineering & Computational Sciences (RAECS). IEEE
Koehn P (2009) Statistical machine translation. Cambridge University Press
Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Herbst E (2007) Moses: Open source toolkit for statistical machine translation. In Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions 177–180
Kumar P, Kumar V (2013) Statistical machine translation based punjabi to english transliteration system for proper nouns. Int J Appl Innov Eng Manag 2(8):318–321
Lavie A, Agarwal A (2007) METEOR: An automatic metric for MT evaluation with high levels of correlation with human judgements. Proceddings of the Second Workshop on Statistical Machine Translation. Association for Computational Linguistics 228–231
Luong MT, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. arXiv preprintarXiv:1508.04025
Mahata SKU, Das D, Bandyopadhyay S (2019) MTIL2017: Machine translation using recurrent neural network on statistical machine translation. J Intell Syst 28.3:447–453
Nair LR, David Peter S (2012) Machine translation systems for Indian languages. Int J Comp Appl 39.1:0975–8887
Papineni K, Roukos S, Ward T, Zhu W (2002) BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pp.311–318
Premjith B, Kumar MA, Soman KP (2019) Neural Machine Translation System for English to Indian Language Translation Using MTIL Parallel Corpus: Special Issue on Natural Language Processing. J Intell Syst 28(3):387–398
Revanuru K, Turlapaty K, Rao S (2017) Neural Machine Translation of Indian Languages. Proceedings of the 10th Annual ACM India Compute Conference on ZZZ. ACM
Shirai S, Bond F, Takahashi Y (1997) A hybrid rule and example-based method for machine translation. Recent Advances in Example-Based Machine Translation 211–224
Sidhu BK, Singh A, Goyal V (2010) Identification of proverbs in Hindi text corpus and their translation into Punjabi. Journal of Computer Science and Engineering 2(1):32–37
Singh S, Anand Kumar M, Soman KP (2018) Attention based English to Punjabi neural machine translation. J Intell Fuzzy Syst 34.3:1551–1559
Somers H (1999) Example-based machine translation. Mach Transl 14(2):113–157
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. Advances in neural information processing systems
van der Wel E, Ullrich K (2017) Optical music recognition with convolutional sequence-to-sequence models. arXiv preprintarXiv:1707.04877
Zhang Y et al (2019) Abstract Text Summarization with a Convolutional Seq2seq Model. Appl Sci 9.8:1665
Acknowledgements
This work was supported by Council of Scientific and Indian Research (CSIR) with file no. 09/263(1152)/2018-EMR-I. We are also thankful to Department of Science and Technology (DST), Govt. of India for support through PURSE Grant.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bansal, M., Lobiyal, D.K. Multilingual sequence to sequence convolutional machine translation. Multimed Tools Appl 80, 33701–33726 (2021). https://doi.org/10.1007/s11042-021-11345-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11345-6