Skip to main content

Statistical Feature Aided Intelligent Deep Learning Machine Translation in Internet of Things

Abstract

Internet of Things (IoT) networks have been widely deployed to achieve communication among machines and humans. Machine translation can enable human-machine interactions for IoT equipment. In this paper, we propose to combine the neural machine translation (NMT) and statistical machine translation (SMT) to improve translation precision. In our design, we propose a hybrid deep learning (DL) network that uses the statistical feature extracted from the words as the data set. Namely, we use the SMT model to score the generated words in each decoding step of the NMT model, instead of directly processing their outputs. These scores will be converted to the generation probability corresponding to words by classifiers and used for generating the output of the hybrid MT system. For the NMT, the DL network consists of the input layer, embedding layer, recurrent layer, hidden layer, and output layer. At the offline training stage, the NMT network is jointly trained with SMT models. Then at the online deployment stage, we load the fine-trained models and parameters to generate the outputs. Experimental results on French-to-English translation tasks show that the proposed scheme can take advantage of both NMT and SMT methods, thus higher translation precision could be achieved.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

References

  1. Arthur P, Neubig G, Nakamura S (2016) Incorporating discrete translation lexicons into neural machine translation. In: Proc. Conf. Empirical methods natural lang. process

  2. Dahl G, Yu D, Deng L, Acero A (2012) Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, Language Process, IEEE Trans 20(1):30–42

    Article  Google Scholar 

  3. Heafield K (2011) Kenlm: Faster and smaller language model queries. In: Proceedings of the sixth workshop on statistical machine translation. WMT ’11, Association for Computational Linguistics, USA, pp 187–197

  4. Jean S, Cho K, Memisevic R, Bengio Y (2015) On using very large target vocabulary for neural machine translation. In: Proc. 53rd annu, meeting assoc. comput. linguistics 7th int. joint conf. natural lang. process, vol 1. Association for Computational Linguistics, Beijing, China, pp 1–10

  5. Krizhevsky A, Sutskever I, Hinton G (2012) Imagenet classification with deep convolutional neural networks. In: Proc. adv. neural inf. process. syst, pp 1097–1105

  6. Och F, Nev H (2002) Discriminative training and maximum entropy models for statistical machine translation. In: Association for Computational Linguistics, Philadelphia, Pennsylvania, USA, pp 295–302

  7. Palangi H, Deng L, Shen Y, Gao J, He X, Chen J, Song X, Ward R (Apr 2016) Deep sentence embedding using long short-term memory networks: Analysis and application to information retrieval. IEEE/ACM Trans Audio, Speech and Lang Proc 24(4):694–707

    Article  Google Scholar 

  8. Tu Z., Lu Z., Liu Y., Liu X., Li H. (2016) Modeling coverage for neural machine translation. In: Proc. 40th annu. meeting assoc. comput. linguistics. pp 76–85

  9. Wang X., Tu Z., Zhang M. (2018) Incorporating statistical machine translation word knowledge into neural machine translation. IEEE/ACM Trans Audio Speech, Language Process 26(12):2255–2266

    Article  Google Scholar 

  10. Wang X, Lu Z, Tu Z, Li H, Xiong D, Zhang Mx (2016) Neural machine translation advised by statistical machine translation. In: Proc. AAAI conf. artif intell

  11. Zhu X, Yang M, Zhao T, Zhu C (2018) Minimum bayes-risk phrase table pruning for pivot-based machine translation in internet of things. IEEE Access 6:55754–55764

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported in part by the State Key Program of National Social Science of China (No. 18AZD035), the Key Research & Development and Transformation Plan of Science and Technology Program for Tibet Autonomous Region (No. XZ201901-GB-16), the Special Fund from the Central Finance to Support the Development of Local Universities (No.ZFYJY201902001) and the National Natural Science Foundation of China (No.71964030).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lin Zhang.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Zhang, L., Lan, P. et al. Statistical Feature Aided Intelligent Deep Learning Machine Translation in Internet of Things. Mobile Netw Appl (2022). https://doi.org/10.1007/s11036-022-01936-4

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11036-022-01936-4

Keywords

  • Neural machine translation
  • Statistical machine translation
  • Neural network
  • Statistical feature extraction