Skip to main content

Error Analysis of English-Chinese Machine Translation

  • Conference paper
  • First Online:
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD 2016, CCL 2016)

Abstract

In order to explore a practical way of improving machine translation (MT) quality, the error types and distribution of MT results have to be analyzed first. This paper analyzed English-Chinese MT errors from the perspective of naming-telling clause (NT clause, hereafter). Two types of text were input to get the MT output: one was to input the whole original English sentences into an MT engine; the other was to parse English sentences into English NT clauses, and then input these clauses into the MT engine in order. The errors of MT output are categorized into three classes: incorrect lexical choices, structural errors and component omissions. Structural errors are further divided into SV-structure errors and non-SV-structure errors. The analyzed data shows firstly, the major errors are structural errors, in which non-SV-structural errors account for a larger proportion; secondly, translation errors decrease significantly after English sentences are parsed into NT clauses. This result reveals that non-SV clauses are the main source of MT errors, and suggests that English long sentences should be parsed into NT clauses before they are translated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Koehn, P.: Statistic Machine Translation. Publishing House of Electronic Industry, Beijing (2012). (Trans. by Zong, C., Zhang, X.)

    MATH  Google Scholar 

  2. Zhao, H., Xie, J., Lv, Y., Yu, H., Zhang, H., Liu, Q.: Common error analysis of machine translation output. In: The Ninth China Workshop on Machine Translation, Kunming (2013)

    Google Scholar 

  3. Song, R., Ge, S.: English-Chinese translation unit and translation model for discourse-based machine translation. J. Chin. Inf. Process. 29, 125–135 (2015)

    Google Scholar 

  4. Song, R.: Stream model of generalized topic structure in Chinese text. Chin. Lang. 6, 483–494 (2013)

    Google Scholar 

Download references

Acknowledgements

This research is supported by the 2016 Key Project of the National Languages Committee (ZDI135-30), Innovative School Project in Higher Education of Guangdong, China (GWTP-LH-2015-10), the Science and Technology Project of Guangdong Province, China (2016A040403113), National Natural Science Foundation of China (61171129) and the fund of Center for Translation Studies, Guangdong University of Foreign Studies (CTS2014-13).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shili Ge .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Fang, F., Ge, S., Song, R. (2016). Error Analysis of English-Chinese Machine Translation. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2016 2016. Lecture Notes in Computer Science(), vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47674-2_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47673-5

  • Online ISBN: 978-3-319-47674-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics