Abstract
In order to explore a practical way of improving machine translation (MT) quality, the error types and distribution of MT results have to be analyzed first. This paper analyzed English-Chinese MT errors from the perspective of naming-telling clause (NT clause, hereafter). Two types of text were input to get the MT output: one was to input the whole original English sentences into an MT engine; the other was to parse English sentences into English NT clauses, and then input these clauses into the MT engine in order. The errors of MT output are categorized into three classes: incorrect lexical choices, structural errors and component omissions. Structural errors are further divided into SV-structure errors and non-SV-structure errors. The analyzed data shows firstly, the major errors are structural errors, in which non-SV-structural errors account for a larger proportion; secondly, translation errors decrease significantly after English sentences are parsed into NT clauses. This result reveals that non-SV clauses are the main source of MT errors, and suggests that English long sentences should be parsed into NT clauses before they are translated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Koehn, P.: Statistic Machine Translation. Publishing House of Electronic Industry, Beijing (2012). (Trans. by Zong, C., Zhang, X.)
Zhao, H., Xie, J., Lv, Y., Yu, H., Zhang, H., Liu, Q.: Common error analysis of machine translation output. In: The Ninth China Workshop on Machine Translation, Kunming (2013)
Song, R., Ge, S.: English-Chinese translation unit and translation model for discourse-based machine translation. J. Chin. Inf. Process. 29, 125–135 (2015)
Song, R.: Stream model of generalized topic structure in Chinese text. Chin. Lang. 6, 483–494 (2013)
Acknowledgements
This research is supported by the 2016 Key Project of the National Languages Committee (ZDI135-30), Innovative School Project in Higher Education of Guangdong, China (GWTP-LH-2015-10), the Science and Technology Project of Guangdong Province, China (2016A040403113), National Natural Science Foundation of China (61171129) and the fund of Center for Translation Studies, Guangdong University of Foreign Studies (CTS2014-13).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Fang, F., Ge, S., Song, R. (2016). Error Analysis of English-Chinese Machine Translation. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2016 2016. Lecture Notes in Computer Science(), vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-47674-2_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47673-5
Online ISBN: 978-3-319-47674-2
eBook Packages: Computer ScienceComputer Science (R0)