Discriminative Latent Variable Based Classifier for Translation Error Detection

Du, Jinhua; Guo, Junbo; Zhao, Fei

doi:10.1007/978-3-642-41644-6_13

Part of the book series: Communications in Computer and Information Science (CCIS, volume 400)

Included in the following conference series:

CCF International Conference on Natural Language Processing and Chinese Computing

Abstract

This paper presents a discriminative latent variable model (DPLVM) based classifier that improves translation error detection for statistical machine translation (SMT). The classifier uses latent variables to carry additional information that the original labels may not express and to capture more complex dependencies between translation errors and their corresponding features, thereby improving classification performance. Specifically, we first detail the mathematical representation of the proposed DPLVM method and then introduce three types of features: word posterior probabilities (WPP), linguistic features, and syntactic features. Finally, we compare the proposed method with MaxEnt and SVM classifiers to verify its effectiveness. Experimental results show that the proposed DPLVM-based classifier reduces the classification error rate (CER) by a relative 1.75%, 1.69%, and 2.61% compared to the MaxEnt classifier, and by a relative 0.17%, 0.91%, and 2.12% compared to the SVM classifier, over three different feature combinations.
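
The relative CER figures quoted in the abstract can be made concrete with a minimal sketch. It assumes that CER is the fraction of words whose predicted label differs from the reference label, and that a relative reduction is (baseline - system) / baseline; the abstract does not spell out either formula. The function names, the "c"/"i" label scheme, and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def classification_error_rate(predicted: Sequence[str], gold: Sequence[str]) -> float:
    """Fraction of words whose predicted label differs from the gold label (assumed CER definition)."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        raise ValueError("at least one labelled word is required")
    wrong = sum(p != g for p, g in zip(predicted, gold))
    return wrong / len(gold)


def relative_reduction(baseline_cer: float, system_cer: float) -> float:
    """Relative CER reduction of a system over a baseline, in percent."""
    if baseline_cer <= 0:
        raise ValueError("baseline CER must be positive")
    return 100.0 * (baseline_cer - system_cer) / baseline_cer


# Hypothetical toy labels ("c" = correct word, "i" = incorrect word); not data from the paper.
gold = ["c", "i", "c", "c", "i", "c", "c", "i", "c", "c"]
baseline_pred = ["c", "c", "c", "i", "i", "c", "c", "c", "c", "i"]
system_pred = ["c", "i", "c", "i", "i", "c", "c", "c", "c", "c"]

baseline_cer = classification_error_rate(baseline_pred, gold)
system_cer = classification_error_rate(system_pred, gold)
print(f"baseline CER = {baseline_cer:.2f}, system CER = {system_cer:.2f}")
print(f"relative reduction = {relative_reduction(baseline_cer, system_cer):.1f}%")
```

Under this reading, a relative reduction is measured against the baseline's own error rate, so a figure such as 1.75% describes a small shift in the absolute CER rather than a 1.75-point drop.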

Author information

Authors and Affiliations

Faculty of Automation and Information Engineering, Xi’an University of Technology, Xi’an, 710048, China
Jinhua Du & Fei Zhao
Faculty of High Vocational Education, Xi’an University of Technology, Xi’an, 710048, China
Junbo Guo

Editor information

Editors and Affiliations

Soochow University, 1 Shizi Street, 215006, Suzhou, China
Guodong Zhou
Department of Computer Science and Technology, Tsinghua University, 100084, Beijing, China
Juanzi Li
Institute of Computer Science & Technology, Peking University, 100871, Beijing, China
Dongyan Zhao & Yansong Feng

About this paper

Cite this paper

Du, J., Guo, J., Zhao, F. (2013). Discriminative Latent Variable Based Classifier for Translation Error Detection. In: Zhou, G., Li, J., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2013. Communications in Computer and Information Science, vol 400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41644-6_13

DOI: https://doi.org/10.1007/978-3-642-41644-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41643-9
Online ISBN: 978-3-642-41644-6
eBook Packages: Computer Science, Computer Science (R0)
