Machine Translation

, Volume 27, Issue 3, pp 193–212

Investigating the contribution of linguistic information to quality estimation


DOI: 10.1007/s10590-013-9137-5

Cite this article as:
Felice, M. & Specia, L. Machine Translation (2013) 27: 193. doi:10.1007/s10590-013-9137-5


This paper describes a study on the contribution of linguistically-informed features to the task of quality estimation for machine translation at sentence level. A standard regression algorithm is used to build models using a combination of linguistic and non-linguistic features extracted from the input text and its machine translation. Experiments with three English–Spanish translation datasets show that linguistic features on their own are not able to outperform shallower features based on statistics from the input text, its translation and additional corpora. However, further analysis suggests that linguistic information can be useful to produce better results if carefully combined with other features. An in-depth analysis of the results highlights a number of issues related to the use of linguistic features.


Machine translationEvaluationQuality estimation

Copyright information

© Springer Science+Business Media Dordrecht 2013

Authors and Affiliations

  1. 1.Computer LaboratoryUniversity of CambridgeCambridgeUK
  2. 2.Department of Computer ScienceUniversity of SheffieldSheffieldUK