Machine Translation

, Volume 27, Issue 3, pp 193-212

First online:

Investigating the contribution of linguistic information to quality estimation

  • Mariano FeliceAffiliated withComputer Laboratory, University of Cambridge Email author 
  • , Lucia SpeciaAffiliated withDepartment of Computer Science, University of Sheffield

Rent the article at a discount

Rent now

* Final gross prices may vary according to local VAT.

Get Access


This paper describes a study on the contribution of linguistically-informed features to the task of quality estimation for machine translation at sentence level. A standard regression algorithm is used to build models using a combination of linguistic and non-linguistic features extracted from the input text and its machine translation. Experiments with three English–Spanish translation datasets show that linguistic features on their own are not able to outperform shallower features based on statistics from the input text, its translation and additional corpora. However, further analysis suggests that linguistic information can be useful to produce better results if carefully combined with other features. An in-depth analysis of the results highlights a number of issues related to the use of linguistic features.


Machine translation Evaluation Quality estimation