, Volume 79, Issue 2, pp 210-231

First online:

Analyses of Model Fit and Robustness. A New Look at the PISA Scaling Model Underlying Ranking of Countries According to Reading Literacy

  • Svend KreinerAffiliated withDepartment of Biostatistics, University of Copenhagen Email author 
  • , Karl Bang ChristensenAffiliated withDepartment of Biostatistics, University of Copenhagen

Rent the article at a discount

Rent now

* Final gross prices may vary according to local VAT.

Get Access


This paper addresses methodological issues that concern the scaling model used in the international comparison of student attainment in the Programme for International Student Attainment (PISA), specifically with reference to whether PISA’s ranking of countries is confounded by model misfit and differential item functioning (DIF). To determine this, we reanalyzed the publicly accessible data on reading skills from the 2006 PISA survey. We also examined whether the ranking of countries is robust in relation to the errors of the scaling model. This was done by studying invariance across subscales, and by comparing ranks based on the scaling model and ranks based on models where some of the flaws of PISA’s scaling model are taken into account. Our analyses provide strong evidence of misfit of the PISA scaling model and very strong evidence of DIF. These findings do not support the claims that the country rankings reported by PISA are robust.

Key words

differential item functioning ranking robustness educational testing programme for international student assessment PISA Rasch models reading literacy