Abstract
Recent empirical literature has seen many multidimensional indices emerge as well-being or poverty measures, in particular indices derived from principal components and various latent variable models. Though such indices are being increasingly and widely employed, few studies motivate their use or report the standard errors or confidence intervals associated with these estimators. This paper reviews the different underlying models, reaffirms their appropriateness in this context, examines the statistical properties of resulting indices, gives analytical expressions of their variances and establishes certain exact relationships among them.
Similar content being viewed by others
References
Anderson, T. W. (1984). An introduction to multivariate statistical analysis. New York: John Wiley and Sons.
Balestrino, A., & Sciclone, N. (2000). Should we use functionings instead of income to measure well-being? Theory, and some evidence from Italy. Mimeo, University of Pisa.
Bartholomew, D. J., & Knott, M. (1999). Latent variable models and factor analysis. U.K.: Edward Arnold.
Biswas, B., & Caliendo, F. (2002). A multivariate analysis of the human development index. Indian Economic Journal, 49(4), 96–100.
Bollen, K. A. (1989). Structural equations with latent variables. New York: John Wiley & Sons.
Browne, M. W. (1984). Asymptotically distribution-free methods for the analysis of covariance structures. British Journal of Mathematical and Staistical Psychology, 37, 62–83.
Browne, M. W., & Arminger, G. (1995). Specification and estimation of mean - and covariance-structural models. In G. Arminger, C. C. Clogg, & M. E. Sobel (Eds.), Handbook of statistical modelling for the social and behavioral sciences (pp. 311–359). Newbury Park: Plenum Press.
Di Tommaso, M. L. (2006). Measuring the well-being of children using a capability approach: An application to Indian data. Working Paper CHILD No. 05/2006, University of Turin.
Gouriroux, C., Monfort, A., & Trognon, A.(1984). Pseudo-maximum likelihood methods: Theory. Econometrica, 52, 681–700.
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24, 417–441
Jöreskog, K. (1973). A general method for estimatimg a linear structural equation system. In A. S. Goldberger & O. D. Duncan (Eds.), Structural equation models in the social sciences. New York: Seminar Press.
Jöreskog, K. (2002). Structural equation modelling with ordinal variables using LISREL. http://www.ssicentral.com/lisrel/ordinal.htm
Jöreskog, K., & Goldberger, A. (1975). Estimation of a model with multiple indicators and multiple causes of a single latent variable. Journal of the American Statistical Association, 70(351), 631–639.
Klasen, S. (2000). Measuring poverty and deprivation in South Africa. Review of Income and Wealth, 46, 33–58.
Krishnakumar, J. (2007). Going beyond functionings to capabilities: An econometric model to explain and estimate capabilities. Journal of Human Development, 7, 39–63.
Krishnakumar, J., & Ballon, P. (2007). Estimating basic capabilities: A structural equation model applied to Bolivia. Working paper under review.
Kuklys, W. (2005). Amartya Sen’s capability approach: Theoretical insights and empirical applications. Berlin: Springer.
Lelli, S. (2001). Factor analysis vs. fuzzy sets theory: Assessing the influence of different techniques on Sen’s functioning approach. Center for Economic studies, K.U. Leuven.
Maasoumi, E., & Nickelsburg, G. (1988). Multidimensional measures of well-being and an analysis of inequality in the Michigan data. Journal of Business and Economic Statistics, 6(3), 327–334.
McGillivray, M. (2005). Measuring non-economic well-being achievement. Review of Income and Wealth, 51(2), 337–364.
Morris, M. D. (1979). Measuring the condition of the world’s poor: The physical quality of life index. New York: Pergamon.
Muthen, B. (1984). A general structural equation model with dichotomous, ordered categorical and continuous latent indicators. Psychometrika, 49, 115–132.
Muthen, B. (2002). Beyond SEM: General latent variable modelling. Behaviormetrika, 29(1), 81–117.
Muthen, B. O. (1998-2004). Mplus technical appendices. Los Angeles, CA: Muthen & Muthen.
Nagar, A. L., & Basu, S. (2001). Weighting socio-economic indicators of human development (a latent variable approach). New Delhi: National Institute of Public Finance and Policy.
Noorbaksh, F. (2003). Human development and regional disparities in India. Discussion Paper, Helsinki: UN-WIDER.
Rahman, T., Mittelhammer, R. C., & Wandschneider, P. (2003). Measuring the quality of life across countries: A sensitivity analysis of well-being indices. In WIDER International Conference on Inequality, Poverty and Human Well-being, Helsinki, Finland
Ram, R. (1982). Composite indices of physical quality of life, basic needs fulfilment, and income: A principal component representation. Journal of Development Economics, 11, 227–247.
Schokkaert, E., & Lootehgem, L. (1990). Sen’s concept of the living standard applied to the Belgian unemployed. Recherches Economiques de Louvain, 56, 429–450.
Sen, A. K. (1985). Commodities and capabilities. Amsterdam: North-Holland.
Sen, A. K. (1999). Development as freedom. Oxford: Oxford University Press.
Slottje, D. J (1991). Measuring the quality of life across countries. The Review of Economics and Statistics 73(4), 684–693.
Skrondal, A., & Rabe-Hesketh, S. (2004). Generalized latent variable modeling: Multilevel, longitudinal, and structural equation models. Boca Raton, U.S.A.: Chapman & Hall/CRC.
UNDP (1990). Human Development Report (HDR). U.K.: Oxford University Press.
Wagle, U. (2005). Multidimensional poverty measurement with economic well-being, capability and social inclusion: A case from Kathmandu, Nepal. Journal of Human Development, 6(3), 301–328.
White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica, 50, 1–26.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A
1.1 Minimum Variance Unbiased Estimation of Factor Scores in the FA Model
We are interested in estimators of latent factors \(\hat{f}\) such that
and
Let us denote the estimator as \(\hat{f} = C y. \) Then \(E(\hat{f}-f) = E(C(\Uplambda f + \varepsilon)-f) = (C\Uplambda - I) E(f) = 0\) implies the following condition:
Thus we need to solve the following program:
Minimise \(V(\hat{f}-f)= (C\Uplambda - I) (C\Uplambda - I)^{\prime} + C\Uppsi C^{\prime}\) under the constraint
The Lagrangian is :
Substituting the constraint in the objective function we get
The first order conditions are given by:
Solving the above system, one obtains:
In the special case Ψ = I, C * = (Λ′Λ)−1 Λ′ and \(\tilde{f} = C^{\ast}x = \Uptheta^{-\frac{1}{2}}A^{\prime}x = \Uptheta^{-\frac{1}{2}} p .\)
Appendix B
2.1 “Unbiased" Principal Components
If we require the first m principal components to be also unbiased estimators of the latent factors that they are supposed to represent then we should find B such that
This implies
or
or
or
Thus the ‘unbiased’ principal component estimator is given by
Appendix C
3.1 Expression of MIMIC Estimator
Following Jöreskog and Goldberger (1975), the conditional expectation of f given y,x is given by:
where
Using
we obtain
which can be simplified to
Appendix D
4.1 Latent Factor Estimators and Their Variances in the Linear SEM
As explained in the text, the latent factors are estimated as the expectation of the posterior distribution of these factors given the sample i.e. given y,x. For a pure measurement model (with exogenous variables w) written as
the latent factor (Empirical Bayes) estimator is derived in Skrondal and Rabe-Hesketh (2004) as follows:
Here we take the above formula and adapt it to our case in which we have a SEM for explaining the latent factors. Our model is reproduced below for reference:
with
To make use of the above result we substitute the reduced form of our SEM given by
into the measurement equation (15) to get
Identifying (16) with (14) and η with u one can obtain the ‘estimator’ of u as
The factor estimators are then obtained by substituting \(\hat{u}\) for u in the SEM model (15):
which is the equation given in the text.
Finally, the variance of \(\hat{y}^*\) is derived by noting that
and
and using the above to calculate \(V(\hat{y}^{\ast})\) according to (17).
Alternatively, Muthen (1998-2004) gives another expression of the latent factor estimator based on maximisation of posterior likelihood. The model is written as
where
with
and
Thus the model is in fact
The factor score estimator is then:
where
and
Replacing the above partitioned matrices and vectors in (18) and performing all the calculations, one gets:
and
The last result is expected as we assume that the x’s are directly observed.
Assuming y is centered and regrouping the intercept term A −1α and the ‘exogenous’ elements term A −1 Bx into one term denoting it with the same symbol A −1 Bx (i.e. assuming x incorporates a constant), one gets
Thus we see that it is the same expression as the Empirical Bayes estimator (17) (under our above assumptions) and hence has the same variance.
Appendix E
5.1 Monotonic Transformation and Posterior Distribution
The ordinality of latent factors implies that any monotonic transformation of y * will preserve the order in \(\hat{y^{\ast}}. \) We will show this in the case of a scalar latent factor y * with a vector indicator y. The proof can be extended to the vector case without any major difficulty.
The posterior distribution of the latent factor y * given the indicator y is given by
where p(y *|y) denotes the posterior density of y * given y, p(y *) is the prior density of y *, π(y|y *) is the distribution of y given y * and f(y) denotes the density of y.
Let us now transform y *: u * = g(y *).
Then, using
and
one can write
or
The first element of the product is positive if g(y *) is monotonic increasing and one can write the second part as p(g −1(u *)|y) ≡ p(u *|y).
Hence
Therefore if
then we have
and finally
Rights and permissions
About this article
Cite this article
Krishnakumar, J., Nagar, A.L. On Exact Statistical Properties of Multidimensional Indices Based on Principal Components, Factor Analysis, MIMIC and Structural Equation Models. Soc Indic Res 86, 481–496 (2008). https://doi.org/10.1007/s11205-007-9181-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-007-9181-8