Abstract
Exploratory factor analysis is a statistical method widely used in quantitative psychology for the construction of scales and measurement instruments. It aims to reduce the complexity of a data set and explain the common and unique variance using latent variables. In introductory textbooks, exploratory factor analysis is generally presented in contrast to confirmatory factor analysis as a theory- or a hypothesis-generating process that does not require prior background, theory or hypothesis to be performed. The aim of the present paper is to analyze this claim and clarify in which sense exploratory factor analysis is theory-laden. We provide a careful examination of the concepts it involves and thereby establish a clear limitation of the epistemic scope of exploratory factor analysis.
Similar content being viewed by others
Notes
See Comrey (1988) for methodological considerations regarding scale development.
Currently, CFA is used to refer to confirmatory factor analysis. It should be noted, however, that CFA has also been used previously in the psychology literature as an acronym for common factor analysis, which is, basically, factor analysis.
See Borsboom (2008) on the distinction between observed and latent variables.
EFA should not be confused with another different (although comparable) statistical technique known as principal component analysis (cf. Borgatta and Stull 1986). In comparison with EFA, principal component analysis rather models the principal components as linear combinations of the observed variables (see Jolliffe and Morgan 1992). Principal component analysis, which is a descriptive statistical tool, is also used in psychological research, mainly in psychometrics (see for example Jolliffe 2002, ten Berge et al. 1992, Kiers and Berge 1994, ten Berge and Kiers 1997 and Kiers and Mechelen 2001). The main difference between principal component analysis and EFA is that the latter accounts for potential error in measurement.
Many factor extraction techniques assume that the variables are continuous and normally distributed (Byrne 2012, p.128-9). However, these assumptions are not necessary conditions to perform an EFA. First, different estimators have been developed to deal with categorical data and, further, if the data approximates a normal distribution, then not addressing the fact that the data is categorical is likely negligible (see Byrne 2012, pp.128-32). Second, there are extraction techniques that address the violation of the assumption of normality (see Flora et al. 2012, pp.10-1). See Michell (1997, 2003, 2004) for a criticism of the violation of the assumption of continuity as well as Borsboom and Mellenbergh (2004) for an answer to Michell.
An eigenvalue represents the proportion of the variance explained by the factor (cf. Tabachnick and Fidell 2013).
When the variances of observed and latent variables are equal to one and factors are orthogonal, the loading of a variable on a factor is the correlation between the two (Tabachnick and Fidell 2013, p.614).
In contrast, principal component analysis provides a formative measurement model.
Formally, CFA generates an estimated population covariation matrix using the hypothesized model and determines whether it fits the covariation matrix of the sample.
A formal definition will be provided below.
During CFA, one can add covariation relations between items as well as covariation relations between factors. Hence, formally, measurement models for CFA are defined on the grounds of \(\mathcal {R}_{M}\subseteq \mathcal {V}\times \mathcal {V}_{j}, \text { with } \mathcal {V}_{j} \in \mathcal {V}_{\times } \). The measurement model provided by EFA, however, only yields relationships between items and factors.
As a caveat, the reader should keep in mind that we are not arguing in favor of a specific terminology that we think should be used in psychology. The aim of this paper is to examine the extent of EFA’s theory-ladenness as it is used in psychological research during scale development. Accordingly, we adopt the aforementioned definitions only to make our analysis as explicit and non-ambiguous as possible. Though some readers might disagree with the terminology we use, the emphasis should rather be on the conclusions we reach and the notions we analyze.
Given that a conceptual background is understood as a set, its size corresponds to its cardinality.
This would not be the case for principal component analysis, which is a tool that aims to describe samples (cf. Kiers and Mechelen 2001).
References
Akaike, H. 1987. Factor analysis and AIC. Psychometrika 52(3): 317–332.
American Psychiatric Association. 2013. Diagnostic and statistical manual of mental disorders: DSM-5, 5th edition. Washington: American Psychiatric Association.
Ariew, R. 1984. The Duhem thesis. The British Journal for the Philosophy of Science 35(4): 313–325.
Armstrong, J.S. 1967. Derivation of theory by means of factor analysis or Tom swift and his electric factor analysis machine. The American Statistician 21(5): 17–21.
Aron, A., E.J. Coups, and E.N. Aron. 2013. Statistics for psychology. Pearson, 6th edition.
Baird, D. 1987. Exploratory factor analysis, instruments and the logic of discovery. The British Journal for the Philosophy of Science 38(3): 319–337.
Borgatta, E.F. 1964. The structure of personality characteristics. Behavioral Science 9(1): 8–17.
Borgatta, E.F., and D.E. Stull. 1986. A cautionary note on the use of principal component analysis. Sociological Methods & Research 15(1-2): 160–168.
Borkenau, P., and F. Ostendorf. 1990. Comparing exploratory and confirmatory factor analysis: A study on the 5-factor model of personality. Personality and Individual Differences 11(5): 515– 524.
Borsboom, D. 2008. Latent variable theory. Measurement 6(1-2): 25–53.
Borsboom, D., and G.J. Mellenbergh. 2004. Why psychometrics is not pathological: A comment on Michell. Theory & Psychology 14(1): 105–120.
Borsboom, D., G.J. Mellenbergh, and J. van Heerden. 2003. The theoretical status of latent variables. Psychological Review 110(2): 203–219.
Byrne, B.M. 2012. Structural Equation Modeling with Mplus. Routledge.
Cattell, R.B. 1943. The description of personality: Basic traits resolved into clusters. The Journal of Abnormal and Social Psychology 38(4): 476–506.
Cattell, R.B. 1945. The description of personality: Principles and findings in a factor analysis. The American Journal of psychology 58(1): 69–90.
Comrey, A.L. 1988. Factor-analytic methods of scale development in personality and clinical psychology. Journal of Consulting and Clinical Psychology 56(5): 754–761.
Deci, E.L. 1985. Intrinsic Motivation and Self-determination in Human Behavior. Plenum.
Digman, J.M., and N.K. Takemoto-Chock. 1981. Factors in the natural language of personality: Re-analysis, comparison, and interpretation of six major studies. Multivariate Behavioral Research 16(2): 149–170.
Duhem, P. 1906. La théorie physique, son objet et sa structure. Chevalier & Rivière.
Edwards, J.R., and R.P. Bagozzi. 2000. On the nature and direction of relationships between constructs and measures. Psychological Methods 5(2): 155–174.
Einarsen, S. 2000. Harassment and bullying at work: A review of the Scandinavian approach. Aggression and Violent Behavior 5(4): 379–401.
Einarsen, S., H. Hoel, and G. Notelaers. 2009. Measuring exposure to bullying and harassment at work: Validity, factor structure and psychometric properties of the Negative Acts Questionnaire-Revised. Work & Stress 23(1): 24–44.
Field, A. 2009. Discovering statistics using SPSS. Sage, 3rd edition.
Finch, J.F., and S.G. West. 1997. The investigation of personality structure: Statistical models. Journal of Research in Personality 31(4): 439–485.
Fiske, D.W. 1949. Consistency of the factorial structures of personality ratings from different sources. The Journal of Abnormal and Social Psychology 44(3): 329–344.
Flora, D.B., C. LaBrish, and R.P. Chalmers. 2012. Old and new ideas for data screening and assumption testing for exploratory and confirmatory factor analysis. Frontiers in Psychology 3(55): 1–21.
Frigg, R., and S. Hartmann. 2012. Models in science. The Stanford Encyclopedia of Philosophy, ed. Zalta E. N.
Glymour, C. 2001. The mind’s arrows: Bayes nets and graphical causal models in psychology. MIT Press.
Goldberg, L.R. 1990. An alternative “description of personality”: The Big-Five factor structure. Journal of Personality and Social Psychology 59(6): 1216–1229.
Haig, B.D. 2005. Exploratory factor analysis, theory generation, and scientific method. Multivariate Behavioral Research 40(3): 303–329.
Huck, S.W. 2012. Reading statistics and research. Pearson, 6th edition.
Hurley, A.E., T.A. Scandura, C.A. Schriesheim, M.T. Brannick, A. Seers, R.J. Vandenberg, and L.J. Williams. 1997. Exploratory and confirmatory factor analysis: Guidelines, issues, and alternatives. Journal of Organizational Behavior 18 (6): 667–683.
John, O.P., and S. Srivastava. 1999. The Big Five trait taxonomy: History, measurement, and theoretical perspectives. Handbook of Personality: Theory and Research, 2nd, 102–138. Elsevier.
Jolliffe, I.T. 2002. Principal component analysis, 2nd. Berlin: Springer.
Jolliffe, I.T., and B.J.T. Morgan. 1992. Principal component analysis and exploratory factor analysis. Statistical Methods in Medical Research 1(1): 69–95.
Kiers, H.A., and I.V. Mechelen. 2001. Three-way component analysis: Principles and illustrative application. Psychological Methods 6(1): 84–110.
Kiers, H.A.L., and J.M.F. Berge. 1994. Hierarchical relations between methods for simultaneous component analysis and a technique for rotation to a simple simultaneous structure. British Journal of Mathematical and Statistical Psychology 47(1): 109–126.
Kotov, R., W. Gamez, F. Schmidt, and D. Watson. 2010. Linking “Big” personality traits to anxiety, depressive and substance use disorders: A meta-analysis. Psychological Bulletin 136(5): 768– 821.
Mayo, D.G. 1994. The new experimentalism, topical hypotheses, and learning from error. PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association, 270–279. University of Chicago Press.
Mellenbergh, G.J. 1994. Generalized linear item response theory. Psychological Bulletin 115(2): 300–307.
Michell, J. 1997. Quantitative science and the definition of measurement in psychology. British Journal of Psychology 88(3): 355–383.
Michell, J. 2003. The quantitative imperative positivism, naive realism and the place of qualitative methods in psychology. Theory & Psychology 13(1): 5–31.
Michell, J. 2004. The place of qualitative research in psychology. Qualitative Research in Psychology 1(4): 307–319.
Mulaik, S.A. 1987. A brief history of the philosophical foundations of exploratory factor analysis. Multivariate Behavioral Research 22(3): 267–305.
Mulaik, S.A. 1991. Factor analysis, information-transforming instruments, and objectivity: A reply and discussion. The British Journal for the Philosophy of Science 42(1): 87–100.
Muller, F.A. 2011. Reflections on the revolution at stanford. Synthese 183 (1): 87–114.
Norman, W.T. 1963. Toward an adequate taxonomy of personality attributes: Replicated factor structure in peer nomination personality ratings. The Journal of Abnormal and Social Psychology 66(6): 574– 583.
O’Connor, B. 2000. SPSS and SAS programs for determining the number of components using parallel analysis and Velicer’s MAP test. Behavioral Research Methods, Intruments and Computers 32(3): 396–402.
Park, H.S., R. Dailey, and D. Lemus. 2002. The use of exploratory factor analysis and principal component analysis in communication research. Human Communication Research 28(4): 562–577.
Schreiber, J.B., A. Nora, F.K. Stage, E.A. Barlow, and J. King. 2006. Reporting structural equation modeling and confirmatory factor analysis results: A review. The Journal of Educational Research 99(6): 323–338.
Spirtes, P., C. Glymour, and R. Scheines. 1991. From probability to causality. Philosophical Studies 64(1): 1–36.
Spirtes, P., C. Glymour, and R. Scheines. 2000. Causation, Prediction, and Search. MIT Press.
Stevens, S.S. 1946. On the theory of scales of measurement. Science 103 (2684): 677–680.
Suppe, F. 2000. Understanding scientific theories: An assessment of developments, 1969-1998. Philosophy of Science 67: S102–S115.
Suppes, P. 1960. A comparison of the meaning and uses of models in mathematics and the empirical sciences. Synthese 12(2): 287–301.
Suppes, P. 1962. Models of data. Logic, Methodology and Philosophy of Science: Proceedings of the 1960 International Congress, pages 252–261. Stanford University Press, eds. Nagel E., Suppes P., and Tarski A.
Suppes, P. 1967. What is a scientific theory?. Philosophy of Science Today, pages 55–67. Basic Books Inc, ed. Morgenbesser S.
Suppes, P. 2007. Statistical concepts in philosophy of science. Synthese 154 (3): 485–496.
Tabachnick, B.G., and L.S. Fidell. 2013. Using Multivariate Statistics. Pearson, 6th edition.
ten Berge, J.M.F., and H.A.L. Kiers. 1997. Are all varieties of pca the same? a reply to cadima & jolliffe. British Journal of Mathematical and Statistical Psychology 50(2): 367–368.
ten Berge, J.M.F., H.A.L. Kiers, and V. Van der Stel. 1992. Simultaneous components analysis. Statistica Applicata 4(4): 277–392.
Thompson, B. 1994. The pivotal role of replication in psychological research: Empirically evaluating the replicability of sample results. Journal of Personality 62 (2): 157–176.
Thompson, B. 2004. Exploratory and confirmatory factor analysis: Understanding concepts and applications. American Psychological Association.
Thurstone, L.L. 1947. Multiple factor analysis. University of Chicago Press.
Thurstone, L.L. 1954. An analytical method for simple structure. Psychometrika 19(3): 173–182.
Trépanier, S.-G., C. Fernet, and S. Austin. 2013. Workplace bullying and psychological health at work: The mediating role of the satisfaction of needs for autonomy, competence and relatedness. Work & Stress 27(2): 123–140.
Tupes, E.C., and R.E. Christal. 1992. Recurrent personality factors based on trait ratings. Journal of Personality 60(2): 225–251.
Ullman, J.B. 2013. Structural equation modeling. Using Multivariate Statistics. Pearson, 6th edition, eds. Tabachnick B. G. and Fidell L. S.
Van den Broeck, A., M. Vansteenkiste, H. De Witte, B. Soenens, and W. Lens. 2010. Capturing autonomy, competence and relatedness at work: Construction and initial validation of the work-related basic need satisfaction scale. Journal of Occupational and Organizational Psychology 83(4): 981–1002.
van Fraassen, B.C. 1980. The Scientific Image. Clarendon Press.
Velicer, W.F., and D.N. Jackson. 1990a. Component analysis versus common factor analysis: Some further observations. Multivariate Behavioral Research 25 (1): 97–114.
Velicer, W.F., and D.N. Jackson. 1990b. Component analysis versus common factor analysis: Some issues in selecting an appropriate procedure. Multivariate Behavioral Research 25(1): 1–28.
Winther, R.G. 2015. The structure of scientific theories. The Stanford Encyclopedia of Philosophy, ed. Zalta E. N.
Acknowledgments
I am indebted to the comments, suggestions, and criticisms made by anonymous referees on previous drafts of this paper, as well as those made by the editor. Thanks also to Stephan Hartmann and Gregory Gandenberger for valuable comments and suggestions. This research was financially supported by the Social Sciences and Humanities Research Council of Canada.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Peterson, C. Exploratory Factor Analysis and Theory Generation in Psychology. Rev.Phil.Psych. 8, 519–540 (2017). https://doi.org/10.1007/s13164-016-0325-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13164-016-0325-0