Multivariate Reliability Theory

Wittmann, Werner W.

doi:10.1007/978-1-4613-0893-5_16

Principles of Symmetry and Successful Validation Strategies

Werner W. Wittmann

Chapter

Part of the book series: Perspectives on Individual Differences (PIDF)

Abstract

Reliability is one of the most basic concepts of science and a prerequisite for scientific work. It is also the psychometrician’s favorite concept. Without reliable measurements, not even scientific empirical psychology could exist, let alone multivariate psychology or multivariate experimental psychology.

Author information

Authors and Affiliations

Psychological Institute, Department of Personality Psychology, University of Freiburg, D7800, Freiburg im Breisgau, Federal Republic of Germany
Werner W. Wittmann

Editor information

Editors and Affiliations

Pennsylvania State University, University Park, Pennsylvania, USA
John R. Nesselroade
University of Hawaii at Manoa, 96844, Honolulu, Hawaii, USA
Raymond B. Cattell

Cite this chapter

Wittmann, W.W. (1988). Multivariate Reliability Theory. In: Nesselroade, J.R., Cattell, R.B. (eds) Handbook of Multivariate Experimental Psychology. Perspectives on Individual Differences. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0893-5_16

DOI: https://doi.org/10.1007/978-1-4613-0893-5_16
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8232-7
Online ISBN: 978-1-4613-0893-5
eBook Packages: Springer Book Archive
