Abstract
Reliability is one of the most basic concepts of science and a prerequisite for scientific work. It is also the psychometrician’s favorite concept. Without reliable measurements, not even scientific empirical psychology could exist, let alone multivariate psychology or multivariate experimental psychology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ajzen, I., and Fishbein, M. Understanding attitudes and predicting social behavior. Englewood Cliffs, N.J.: Prentice—Hall, 1980.
Alker, H. R. A typology of ecological fallacies. In M. Dogan and S. Rokkan (Eds.), Quantitative ecological analysis in social sciences. Cambridge, Mass.: MIT Press, 1969.
Baltes, P. B., Reese, H. W., and Nesselroade, J. R. Life-span developmental psychology: Introduction to research methods. Monterey: Brooks/Cole, 1977.
Baltes, P. B., Cornelius, S. W., and Nesselroade, J. R. Cohort effects in developmental psychology. In J. R. Nesselroade and P. B. Baltes (Eds.), Longitudinal research in the study of behavior and development. New York: Academic Press, 1979.
Bentler, P. M. A lower-bound method for the dimension-free measurement of internal consistency. Social Science Research, 1972, 1, 343–357.
Bentler, P. M., and Woodward, J. A. Inequalities among lower bounds to reliability: With applications to test construction and factor analysis. Psychometrika, 1980, 45, 249–267.
Bentler, P. M., and Woodard, J. A. The greatest lower bound to reliability. In H. Wainer and S. Messick (Eds.), Principles of modern psychological measurement. A Festschrift for Frederic M. Lord. Hillsdale, N.J.: Erlbaum, 1983.
Bereiter, C. Some persisting dilemmas in the measurement of change. In C. W. Harris (Ed.), Problems in measuring change. Madison: University of Wisconsin Press, 1963.
Bock, R. D. Contributions of multivariate experimental designs to educational research. In R. B. Cattell (Ed.), Handbook of multivariate experimental psychology. Chicago: Rand McNally, 1966.
Boruch, R. F., and Gomez, H. Sensitivity, bias, and theory in impact evaluations. Professional Psychology, 1977, 8, 411–434.
Brunswik, E. Perception and the representative design of psychological experiments. Berkeley: University of California Press, 1956.
Burstein, L. The analysis of multilevel data in educational research and evaluation. In D. C. Berliner (Ed.), Review of research in education. Vol. 8. Washington, D.C.: American Educational Research Association, 1980.
Burt, C. The appropriate use of factor analysis and analysis of variance. In R. B. Cattell (Ed.), Handbook of multivariate experimental psychology. Chicago: Rand McNally, 1966.
Buss, A. R. Toward a unified framework for psychometric concepts in the multivariate developmental situation: Intraindividual change and inter-and intraindividual differences. In J. R. Nesselroade and P. B. Baltes (Eds.), Longitudinal research in the study of behavior and development. New York: Academic Press, 1979.
Campbell, D. T., and Fiske, D. W. Convergent and discriminant validation by the multitrait—multimethod matrix. Psychological Bulletin, 1959, 56, 81–105.
Campbell, D. T., and Stanley, J. C. Experimental and quasi-experimental designs for research. Chicago: Rand McNally, 1966.
Carnap, R. The methodological character of theoretical concepts. In H. Feigl and M. Scriven (Eds.), Minnesota studies in the philosophy of science. Vol. 1. Minneapolis: University of Minnesota Press, 1956.
Cattell, R. B. (Ed.) Handbook of multivariate experimental psychology. Chicago: Rand McNally, 1966.
Cattell, R. B. Personality and mood by questionnaire. San Francisco: Jossey—Bass, 1973.
Cattell, R. B. The scientific use of factor analysis in behavioral and life sciences. New York: Plenum Press, 1978.
Cattell, R. B. Structural personality—learning theory: A wholistic multivariate research approach. New York: Praeger, 1983.
Cohen, J. Multiple regression as a general data-analytic system. Psychological Bulletin, 1968, 70, 426–443.
Cohen, J. Set correlation as a general multivariate data analytic method. Multivariate Behavioral Research, 1982, 17, 301–341.
Cohen, J., and Cohen, P. Applied multiple regression/correlation analysis for the behavioral sciences. Hillsdale, N.J.: Erlbaum, 1975 (1st ed.), 1983 (2nd ed.).
Conger, A. J. Estimating profile reliability and maximally reliable composites. Multivariate Behavioral Research, 1974, 9, 85–104.
Conger, A. J. Multivariate reliability: Implications for evaluating profiles. In N. Hirschberg and L. G. Humphreys (Eds.), Multivariate applications in the social sciences. Hillsdale, N.J.: Erlbaum, 1982.
Conger, A. J., and Lipschitz, R. Measures of reliability for profiles and test batteries. Psychometrika, 1973, 38, 411–427.
Conger, A. J., and Stallard, E. Equivalence among canonical factor analysis, canonical reliability analysis, and principal components analysis: Implications for data reduction of fallible measures. Educational and Psychological Measurement, 1976, 36, 619–626.
Conger, A. J., Conger, J. C., Farell, A. D., and Ward, D. What can the WISC-R measure? Applied Psychological Measurement, 1979, 3, 421–436.
Cook, T. D., and Campbell, D. T. The design and conduct of quasi-experiments and true experiments in field settings. In M. Dunnette (Ed.), Handbook of industrial and organizational psychology. Chicago: Rand McNally, 1976.
Cook, T. D., and Campbell, D. T. Quasi-experimentation: Design and analysis issues for field settings. Chicago: Rand McNally, 1979.
Cooley, W. W., and Leinhardt, G. The instructional dimension study. In H. E. Freeman and M. A. Solomon (Eds.), Evaluation studies review annual. Vol. 6. Beverly Hills: Sage, 1981.
Cooley, W. W., and Lohnes, P. R. Evaluation research in education. New York: Irvington, 1976.
Cramer, E. M., and Nicewander, W. A. Some symmetric invariant measures of multivariate association. Psychometrika, 1979, 44, 43–54.
Cronbach, L. J. The two disciplines of scientific psychology. American Psychologist, 1957, 12, 671–684.
Cronbach, L. J. Beyond the two disciplines of scientific psychology. American Psychologist, 1975, 30, 116–127.
Cronbach, L. J., and Furby, L. How we should measure “change” or should we? Psychological Bulletin, 1970, 74, 68–80.
Cronbach, L. J., Gleser, G. C., Nanda, H., and Rajaratnam, N. The dependability of behavioral measurements: Theory of generalizability for scores and profiles. New York: Wiley, 1972.
Drenth, P. J. D. Der psychologische Test: Eine Einführung in seine Theorie und seine Anwendungen. Leipzig: Barth, 1969.
Epstein, S. Traits are alive and well. In D. Magnusson and N. S. Endler (Eds.), Personality at the crossroads: Current issues in interactional psychology. Hillsdale, N.J.: Erlbaum, 1977.
Epstein, S. The stability of behavior. I. On predicting most of the people much of the time. Journal of Personality and Social Psychology, 1979, 37, 1097–1126.
Epstein, S. The stability of behavior. II. Implications for psychological research. American Psychologist, 1980, 35, 790–806.
Epstein, S. Aggregation and beyond: Some basic issues on the prediction of behavior. Journal of Personality, 1983a, 51, 360–392.
Epstein, S. The stability of confusion: A reply to Mischel and Peake. Psychological Review, 1983b, 90, 179–184.
Epstein, S. The stability of behavior across time and situations. In R. A. Zucker, J. Aronoff, and A. I. Rabin (Eds.), Personality and the prediction of behavior. Orlando: Academic Press, 1984.
Eysenck, H. J. The biological basis of personality. Springfield, Ill.: Thomas, 1967.
Fahrenberg, J., Myrtek, M., Kulick, B., and Frommelt, P. Eine psychologische Zeitreihenstudie an 20 Studenten über 8 Wochen. Archiv für Psychologie, 1977, 129, 242–264.
Fahrenberg, J., Selg, H., and Hampel, R. Das Freiburger Persönlichkeitsinventar FPI ( 3rd ed. ). Göttingen: Hogrefe, 1978.
Fishbein, M., and Ajzen, I. Belief, attitudes, intention, and behavior: An introduction to theory and research. Reading, Mass.: Addison-Wesley, 1975.
Fleiss, J. L. Comment on Overall and Woodward’s asserted paradox concerning the measurement of change. Psychological Bulletin, 1976, 83, 774–775.
Gulliksen, H. Theory of mental tests. New York: Wiley, 1950.
Guttman, L. What lies ahead for factor analysis? Educational and Psychological Measurement, 1958, 18, 497–515.
Guttman, L. Measurement as structural theory. Psychometrika, 1971, 36, 329–347.
Hammond, K. R. (Ed.) The psychology of Egon Brunswik. New York: Holt, Rinehart & Winston, 1966.
Hammond, K. R. Toward increasing competence of thought in public policy formation. In K. R. Hammond (Ed.), Judgment and decision in public policy formation. Boulder, Colo.: Westview Press, 1978.
Hammond, K. R., Hamm, R. M., and Grassia, J. Achieving generality over conditions: Combining the multitrait multimethod matrix and the representative designs of experiments. Boulder, Colo.: Center for Research on Jugment and Policy. University of Colorado Report No. 256, 1984.
Harris, C. W. (Ed.) Problems in measuring change. Madison: University of Wisconsin Press, 1963.
Hays, W. L. Statistics for the social sciences. New York: Holt, Rinehart & Winston, 1973.
Hoyt, C. Test reliability estimated by analysis of variance. Psychometrika, 1941, 6, 153–160.
Jackson, P. H., and Agunwamba, C. C. Lower bounds for the reliability of the total score on a test composed of non-homogeneous items. I. Algebraic lower bounds. Psychometrika, 1977, 42, 567–578.
Jäger, A. O. Dimensionen der Intelligenz ( 3rd ed. ). Göttingen: Hogrefe, 1973.
Jäger, A. O. Mehrmodale Klassifikation von Intelligenzleistungen: Experimentell kontrollierte Weiterentwicklung eines deskriptiven Intelligenzstrukturmodells. Diagnostica, 1982, 28, 195–225.
Joe, G. W., and Woodward, J. A. Some developments in multivariate generalizability. Psychometrika, 1976, 41, 205–217.
Jöreskog, K. G. Statistical estimation of structural models in longitudinal–developmental investigations. In J. R. Nesselroade and P. B. Baltes (Eds.), Longitudinal research in the study of behavior and development. New York: Academic Press, 1979.
Kaiser, H. F., and Michael, W. B. Domain validity and generalizability. Educational and Psychological Measurement, 1975, 35, 31–35.
Kerlinger, F. N. Foundations of behavioral research ( 2nd ed. ). New York: Holt, Rinehart & Winston, 1973.
Kerlinger, F. N., and Pedhazur, E. J. Multiple regression in behavioral research. New York: Holt, Rinehart & Winston, 1973.
Labouvie, E. W. Identity versus equivalence of psychological measures and constructs. In W. Poon (Ed.), Aging in the 1980s: Psychological Issues. Washington, D.C.: APA, 1980.
Labouvie, E. W. The study of multivariate change structures: A conceptual perspective. Multivariate Behavioral Research, 1981, 16, 23–35.
Leinhardt, G., and Seewald, A. M. Overlap: What’s tested, what’s taught? Journal of Educational Measurement, 1981, 18, 85–96.
Liebert, R. M., and Spiegler, M. D. Personality: Strategies for the study of man. Homewood, Ill.: Dorsey, 1974.
Lord, F. M. A paradox in the interpretation of group comparisons. Psychological Bulletin, 1967, 68, 304–305.
Lord, F. M., and Novick, M. R. Statistical theories of mental test scores. Reading, Mass.: Addison–Wesley, 1968.
Lumsden, J. Test theory. Annual Review of Psychology, 1976, 27, 251–280.
Mason, W. M., Wong, G. Y., and Entwistle, B. Contextual analysis through the multilevel linear model. In S. Leinhardt (Ed.), Sociological methodology 1983–1984. San Francisco: Josey–Bass, 1983.
Meehl, P. E. Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. Journal of Consulting and Clinical Psychology, 1978, 46, 806–834.
Michhel, W. Personality and assessment. New York: Wiley, 1968.
Michhel, W., and Peake, P. K. Beyond déjà vu in the search for crossituational consistency. Psychological Review, 1982, 89, 730–755.
Mulaik, S. A. The foundations of factor analysis. New York: McGraw–Hill, 1972.
Murray, H. A. Explorations in personality. London: Oxford University Press, 1938.
Nesselroade, J. R., and Baltes, P. B. (Eds.) Longitudinal research in the study of behavior and development. New York: Academic Press, 1979.
Nesselroade, J. R., and Bartsch, T. W. Multivariate perspectives on the construct validity of the trait–state distinction. In R. B. Cattell and R. M. Dreger (Eds.), Handbook of modern personality theory. Washington, D.C.: Hemisphere, 1977.
Nesselroade, J. R., and Cable, D. G. “Sometimes it’s okay to factor difference scores”—The separation of state and trait anxiety. Multivariate Behavioral Research, 1974, 9, 273–282.
Nesselroade, J. R., Jacobs, A., and Pruchno, R. Reliability versus stability in the measurement of psychological states: An illustration with anxiety measures. Unpublished manuscript, 1981.
Nicewander, W. A. A relationship between Harris factors and Guttman’s sixth lower bound to reliability. Psychometrika, 1975, 40, 197–203.
Nicewander, W. A., and Price, J. M. Dependent variable reliability and the power of statistical tests. Psychological Bulletin, 1978, 85, 405–409.
Nicewander, W. A., and Price, J. M. Reliability of measurement and the power of statistical tests: Some new results. Psychological Bulletin, 1983, 94, 524–533.
Nunnally, J. C. Psychometric theory ( 2nd ed. ). New York: McGraw–Hill, 1978.
Overall, J. E., and Woodward, J. A. Unreliability of difference scores: A paradox for the measurement of change. Psychological Bulletin, 1975, 82, 85–86.
Robinson, W. S. Ecological correlations and the behavior of individuals. American Sociological Review, 1950, 15, 351–357.
Rogosa, D., Brandt, D., and Zimowski, M. A growth curve approach to the measurement of change. Psychological Bulletin, 1982, 92, 726–748.
Rushton, J. P., Brainerd, C. J., and Pressley, M. Behavioral development and construct validity: The principle of aggregation. Psychological Bulletin, 1983, 94, 18–38.
Sechrest, L., and Yeaton, W. H. Assessing the effectiveness of social programs: Methodological and conceptual issues. New Directions in Program Evaluation, 1981, 9, 41–56.
Sechrest, L., and Yeaton, W. H. Magnitudes of experimental effects in social science research. Evaluation Review, 1982, 6, 579–600.
Sechrest, L., West, S. G., Phillips, M. A., Redner, R., and Yeaton, W. Some neglected problems in evaluation research: Strength and integrity of treatments. In L. Sechrest, S. G. West, M. A. Phillips, R. Redner, and W. Yeaton (Eds.), Evaluation studies review annual. Vol. 4. Beverly Hills: Sage, 1979.
Shavelson, R. J., and Webb, N. M. Generalizability theory: 1973–1980. British Journal of Mathematical and Statistical Psychology, 1981, 34, 133–166.
Spearman, C. The proof and measurement of association between two things. American Journal of Psychology, 1904, 15, 72–101.
Stanley, J. C. Reliability. In R. L. Thorndike (Ed.), Educational measurement (2nd ed.). Washington, D.C.: American Council on Education, 1971.
Stewart, D., and Love, W. A general canonical correlation index. Psychological Bulletin, 1968, 70, 160–163.
Sutcliffe, J. P. On the relationship of reliability to statistical power. Psychological Bulletin, 1980, 88, 509–515.
Thomdike, R. L. On the fallacy of imputing the correlations found for groups to the individuals or smaller groups composing them. American Journal of Psychology, 1939, 52, 122–124.
Thorndike, R. L. Personnel selection: Test and measurement techniques. New York: Wiley, 1949.
Thurstone, L. L. Multiple factor analysis. Chicago: University of Chicago Press, 1947.
Tucker, L. R. A suggested alternative formulation in the developments by Hursch, Hammond and Hursch, and by Hammond, Hursch, and Todd. Psychological Review, 1964, 71, 528–530.
Tukey, J. W. Analyzing data: Sanctification or detective work. American Psychologist, 1969, 24, 83–91.
von Linné, C. Systemae naturae per regna tria naturae (10th ed.). Stockholm, 1758.
Weiss, D. J., and Davison, M. L. Test theory and methods. Annual Review of Psychology, 1981, 32, 629–658.
Williams, R. H., and Zimmerman, D. W. The reliability of difference scores when errors are correlated. Educational and Psychological Measurement, 1977, 37, 679–689.
Wittgenstein, L. Tractratus logico-philosophicus. New York: Harcourt, Brace, 1922.
Wittgenstein, L. Philosophical investigations. New York: Macmillan Co., 1953.
Wittmann, W. W. Faktorenanalytische Modelle, Methodenstudien und Probleme der Reproduzierbarkeit. Freiburg im Breisgau: Phil. dissertation, 1977.
Wittmann, W. W. Drei Klassen verschiedener faktorenanalytischer Modelle und deren Zusammenhang mit dem Konzept der Alpha-Generalisierbarkeit der klassischen Testtheorie. Psychologische Beiträge, 1978, 20, 456–470.
Wittmann, W. W. Der Mangel an multivariaten Betrachtungsweisen in der psychologischen Forschungspraxis mit besonderer Berücksichtigung der Interaktionismus-Debatte. In E. D. Lantermann (Ed.), Wechselwirkungen: Psychologische Analysen der Mensch-Umwelt-Beziehung. Göttingen: Hogrefe, 1982.
Wittmann, W. W. Evaluationsforschung: Aufgaben, Probleme und Anwendungen. Berlin: Springer, 1985.
Wittmann, W. W., and Schmidt, J. Die Vorhersagbarkeit des Verhaltens aus Trait-Inventaren. Theoretische Grundlagen und empirische Ergebnisse mit dem Freiburger Persönlichkeitsinventar (FPI). Research Reports No. 10, Psychological Institute, University of Freiburg, West Germany, 1983.
Woodhouse, B., and Jackson, P. H. Lower bounds for the reliability of the total score on a test composed of non-homogeneous items. II. A search procedure to locate the greatest lower bound. Psychometrika, 1977, 42, 579–591.
Yeaton, W. H., and Redner, R. Measuring strength and integrity of treatments: Rationale, techniques, and examples. In R. F. Conner (Ed.), Methodological advances in evaluation research. Beverly Hills: Sage, 1981.
Yeaton, W. H., and Sechrest, L. Critical dimensions in the choice and maintenance of successful treatments: Strength, integrity, and effectiveness. Journal of Consulting and Clinical Psychology, 1981, 49, 156–167.
Zimmerman, D. W., and Williams, R. H. The relative error magnitude in three measures of change. Psychometrika, 1982a, 47, 141–147.
Zimmerman, D. W., and Williams, R. H. Gain scores in research can be highly reliable. Journal of Educational Measurement, 1982b, 19, 149–154.
Zimmerman, D. W., and Williams, R. H. On the high predictive potential of change and growth measures. Educational and Psychological Measurement, 1982c, 42, 961–968.
Zimmerman, D. W., Brotohusodo, T. L., and Williams, R. H. The reliability of sums and differences of test scores: Some new results and anomalies. Journal of Experimental Eduction, 1981, 49, 177–186.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Plenum Press, New York
About this chapter
Cite this chapter
Wittmann, W.W. (1988). Multivariate Reliability Theory. In: Nesselroade, J.R., Cattell, R.B. (eds) Handbook of Multivariate Experimental Psychology. Perspectives on Individual Differences. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0893-5_16
Download citation
DOI: https://doi.org/10.1007/978-1-4613-0893-5_16
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8232-7
Online ISBN: 978-1-4613-0893-5
eBook Packages: Springer Book Archive