Summary
This chapter has the following learning objectives: Know the different approaches to forming theoretical concepts in qualitative research and to operationalizing theoretical concepts in quantitative research. Know what concept specification means and how to carry it out. Be able to distinguish latent from manifest variables. Be able to define the four levels of measurement and illustrate them with examples. Be able to distinguish the rating scale as a response format from the psychometric scale as a measurement instrument. Be able to weigh the advantages and disadvantages of measuring theoretical concepts with single indicators, psychometric scales, and indices, the three central variants of operationalization.
Learn about institutional subscriptionsLiteratur
Abdel-Khalek, A. M. (2006). Measuring happiness with single-item scale. Social Behavior and Personality, 34(2), 139–150.
Ahearn, E. P. (1997). The use of visual analog scales in mood disorders: A critical review. Journal of Psychiatric Research, 31(5), 569–579.
Aiken, L. R. (1985a). Evaluating ratings on bidirectional scales. Educational and Psychological Measurement, 45, 195–202.
Aiken, L. R. (1985b). Three coefficients for analyzing the reliability and validity of ratings. Educational and Psychological Measurement, 45, 131–142.
Aiken, L. R. (1987). Formulars for equating ratings on different scales. Educational and Psychological Measurement, 47, 51–54.
Aiken, L. R. (1996). Rating scales and checklists: Evaluating behavior, personality, and attitudes. Oxford: Wiley.
Aiken, L. R. (1997). Psychological testing and assessment. (9. Aufl.). Boston: Allyn & Bacon.
Alliger, G. M. & Williams, K. J. (1989). Confounding among measures of leniency and halo. Educational and Psychological Measurement, 49, 1–10.
Anderson, C. A., Shibuya, A., Ihori, N., Swing, E. L., Bushman, B. J., Sakamoto, A., et al. (2010). Violent video game effects on aggression, empathy, and prosocial behavior in Eastern and Western countries: A meta-analytic review. Psychological Bulletin, 136(2), 151–173.
Athey, T. R. & McIntyre, R. M. (1987). Effect of rater training on rater accuracy: Levels-of-processing theory and social facilitation theory perspectives. Journal of Applied Psychology, 72(4), 567–572.
Attneave, F. (1949). A method of graded dichotomies for the scaling of judgments. Psychological Review, 56(6), 334–340.
Ayalon, L., Goldfracht, M., & Bech, P. (2010). „Do you think you suffer from depression?“ Reevaluating the use of a single item question for the screening of depression in older primary care patients. International Journal of Geriatric Psychiatry, 25(5), 497–502.
Bach, E. (1980). Ein chemischer Index zur Überwachung der Wasserqualität von Fließgewässern. (24. Aufl.). Frankfurt/Main: DGM.
Baer, L. & Blais, M. A. (Eds.). (2009). Handbook of clinical rating scales and assessment in psychiatry and mental health (current clinical psychiatry). New York: Humana Press.
Baker, B. O., Hardyck, C. D., & Petrinovich, L. F. (1966). Weak measurement vs. strong statistics: An empirical critique of S. S. Stevens proscriptions of statistics. Educational and Psychological Measurement, 26, 291–309.
Bannister, B. D., Kinicki, A. J., Denisi, A. S., & Horn, P. W. (1987). A new method for the statistical control of rating error in performance ratings. Educational and Psychological Measurement, 47, 583–596.
Barr, M. A. & Raju, N. S. (2003). IRT-based assessments of rater effects in multiple-source feedback instruments. Organizational Research Methods, 6(1), 15–43.
Bearden, W. O., Netemeyer, R. G., & Haws, K. L. (2011). Handbook of marketing scales: Multi-item measures for marketing and consumer behavior research (vol. 3). Los Angeles: Sage.
Beaton, A. E. & Allen, N. L. (1992). Interpreting scales through scale anchoring. Journal of Educational and Behavioral Statistics, 17(2), 191–204.
Bergkvist, L. & Rossiter, J. R. (2007). The predictive validity of multiple-item vs. single-item measures of the same constructs. Journal of Marketing Research, 44, 175–184.
Bernardin, H. J. (1977). Behavioral expectation scales vs. summated ratings: A fairer comparison. Journal of Applied Psychology, 62, 422–427.
Bernardin, H. J. & Smith, P. C. (1981). A clarification of some issues regarding the development and use of behaviorally anchored ratings scales (BARS). Journal of Applied Psychology, 66(4), 458–463.
Bernardin, H. J. & Walter, C. S. (1977). Effects of rater training and diary-helping on psychometric error in ratings. Journal of Applied Psychology, 62, 64–69.
Bierhoff, H. W. (1996). Neue Erhebungsmethoden. In E. Erdfelder, R. Mausfeld & T. Meiser (Hrsg.), Handbuch Quantitative Methoden (S. 59–70). Weinheim: Beltz.
Bintig, A. (1980). The efficiency of various estimations of reliability of rating-scales. Educational and Psychological Measurement, 40, 619–644.
Blunt, A. (1983). Development of a Thurstone scale for measuring attitudes toward adult education. Adult Education Quarterly, 34(1), 16–28.
Böckenholt, U. (2001). Hierarchical modelling of paired comparison data. Psychological Methods, 6, 49–66.
Böckenholt, U. (2004). Comparative judgements as an alternative to ratings: Identifying the scale origin. Psychological Methods, 9, 453–465.
Bongers, D. & Rehm, G. (1973). Kontaktwunsch und Kontaktwirklichkeit von Bewohnern einer Siedlung. Unveröffentlichte Diplomarbeit. Universität Bonn.
Borg, I., Müller, M., & Staufenbiel, T. (1990). Ein empirischer Vergleich von fünf Standard-Verfahren zur eindimensionalen Skalierung. Archiv für Psychologie, 142, 25–33.
Borman, W. C. (1975). Effects of instructions to avoid error on reliability and validity of performance evaluation ratings. Journal of Applied Psychology, 60, 556–560.
Borman, W. C. (1986). Behavior-based rating scales. In R. A. Berk (Ed.), Performance Assessment: Methods and Applications. (pp. 100–120). Baltimore: Johns Hopkins University Press.
Bortz, J. & Lienert, G. A. (2008). Kurzgefaßte Statistik für die klinische Forschung. (3. Aufl.). Berlin: Springer.
Bortz, J., Lienert, G. A., & Boehnke, K. (2000). Verteilungsfreie Methoden in der Biostatistik. (2. Aufl.). Heidelberg: Springer.
Bortz, J., Lienert, G. A., & Boehnke, K. (2008). Verteilungsfreie Methoden in der Biostatistik. (3. Aufl.). Heidelberg: Springer.
Bortz, J. & Schuster, C. (2010). Statistik für Human- und Sozialwissenschaftler (Lehrbuch mit Online-Materialien). (7. Aufl.). Berlin: Springer.
Bradley, R. A. & Terry, M. E. (1952). The rank analysis of incomplete block designs. I: The method of paired comparison. Biometrika, 39, 324–345.
Brandt, L. W. (1978). Measuring of a measurement: Empirical investigation of the semantic differential. Probleme und Ergebnisse der Psychologie, 66, 71–74.
Breckler, S. J. (1994). A comparison of numerical indexes for measuring attitude ambivalence. Educational and Psychological Measurement, 54(2), 350–365.
Bühner, M. (2011). Einführung in die Test- und Fragebogenkonstruktion (3., aktualisierte Aufl.). München: Pearson Studium.
Campbell, J. P., Dunnette, M. D., Arvey, R. D., & Hellervik, L. V. (1973). The development and evaluation of behaviorally based rating scales. Journal of Applied Psychology, 57(1), 15–22.
Carbonell, L., Sendra, J. M., Bayarri, S., Izquierdo, L., & Tárrega, A. (2008). Thurstonian scales obtained by transformation of beta distributions. Food Quality and Preference, 19(4), 407–411.
Chatterjee, B. B. & Puhan, B. N. (1980). A Thurstone scale for measuring attitude towards sex. Indian Psychological Review, 19(3), 1–8.
Chignell, M. H. & Pattey, B. W. (1987). Unidimensional scaling with efficient ranking methods. Psychological Bulletin, 101, 304–311.
Clark, J. A. (1977). A method of scaling with incomplete pair-comparison data. Educational and Psychological Measurement, 37, 603–311.
Cogliser, C. C. & Schriesheim, C. A. (1994). Development and application of a new approach to testing the bipolarity of semantic differential. Educational and Psychological Measurement, 54(3), 594.
Cohen, J. (1969). Statistical power analysis for the behavioral sciences. Hillsdale: Erlbaum.
Conrad, E. & Maul, T. (1981). Introduction to experimental psychology. New York: Wiley.
Coombs, C. H., Dawes, R. M., & Tversky, A. (1970). Mathematical psychology. Englewood Cliffs: Prentice Hall.
Couper, M. P., Tourangeau, R., Conrad, F. G., & Singer, E. (2006). Evaluating the effectiveness of visual analog scales. A web experiment. Social Science Computer Review, 24(2), 227–245.
Crawshaw, L. (2009). Workplace bullying? Mobbing? Harassment? Distraction by a thousand definitions. Consulting Psychology Journal: Practice and Research, 61(3), 263–267.
Cronkhite, G. (1976). Effects of rater-concept-scale interactions and use of different factoring procedures upon evaluative factor structure. Human Communication Research, 2, 316–329.
Dalbert, C. (1992). Subjektives Wohlbefinden junger Erwachsener: Theoretische und empirische Analysen der Struktur und Stabilität. Zeitschrift für Differentielle und Diagnostische Psychologie, 13, 207–220.
David, H. A. (1963). The method of paried comparison. London: Griffin.
Dawis, R. V. (1987). Scale construction. Journal of Counseling Psychology, 34(4), 481–489.
De Cotiis, T. A. (1977). An analysis of the external validity and applied relevance of three rating formats. Organizational Behavior and Human Performance, 19, 247–266.
De Cotiis, T. A. (1978). A critique and suggested revision of behaviorally anchored rating scales developmental procedures. Educational and Psychological Measurement, 38, 681–690.
Diamantopoulos, A. (2005). The C-OAR-SE procedure for scale development in marketing: A comment. International Journal of Research in Marketing, 22, 1–9.
Diamantopoulos, A. & Winklhofer, H. M. (2001). Index construction with formative indicators: an alternative to scale development. Journal of Marketing Research, 38(269–277).
Diefenbacher, H. & Zieschank, R. (2008). Wohlfahrtsmessung in Deutschland. Ein Vorschlag für einen neuen Wohlfahrtsindex. Statusbericht zum Forschungsprojekt FKZ 3707 11 101/01. Zeitreihenrechnung zu Wohlfahrtsindikatoren. Abgerufen 22. Februar, 2012, unter http://www.beyond-gdp.eu/download/BMU_UBA_Endbericht_v20_endg.pdf.
Doll, J. (1988). Kognition und Präferenz: Die Bedeutung des Halo-Effektes für multiattributive Einstellungsmodelle. Zeitschrift für Sozialpsychologie, 19, 41–52.
Döring, N. (2005). Für Evaluation und gegen Evaluitis. Warum und wie Lehrevaluation an deutschen Hochschulen verbessert werden sollte. In B. Berendt, H.-P. Voss, & J. Wildt (Hrsg.), Neues Handbuch Hochschullehre (S. 1–22). Berlin: Raabe.
Döring, N. (2013). Zur Operationalisierung von Geschlecht im Fragebogen: Probleme und Lösungsansätze aus Sicht von Mess-, Umfrage-, Gender- und Queer-Theorie. Gender, 2, 94–113.
Dunn-Rankin, P., Knezek, G. A., Wallace, S., & Zhang, S. (2004). Scaling Methods. Mahwah: Erlbaum.
Edwards, A. L. & Kilpatrick, F. P. (1948). A technique for the construction of attitude scales. Journal of Applied Psychology, 32, 374–384.
Eiser, J. R. & Ströbe, W. (1972). Categorisation and social judgement. New York: Academic Press.
EKD. (2013). Evangelische Kirche in Deutschland. Zahlen und Fakten zum kirchlichen Leben. Abgerufen 21. August, 2013, unter http://www.ekd.de/download/zahlen_und_fakten_2013.pdf.
Evans, R. H. (1980). The upgraded semantic differential: a further test. Journal of the Market Research Society, 22(2), 143–147.
Ferguson, C. J. & Rueda, S. M. (2010). The Hitmann study: Violent video game exposure effects on aggressive behavior, hostile feelings, and depression. European Psychologist, 15(2), 99–108.
Finn, A. & Kayande, U. (2005). How fine is C-OAR-SE? A generalizability theory perspective on Rossiter’s procedure. International Journal of Research in Marketing, 22, 11–21.
Finstuen, K. (1977). Use of Osgood’s semantic differential. Psychological Reports, 41, 1219–1222.
Flade, A. (1978). Die Beurteilung umweltpsychologischer Konzepte mit einem konzeptspezifischen und einem universellen semantischen Differential. Zeitschrift für experimentelle und angewandte Psychologie, 25, 367–378.
Flynn, L. R. (1993). Do standard scales work in older samples? Marketing Letters, 4(2), 127–137.
Frank, D. & Schlund, W. (2000). Eine neue Lösung des alten Skalenproblems. Planung und Analyse, 6, 56 ff.
Friedman, B. A. & Cornelius III, E. T. (1976). Effect of rater participation on scale construction on the psychometric characteristics of two ratingscale formats. Journal of Applied Psychology, 61, 210–216.
Friedman, H. H., Friedman, L. W., & Gluck, B. (1988). The effects of scale-checking styles on responses to a semantic differential scale. Journal of the Market Research Society, 30(4), 477–481.
Gaito, J. (1980). Measurement scales and statistics. Resurgence of an old misconception. Psychological Bulletin, 87, 564–567.
Galovski, T. E., Malta, L. S., & Blanchard, E. B. (2006). Road rage: Assessment and treatment of the angry, aggressive driver. Washington: American Psychological Association.
Gardner, D. G., Cummings, L. L., Dunham, R. B., & Pierce, J. L. (1998). Single-item vs. multiple-item measurement scales: An empirical comparison. Educational and Psychological Measurement, 58, 898–915.
Garland, R. (1990). A comparison of three forms of the semantic differential. Marketing Bulletin, 1, 19.
Garner, W. R. & Hake, H. W. (1951). The amount of information in absolute judgments. Psychological Review, 58(6), 446–459.
Gescheider, G. A. (1988). Psychophysical scaling. Annual Revue of Psychology, 33, 169–200.
Glaser, B. G. (2002). Conceptualization: On theory and theorizing using grounded theory. International Journal of Qualitative Methods, 1(2), 3rd Article. Retrieved August 29, 2011, from http://www.ualberta.ca/~iiqm/backissues/1_2Final/pdf/glaser.pdf.
Gluth, S., Ebner, N. C., & Schmiedek, F. (2010). Attitudes toward younger and older adults: The German aging semantic differential. International Journal of Behavioral Development, 34(2), 147–158.
Gonzales, E., Tan, J., & Morrow-Howell, N. (2010). Assessment of the refined Aging Semantic Differential: Recommendations for enhancing validity. Journal of Gerontological Social Work, 53(4), 304–318.
Goodstadt, M. S. & Magid, S. (1977). When Thurstone and Likert agree: A confounding of methodologies. Educational and Psychological Measurement, 37(4), 811–818.
Granberg-Rademacker, J. S. (2010). An algorithm for converting ordinal scale measurement data to interval/ratio scale. Educational and Psychological Measurement, 70(1), 74–90.
Green, S. B., Sauser, W. I., Fagg, J. N., & Champion, C. H. (1981). Shortcut methods for deriving behaviorally anchored rating scales. Educational and Psychological Measurement, 41(3), 761–775.
Greenberg, J. (1990). Organizational justice: Yesterday, today, and tomorrow. Journal of Management, 16, 399–432.
Guilford, J. P. (1938). The computation of psychological values from judgements in absolute categories. Journal of Experimental Psychology, 22(1), 32–42.
Guttman, L. (1950). The basis of scalogram analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeld, S. A. Star, & J. A. Clausen (Eds.), Measurement and prediction. Studies in social psychology in World War II (vol. 4, pp. 60–90). Princeton: Princeton University Press.
Hand, D. J. (1996). Statistics and the theory of measurement. Journal of the Royal Statistical Society. Series A (Statistics in Society), 159(3), 445–492.
Hauenstein, N. M. A., Brown, R. D., & Sinclair, A. L. (2010). BARS and those mysterious, missing middle anchors. Journal of Business and Psychology, 25(4), 663–672.
Helmholtz, H. (1887). Zur Geschichte des Princips der kleinsten Action. Berlin: Reichsdruckerei.
Helmholtz, H. (1959). Die Tatsachen in der Wahrnehmung. Zählen und Messen erkenntnistheoretisch betrachtet. Darmstadt: Wissenschaftliche Buchgesellschaft.
Henss, R. (1989). Zur Vergleichbarkeit von Ratingskalen unterschiedlicher Kategorienzahl. Psychologische Beiträge, 31, 264–284.
Himmelfarb, S. (1993). The measurement of attitudes. In A. H. Eagly & S. Chaiken (Eds.), Psychology of attitudes (pp. 23–88). Belmont: Thomson/Wadsworth.
Hofacker, C. F. (1984). Categorical judgment scaling with ordinal assumptions. Multivariate Behavioral Research, 19(1), 91–106.
Hofstätter, P. R. (1957). Psychologie. Frankfurt/Main: Fischer.
Hofstätter, P. R. (1963). Einführung in die Sozialpsychologie. Stuttgart: Kröner.
Hofstätter, P. R. (1977). Persönlichkeitsforschung. Stuttgart: Kröner.
Horowitz, L. M., Inouye, D., & Seigelmann, E. Y. (1979). On avaraging judges’ rating to increase their correlation with an external criterion. Journal of Consulting and Clinical Psychology, 47, 453–458.
Hoyt, W. T. (2000). Rater bias in psychological research: When is it a problem and what can we do about it? Psychological Methods, 5(1), 64–86.
Hoyt, W. T. (2002). Bias in participant ratings of psychotherapy process: An initial generalizability study. Journal of Counseling Psychology, 49(1), 35–46.
Hoyt, W. T. & Kerns, M. D. (1999). Magnitude and moderators of bias in observer ratings: A meta-analysis. Psychological Methods, 4, 403–424.
Hull, R. B. & Buhyoff, G. J. (1981). On the „Law of Comparative Judgement“: Scaling with intransitive observers and multidimensional stimuli. Educational and Psychological Measurement, 41, 1083–1089.
Igou, E. R., Bless, H., & Schwarz, N. (2002). Making sense of standardized survey questions: The influence of reference periods and their repetition. Communication Monographs, 69(2), 179–187.
Inglehart, R. (1977). The silent revolution: Changing values and political styles among western publics. Princeton: Princeton University Press.
Inglehart, R. (1997). Modernization and postmodernization: Cultural, economic and political change in 43 societies. Princeton: Princeton University Press.
Jäger, R. (1998). Konstruktion einer Ratingskala mit Smilies als symbolische Marken. Institut für Psychologie, Technische Universität Berlin.
Jäger, R. S. & Petermann, F. (1992). Psychologische Diagnostik. (2. Aufl.). Weinheim: Psychologie Verlags Union.
Johnson, D.–M. & Vidulich, R. N. (1956). Experimental Manipulation of the Halo-Effect. Journal of Applied Psychology, 40, 130–134.
Jones, L. V. (1959). Some Invariant Findings under the Method of Successive Intervalls. American Journal of Psychology, 72, 210–220.
Jones, L. V. & Thurstone, L. L. (1955). The psychophysics of semantics: An empirical investigation. Journal of Applied Psychology, 39, 31–36.
Kahneman, D. & Tversky, A. (Eds.). (2000). Choices, values, and frames. Cambridge: Cambridge University Press.
Kane, R. B. (1971). Minimizing order effects in the semantic differential. Educational and Psychological Measurement, 31(137–144).
Kane, J. S., Bernardin, H. J., Villanova, P., & Peyrefitte, J. (1995). Stability of rater leniency: Three studies. Academy of Management Journal, 1995, 1036–1051.
Kaplan, K. J. (1972). On the ambivalence-indifference problem in attitude theory and measurement: A suggested modification of the semantic differential technique. Psychological Bulletin, 77(5), 361–372.
Keller, J. & Wagner-Steh, K. (2005). A Guttman scale for empirical prediction of level of domestic violence. Journal of Forensic Psychology Practice, 5(4), 37–48.
Kelley, H. H., Hovland, C. J., Schwartz, M., & Abelson, R. P. (1955). The influence of judges attitudes in three modes of attitude scaling. Journal of Social Psychology, 42, 147–158.
Kendall, M. G. (1955). Further contributions to the theory of paired comparison. Biometrics, 11, 43–62.
Kessler, J. (2009). Der Mythos vom globalen Dorf. Zur räumlichen Differenzierung von Globalisierungsprozessen. In J. Kessler & C. Steiner (Hrsg.), Facetten der Globalisierung: Zwischen Ökonomie, Politik und Kultur (S. 28–79). Wiesbaden: VS Verlag.
King, B. M., Rosopa, P. J., & Minium, E. W. (2010). Statistical Reasoning in the Behavioral Sciences (6. Aufl.). Hoboken: John Wiley & Sons.
Kingstrom, P. O. & Bass, A. R. (1981). A Critical Analysis of Studies Comparing Behaviorally Anchored Rating Scales (BARS) and Other Rating Formats. Personnel Psychology, 34(2), 263–289.
Kinicki, A. J. & Bannister, B. D. (1988). A test of the measurement assumptions underlying behaviorally anchored rating scales. Educational and Psychological Measurement, 48(1), 17–27.
Kinicki, A. J., Bannister, B. D., Hom, P. W., & Denisi, A. S. (1985). Behaviorally anchored rating scales vs. summated rating scales: Psychometric properties and susceptibility to rating bias. Educational & Psychological Measurement, 45(3), 535–549.
Klauer, K. C. (1989). Untersuchungen zur Robustheit von Zuschreibungs-mal-Bewertungsmodellen: Die Bedeutung von Halo-Effekten und Dominanz. Zeitschrift für Sozialpsychologie, 20, 14–26.
Klauer, K. C. & Schmeling, A. (1990). Sind Halo-Fehler Flüchtigkeitsfehler? Zeitschrift für experimentelle und angewandte Psychologie, 37, 594–607.
Knezek, G., Wallace, S., & Dunn–Rankin, P. (1998). Accuracy of Kendall’s chi-square. Approximation to circular triad distributions. Psychometrica, 63, 23–34.
Korman, A. K. (1971). Industrial and organizational psychology. Englewood Cliffs: Prentice Hall.
Krabbe, P. F. M. (2008). Thurstone scaling as a measurement method to quantify subjective health outcomes. Medical Care, 46(4), 357–365.
Krantz, D. H., Luce, R. D., Suppes, P., & Tversky, A. (2006a). Foundations of measurement volume II: Geometrical, threshold, and probabilistic representations. Mineola: Dover Publications.
Krantz, D. H., Luce, R. D., Suppes, P., & Tversky, A. (2006b). Foundations of measurements volume I: Additive and polynomial representations. Mineola: Dover Publications.
Krebs, D. & Hoffmeyer–Zlotnik, J. H. P. (2009). Bipolar vs. unipolar scale format in fully vs. endpoint verbalized scale. Paper presented at the Cognition in Survey Research, 3rd Conference of the European Survey Research Association. Warschau, 29th June – 3rd July, 2009.
Kromrey, H. (2000a). Empirische Sozialforschung: Modelle und Methoden der standardisierten Datenerhebung und Datenausweitung: Modelle und Methoden der Datenerhebung und Datenauswertung (12. Aufl.). Stuttgart: UTB.
Kromrey, H. (2000b). Qualität und Evaluation im System Hochschule. In R. Stockmann (Hrsg.), Evaluationsforschung (S. 233–258). Opladen: Leske & Budrich.
Krosnick, J. A. & Fabrigar, L. R. (2006). Designing great questionnaires: Insights from psychology. New York: Oxford University Press.
Latham, G. P., Wexley, K. N., & Pursell, E. D. (1975). Training managers to minimize rating error in the observation of behavior. Journal of Applied Psychology, 60, 550–555.
Lei, M. & Lomax, R. G. (2005). The effect of varying degrees of nonnormality in structural equation modeling. Structural Equation Modeling, 12(1), 1–27.
Leonhart, R. (2009). Lehrbuch Statistik. Einstieg und Vertiefung (2. Aufl.). Bern: Huber.
Li, F., Wang, E., & Zhang, F. (2002). The multitrait-multirater approach to analyzing rating biases. Acta Psychologica Sinica, 34(1), 89–96.
Likert, R. (1932). A technique for the measurement of attitudes. Archives of Psychology, 140, 1–55.
Lindemann, D. F. & Brigham, T. A. (2003). A Guttman scale for assessing condom use skills among college students. AIDS and Behavior, 7(1), 23–27.
Lissitz, R. W. & Green, S. B. (1975). Effect of number of scale points on reliability: A Monte Carlo approach. Journal of Applied Psychology, 60, 10–13.
Lohaus, D. (1997). Reihenfolgeeffekte in der Eindrucksbildung. Eine differenzierte Untersuchung verschiedener Meßzeiträume. Zeitschrift für Sozialpsychologie, 28, 298–308.
Lord, F. M. (1953). On the statistical treatmen of football numbers. American Psychologist, 8, 750–751.
Lozano, L. M., García–Cueto, E., & Muñiz, J. (2008). Effect of the number of response categories on the reliability and validity of rating scales. Methodology: European Journal of Research Methods for the Behavioral and Social Science, 4(2), 73–79.
Luce, R. D. (1959). Individual choice behavior. New York: Wiley.
Lütters, H. (2008). Serious fun in market research: The sniper scale. Marketing Review St. Gallen, 25(6), 17–22.
Maier, J., Maier, M., Maurer, M., Reinemann, C., & Meyer, V. (Eds.). (2009). Real-time response measurement in the social sciences: Methodological perspectives and applications. Frankfurt/Main: Lang.
Maier, J., Maurer, M., Reinemann, C., & Faas, T. (2006). Reliability and validity of real-time response measurement: A comparison of two studies of a televised debate in Germany. International Journal of Public Opinion Research, 19(1), 53–73.
Mann, I. T., Phillips, J. L., & Thompson, E. G. (1979). An examination of methodological issues relevant to the use and interpretation of the semantic differential. Applied Psychological Measurement, 3(2), 213–229.
Marcus, B. & Schuler, H. (2001). Leistungsbeurteilung. In H. Schuler (Hrsg.), Lehrbuch der Personalpsychologie (S. 397–433). Stuttgart: Schäffer-Poeschel.
Mari, L. (2005). The problem of foundations of measurement. Measurement, 38(4), 259–266.
Matell, M. S. & Jacoby, J. (1971). Is there an optimal number for Likert scale items? Study I: Reliability and validity. Educational and Psychological Measurement, 31, 657–674.
Maxwell, S. E. & Delaney, H. D. (1993). Bivariate median splits and spurious statistical significance. Psychological Bulletin, 113(1), 181–190.
McCarty, J. A. & Shrum, L. J. (2000). The measurement of personal values in survey research. A test of alternative rating procedures. Public Opinion Quarterly, 64, 271–298.
McCormack, B., Boldy, D., Lewin, G., & McCormack, G. R. (2011). Screening for depression among older adults referred to home care services: A single-item depression screener vs. the geriatric depression scale. Home Health Care Management and Practice, 23(1), 13–19.
McCormack, H. M., Horne, D. J., & Sheather, S. (1988). Clinical applications of visual analogue scales: a critical review. Psychological Medicine, 18, 1007–1019.
Michell, J. (1986). Measurement scales and statistics. A clash of paradigms. Psychological Bulletin, 100. 398–407.
Michell, J. (2005). The logic of measurement: A realistic overview. Measurement, 38(4), 285–294.
Mosier, C. J. (1941). A psychometric study of meaning. Journal of Social Psychology, 13, 123–140.
Mount, M. K., Sytsma, M. R., Hazucha, J. F., & Holt, K. E. (1997). Rater-ratee race effects in developmental performance rating of managers. Personnel Psychology, 50(1), 51–69.
Murakami, T. & Kroonenberg, P. M. (2003). Three-mode models and individual differences in semantic differential data. Multivariate Behavioral Research, 38(2), 247–283.
Myford, C. M. & Wolfe, E. W. (2003). Detecting and measuring rater effects using many-facet Rasch measurement: Part I. Journal of Applied Measurement, 4(4), 386–422.
Myford, C. M. & Wolfe, E. W. (2004). Detecting and measuring rater effects using many-facet Rasch measurement: Part II. Journal of Applied Measurement, 5(2), 189–227.
Nagy, M. S. (2002). Using a single-item approach to measure facet job satisfaction. Journal of Occupational and Organizational Psychology, 75, 77–86.
Neumann, W. L. (2003). Social research methods. Qualitative and quantitative approaches (5th edn.). Bosten: Pearson.
Newcomb, T. (1931). An experimant designed to test the validity of a rating technique. Journal of Educational Psychology, 22(4), 279–289.
Newstead, S. E. & Arnold, J. (1989). The effect of response format on ratings of teaching. Educational and Psychological Measurement, 49(1), 33–43.
Niederée, R. & Mausfeld, R. (1996a). Das Bedeutsamkeitsproblem in der Statistik. In E. Erdfelder, R. Mausfeld, & T. Meiser (Hrsg.), Handbuch Quantitative Methoden (S. 399–410). Weinheim: Psychologie Verlags Union.
Niederée, R. & Mausfeld, R. (1996b). Skalenniveau, Invarianz und „Bedeutsamkeit“. In E. Erdfelder, R. Mausfeld, & T. Meiser (Hrsg.), Handbuch Quantitative Methoden (S. 385–398). Weinheim: Psychologie Verlags Union.
Noll, H.-H. (2002). Globale Wohlfahrtsmaße als Instrumente der Wohlfahrtsmessung und Sozialberichterstattung: Funktionen, Ansätze und Probleme. In W. Glatzer, R. Habich, & K. U. Mayer (Hrsg.), Sozialer Wandel und Gesellschaftliche Dauerbeobachtung. Festschrift für Wolfgang Zapf (S. 317–336). Opladen: Leske & Budrich.
North, K. & Reinhardt, K. (2011). Kompetenzmanagement in der Praxis: Mitarbeiterkompetenzen systematisch identifizieren, nutzen und entwickeln (2. Aufl.). Wiesbaden: Gabler.
Ofir, C., Reddy, S. K., & Bechtel, G. G. (1987). Are semantic response scales equivalent? Multivariate Behavioral Research, 22(1), 21.
Orpinas, P. & Horne, A. M. (2006). Bullies and victims: A challenge for schools. In J. R. Lutzker (Ed.), Preventing violence: Research and evidence-based intervention strategies (pp. 147–165). Washington: American Psychological Association.
Orth, B. (1983). Grundlagen des Messens. In H. Feger & J. Bredenkamp (Hrsg.), Enzyklopädie der Psychologie: Themenbereich B, Serie I Forschungsmethoden der Psychologie, Bd. 3: Messen und Testen (S. 136–180). Göttingen: Hogrefe.
Osgood, C. E., Suci, G. J., & Tannenbaum, D. H. (1957). The measurement of meaning. Urbana: University of Illinois Press.
Parducci, A. (1963). Range-frequency compromise in judgement. Psychological Monographs, 77(2), 1–29.
Parducci, A. (1965). Category-judgement: a range-frequency model. Psychological Review, 72, 407–418.
Pepels, W. (2007). Market Intelligence: Moderne Marktforschung für Praktiker: Auswahlverfahren, Datenerhebung, Datenauswertung, Praxisanwendung, Marktprognose. Düsseldorf: Publics Publishing.
Perloff, J. M. & Persons, J. B. (1988). Biases resulting from the use of indexes: An application to attributional style and depression. Psychological Bulletin, 103(1), 95–104.
Peterson, R. A. (1999). Constructing effective questionnaires. Thousand Oaks: Sage.
Potosky, D. & Bobko, P. (1998). The Computer Understanding and Experience Scale: A Self-report measure of computer experience. Computers in Human Behavior, 14(2), 337–348.
Preston, C. C. & Colman, A. M. (2000). Optimal number of response categories in rating scales: reliability, validity, discriminating power, and respondent preferences. Acta Psychologica, 104(1), 1–15.
Rambo, W. W. (1963). The distribution of successive interval judgements of attitude statements: A note. Journal of Social Psychology, 60, 251–254.
Ramírez, J. M. & Andreu, J. M. (2009). The main sympthoms of the AHA-syndrome: Relationships between anger, hostility and agression on a normal population. In S. Bhave & S. Saini (Eds.), The AHA-syndrome and cardiovascular diseases 2009 (pp. 16–29). New Delhi: Anamaya.
Rasmussen, J. L. (1989). Analysis of Likert-scale data: A reinterpretation of Gregoire and Driver. Psychological Bulletin, 105(1), 167–170.
Reinemann, C., Maier, J., Faas, T., & Maurer, M. (2005). Reliabilität und Validität von RTR-Messungen. Ein Vergleich zweier Studien zur zweiten Fernsehdebatte im Bundestagswahlkampf 2002. Publizistik, 20, 56–73.
Reiss, I. L. (1964). The scaling of premarital sexual permissiveness. Marriage Family, 26, 188–198.
Roberts, J. S., Laughlin, J. E., & Wedell, D. H. (1999). Validity issues in the Likert and Thurstone approaches to attitude measurement. Educational & Psychological Measurement, 59, 211–233.
Robins, R. W., Hendin, H. M., & Trzesniewski, K. H. (2001). Measuring global self-esteem: Construct validation of a single-item measure and the Rosenberg Self-Esteem Scale. Personality and Social Psychology Bulletin, 27(2), 151–161.
Rohrmann, B. (1978). Empirische Studie zur Entwicklung von Antwortskalen für die sozialwissenschaftliche Forschung. Zeitschrift für Sozialpsychologie, 9, 222–245.
Rohrmann, B. (2007). Verbal qualifiers for rating scales: Sociolinguistic considerations and psychometric data. Project Report. Retrieved 22.02.2012, from http://www.rohrmannresearch.net/pdfs/rohrmann-vqs-report.pdf
Rohwer, G. & Pötter, U. (2002). Methoden sozialwissenschaftlicher Datenkonstruktion. Weinheim: Juventa.
Roskam, E. E. (1996). Latent-trait Modelle. In E. Erdfelder, R. Mausfeld, & T. Meiser (Hrsg.), Handbuch Quantitative Methoden (S. 431–458). Weinheim: Psychologie Verlags Union.
Rossiter, J. R. (2002). The C-OAR-SE procedure for scale development in marketing. International Journal of Research in Marketing, 19, 305–335.
Rossiter, J. R. (2010). Measurement for the social sciences: The C-OAR-SE method and why it must replace psychometrics. New York: Springer.
Rössler, P. (2011). Skalenhandbuch Kommunikationswissenschaft. Wiesbaden: VS Verlag.
Rost, J. (2004). Lehrbuch Testtheorie – Testkonstruktion (2. Aufl.). Bern: Huber.
Rozeboom, W. W. & Jones, L. V. (1956). The validity of the successive intervals method of psychometric scaling. Psychometrika, 21, 165–183.
Saal, F. E., Downey, R. G., & Lahey, M. A. (1980). Rating the ratings: Assessing the psychometric quality of rating data. Psychological Bulletin, 88(2), 413–428.
Saal, F. E. & Landy, F. J. (1977). The Mixed Standard Rating Scale: An evaluation. Organizational Behavior and Human Performance, 18, 19–35.
Sackett, P. R. & DuBois, C. L. (1991). Rater-ratee race effects on performance evaluation: Challenging meta-analytic conclusions. Journal of Applied Psychology, 76(6), 873–877.
Saito, T. (1994). Psychological scaling of the asymmetry observed in comparative judgement. British Journal of Mathematical and Statistical Psychology, 47(1), 41–62.
Scheuch, E. K. (1961). Sozialprestige und soziale Schichtung. In D. W. Glass & R. König (Hrsg.), Soziale Schichtung und soziale Mobilität. Sonderheft 5 der „Kölner Zeitschrift für Soziologie und Sozialpsychologie“ (S. 65–103). Opladen: Westdeutscher Verlag.
Scheuring, B. (1991). Primacy-Effekte, ein Ermüdungseffekt? Neue Aspekte eines alten Phänomens. Zeitschrift für Sozialpsychologie, 22, 270–274.
Schmeisser, D. R., Bente, G., & Isenbart, J. (2004). Am Puls des Geschehens. Die integrierte Rezeptionsprozessanalyse. Zum Mehrwert rezeptionsbegleitender Untersuchungsmethoden in der Werbewirkungsforschung. Planung und Analyse, 2004(1), 28–34.
Schneider, F. M., Erben, J., Altzschner, R.-S., Kockler, T., Petzold, S., & Satzl, I. (2011). Die Übungssequenz macht den Meister.…Eine experimentelle Studie zu Kontext-Effekten von Übungsstimuli bei Real-Time Response Messungen. In M. Suckfüll, H. Schramm, & C. Wünsch (Hrsg.), Rezeption und Wirkung in zeitlicher Perspektive (S. 253–270). Baden-Baden: Nomos.
Schnell, R., Hill, P. B., & Esser, E. (1999). Methoden der empirischen Sozialforschung. München: Oldenbourg.
Schnell, R., Hill, P. B., & Esser, E. (2008). Methoden der empirischen Sozialforschung (8. Aufl.). München: Oldenbourg.
Schulenberg, S. E. & Melton, A. M. (2007). Confirmatory factor analysis of the Computer Understanding and Experience Scale. Psychological Reports, 100(3), 1263–1269.
Schwab, D. P., Heneman, H. G., & DeCotiis, T. A. (1975). Behaviorally anchored rating scales. A review of the literature. Personnel Psychology, 28(4), 549–562.
Schwarz, N. (2008). Self-Reports: How the questions shape the answers. In R. H. Fazio & R. E. Petty (Eds.), Attitudes: Their structure, function, and consequences (pp. 49–67). New York: Psychology Press.
Schwarz, N., Knäuper, B., Hippler, H.-P., Noelle-Neumann, E., & Clark, L. (1991). Rating scales: Numeric values may change the meaning of scale labels. Public Opinion Quarterly, 55, 570–582.
Schwarz, N. & Oyserman, D. (2001). Asking Questions About Behavior: Cognition, Communication, and Questionnaire Construction. American Journal of Evaluation, 22(2), 127–160.
Schwarz, N., Wänke, M., Sedlmeier, P., & Betsch, T. (2002). Experiential and contextual heuristics in frequency judgement: Ease of recall and response scales. In P. Sedlmeier & T. Betsch (Eds.), Etc.: Frequency processing and cognition (pp. 89–108). New York: Oxford University Press.
Shapira, Z. & Shirom, A. (1980). New Issues in the use of behaviorally anchored rating scales: Level of analysis, the effects of incident frequency, and external validation. Journal of Applied Psychology, 65(5), 517–523.
Sherif, M. & Hovland, C. I. (1961). Social judgement. Assimilation and contrast effects in communication and attitude change. New Haven: Yale University Press.
Shore, T. H. & Tashchian, A. (2003). Effects of sex on raters’ accountability. Psychological Reports, 92(2), 693–702.
Sixtl, F. (1967). Meßmethoden der Psychologie. Weinheim: Beltz.
Smith, P. C. & Kendall, L. M. (1963). Retranslation of expectations: An approach to unambiguous anchors for rating scales. Journal of Applied Psychology, 47, 149–155.
Statistisches Bundesamt. (2011). Haushaltsbefragung auf Stichprobenbasis zum Zensus 2011. Retrieved 17.06.2013, from https://cdn.zensus2011.de/live/uploads/tx_templavoila/Fragebogen_Haushaltebefragung_20101007a.pdf
Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103(2684), 677–680.
Stevens, S. S. (1951). Mathematics, measurement and psychophysics. In S. S. Stevens (Ed.), Handbook of Experimental Psychology (pp. 1–49). New York: Wiley.
Steyer, R. & Eid, M. (1993). Messen und Testen. Heidelberg: Springer.
Stine, W. W. (1989). Meaningful inference: The role of measurement in statistics. Psychological Bulletin, 105(1), 147–155.
Strack, F., Schwarz, N., Ash, M. G., & Sturm, T. (2007). Asking questions: Measurement in the social sciences. In M. G. Ash & T. Sturm (Eds.), Psychology’s territories: Historical and contemporary perspectives from different disciplines (pp. 225–250). Mahwah: Lawrence Erlbaum Associates.
Strahan, R. F. (1980). More on averaging judges’ ratings: Determining the most reliable composite. Journal of Consulting and Clinical Psychology, 48, 587–589.
Subkoviak, M. J. (1974). Remarks on the method of paired comparisons: The effect of non-normality in Thurstone’s Comparative Judgement Model. Educational & Psychological Measurement, 34, 829–834.
Suppes, P., Krantz, D. H., Luce, R. D., & Tversky, A. (2006). Foundations of measurement volume III: Representation, axiomatization, and invariance. Mineola: Dover Publications.
Taylor, J. B., Haefele, E., Thompson, P., & O’Donoghue, C. (1970). Rating scales as measures of clinical judgement II: The reliability of example-anchored scales under conditions of rater heterogeneity and divergent behavior sampling. Educational and Psychological Measurement, 30(2), 301–310.
Thomas, A., Palmer, J. K., & Feldman, J. M. (2009). Examination and measurement of halo via curvilinear regression: A new approach to halo. Journal of Applied Social Psychology, 39(2), 350–358.
Thorndike, E. L. (1920). A constant error in psychological ratings. Journal of Applied Psychology, 4, 469–477.
Thurstone, L. L. (1927). A „Law of Comparative Judgement“. Psychological Review, 34, 273–286.
Thurstone, L. L. & Chave, E. J. (1929). The measurement of attitudes. Chicago: University of Chicago Press.
Torgerson, W. S. (1958). Theory and methods of scaling. New York: Wiley.
Trommsdorff, V. (1975). Die Messung von Produktimages für das Marketing. Grundlagen und Operationalisierung. Köln: Heymanns.
Tziner, A., Joanis, C., & Murphy, K. R. (2000). A comparison of three methods of performance appraisal with regard to goal properties, goal perception, and ratee satisfaction. Group and Organization Management, 25(2), 175–190.
Upmeyer, A. (1985). Soziale Urteilsbildung. Stuttgart: Kohlhammer.
Upshaw, H. S. (1962). Own attitude as an anchor in equal appearing intervals. Journal of Abnormal and Social Psychology, 64, 85–96.
Van der Ven, A. (1980). Einführung in die Skalierung. Bern: Huber.
Wade Savage, C. & Ehrlich, P. (Eds.). (1991). Philosophical and foundational issues in measurement theory. Hillsdale: Erlbaum.
Waldman, D. A. & Avolio, B. J. (1991). Race effects in performance evaluations: Controlling for ability, education, and experience. Journal of Applied Psychology, 76(6), 897–901.
Wänke, M. & Fiedler, K. (2007). What is said and what is meant: Conversational implicatures in natural conversations, research settings, media, and advertising. In K. Fiedler (Ed.), Social Communication (pp. 223–255). New York: Psychology Press.
Wanous, J. P. & Hudy, M. J. (2001). Single-item reliability: A replication and extension. Organizational Research Methods, 4(4), 361–375.
Wanous, J. P., Reichers, A. E., & Hudy, M. J. (1997). Overall job satisfaction: How good are single-item measures? Journal of Applied Psychology, 82(2), 247–252.
Waxweiler, R. (1980). Psychotherapie im Strafvollzug. Eine empirische Erfolgsuntersuchung am Beispiel der sozialtherapeutischen Abteilung in einer Justizvollzugsanstalt. Basel: Beltz.
Wessels, M. G. (1994). Kognitive Psychologie (3. Aufl.). München: Reinhardt.
West, C. P., Dyrbye, L. N., Sloan, J. A., & Shanafelt, T. D. (2009). Single item measures of emotional exhaustion and depersonalization are useful for assessing burnout in medical professionals. Journal of General Internal Medicine, 24(12), 1318–1321.
Westermann, R. (1985). Empirical tests of scale type for individual ratings. Applied Psychological Measurement, 9, 265–274.
Wewers, M. E. & Lowe, N. K. (1990). A critical review of visual analogue scales in the measurement of clinical phenomena. Research in Nursing & Health, 13(4), 227–236.
Wirtz, M. A. & Caspar, F. (2002). Beurteilerübereinstimmung und Beurteilerreliabilität. Methoden zur Bestimmung und Verbesserung der Zuverlässigkeit von Einschätzungen mittels Kategoriensystemen und Ratingskalen. Göttingen: Hogrefe.
Wolfe, E. W. (2004). Identifying rater effects using latent trait models. Psychology Science, 46(1), 35–51.
Young, R. K. & Thiessen, D. D. (1991). Washing, drying, and anointing in adult humans (Homo sapiens): Commonalities with grooming sequences in rodents. Journal of Comparative Psychology, 105(4), 340–344.
Yu, J. H., Albaum, G., & Swenson, M. (2003). Is a central tendency error inherent in the use of semantic differential scales in different cultures? International Journal of Market Research, 45(2), 213–228.
Zakour, M. J. (1994). Measuring career-development volunteerism: Guttman scale analysis using Red Cross volunteers. Journal of Social Service Research, 19(3–4), 103–120.
Zhikun, D. & Fungfai, N. (2008). A new way of developing semantic differential scales with personal construct theory. Construction Management & Economics, 26(11), 1213–1226.
Zumbo, B. D. & Zimmerman, D. W. (1993). Is the selection of statistical methods governed by level of measurement? Canadian Psychology/Psychologie Canadienne, 34(4), 390–400.
© 2016 Springer-Verlag Berlin Heidelberg
Döring, N., Bortz, J. (2016). Operationalisierung. In: Forschungsmethoden und Evaluation in den Sozial- und Humanwissenschaften. Springer-Lehrbuch. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41089-5_8
DOI: https://doi.org/10.1007/978-3-642-41089-5_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41088-8
Online ISBN: 978-3-642-41089-5
eBook Packages: Psychology (German Language)