Zusammenfassung
Dieses Kapitel befasst sich mit grundlegenden Anforderungen an diagnostische Instrumente. Der Schwerpunkt liegt dabei auf psychologischen Tests. Neben einer Definition und beispielhaften Darstellungen psychologischer Tests werden Kernannahmen des Messens sowie der Klassischen Testtheorie und von Item-Response-Theorien skizziert. Wesentliche Strategien und Schritte der Testkonstruktion werden eingeführt. Schließlich erfolgt eine Betrachtung wesentlicher Testgütekriterien.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Literatur
American Educational Research Association (AERA), American Psychological Association (APA), & National Council on Measurement in Education (NCME). (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
American Psychiatric Association. (2015). Diagnostisches und Statistisches Manual Psychischer Störungen DSM-5: Deutsche Ausgabe herausgegeben von Peter Falkai und Hans-Ulrich Wittchen. Göttingen: Hogrefe.
Anderson, J. C., & Gerbing, D. W. (1991). Predicting the performance of measures in a confirmatory factor analysis with a pretest assessment of their substantive validities. Journal of Applied Psychology 76, 732–740.
Back, M. D., Küfner, A. C. P., Dufner, M., Gerlach, T. M., Rauthmann, J. F., & Denissen, J. J. A. (2013). Narcissistic admiration and rivalry: Disentangling the bright and dark sides of narcissism. Journal of Personality and Social Psychology 105, 1013–1037.
Barrick, M. R., Mount, M. K., & Judge, T. A. (2001). Personality and performance at the beginning of the new millennium: What do we know and where do we go next? International Journal of Selection and Assessment 9, 9–30.
Baumgärtel, F., & Thomas-Langel, R. (2015). TBS-TK Rezension. Psychologische Rundschau 66, 152–154.
Bentler, P. M. (2009). Alpha, dimension-free, and model-based internal consistency reliability. Psychometrika 74, 137–143.
Bledow, R., & Frese, M. (2005). Situational Judgment Test on personal initiative. https://docplayer.org/34298704-Situational-judgment-test-on-personal-initiative.html Zugegriffen: 01.09.2021
Bledow, R., & Frese, M. (2009). A situational judgment test of personal initiative and its relationship to performance. Personnel Psychology 62, 229–258.
Block, J. (1995). A contrarian view of the five-factor approach to personality description. Psychological Bulletin 117, 187–215.
Bornstein, R. F. (2011). Toward a process-focused model of test score validity: Improving psychological assessment in science and practice. Psychological Assessment 23, 532–544.
Brem-Gräser, L. (2001). Familie in Tieren: Die Familiensituation im Spiegel der Kinderzeichnung. Entwicklung eines Testverfahrens (8. Aufl.). München: Reinhardt.
Brickenkamp, R. (2002). Test d2: Aufmerksamkeits-Belastungs-Test (9. Aufl.). Göttingen: Hogrefe.
Brickenkamp, R., Schmidt-Atzert, L., & Liepmann, D. (2010). d2-R: Test d2 – Revision. Aufmerksamkeits- und Konzentrationstest. Göttingen: Hogrefe.
Broughton, R. (1984). A prototype strategy for construction of personality scales. Journal of Personality and Social Psychology 47, 1334–1346.
Brown, A., & Maydeu-Olivares, A. (2013). How IRT can solve problems of ipsative data in forced-choice questionnaires. Psychological Methods 18, 36–52.
Brunswik, E. (1952). The conceptual framework of psychology. Chicago: University of Chicago Press.
Bühner, M. (2011). Einführung in die Test- und Fragebogenkonstruktion (3. Aufl.). München: Pearson.
Bühner, M. (2021). Einführung in die Test- und Fragebogenkonstruktion (4. Aufl.). München: Pearson.
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin 56, 81–105.
Canfield, A. A. (1951). The "sten" scale—a modified C-Scale. Educational and Psychological Measurement 11, 295–297.
Cattell, R. B. (1946). Description and measurement of personality. Oxford: World Book Company.
Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research 1, 245–276.
Cho, E. (2016). Making reliability reliable: A systematic approach to reliability coefficients. Organizational Research Methods 19, 651–682.
Cleary, T. A. (1968). Test bias: Prediction of grades of Negro and white students in integrated colleges. Journal of Educational Measurement 5, 115–124.
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
Cohen, R. J., Swerdlik, M. E., & Sturman, E. (2013). Psychological testing and assessment: An introduction to tests and measurement (8th ed.). New York, NY: McGraw-Hill.
Colella, A., Hebl, M., & King, E. (2017). One hundred years of discrimination research in the Journal of Applied Psychology: A sobering synopsis. Journal of Applied Psychology 102, 500–513.
Colquitt, J. A., Sabey, T. B., Rodell, J. B., & Hill, E. T. (2019). Content validation guidelines: Evaluation criteria for definitional correspondence and definitional distinctiveness. Journal of Applied Psychology 104, 1243–1265.
Connolly, J. J., Kavanagh, E. J., & Viswesvaran, C. (2007). The convergent validity between self and observer ratings of personality: A meta-analytic review. International Journal of Selection and Assessment 15, 110–117.
Credé, M., Tynan, M. C., & Harms, P. D. (2017). Much ado about grit: A meta-analytic synthesis of the grit literature. Journal of Personality and Social Psychology 113, 492–511.
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika 16, 297–334.
Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin 52, 281–302.
Dahlke, J. A., Kostal, J. W., Sackett, P. R., & Kuncel, N. R. (2018). Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally. Journal of Applied Psychology 103, 980–1000.
Deary, I. J., Strand, S., Smith, P., & Fernandes, C. (2007). Intelligence and educational achievement. Intelligence 35, 13–21.
DeCastellarnau, A. (2017). A classification of response scale characteristics that affect data quality: a literature review. Quality & Quantity 52, 1–37.
Diagnostik- und Testkuratorium (2018a). Personalauswahl kompetent gestalten: Grundlagen und Praxis der Eignungsdiagnostik nach DIN 33430. Berlin, Heidelberg: Springer.
Diagnostik- und Testkuratorium (2018b). TBS-DTK. Testbeurteilungssystem des Diagnostik- und Testkuratoriums der Föderation Deutscher Psychologenvereinigungen. Revidierte Fassung vom 03. Jan. 2018. Psychologische Rundschau 18, 109–116.
Diedenhofen, B., & Musch, J. (2017). Empirical option weights improve the validity of a multiple-choice knowledge test. European Journal of Psychological Assessment 33, 336–344.
DiStefano, C., & Motl, R. W. (2006). Further investigating method effects associated with negatively worded items on self-report surveys. Structural Equation Modeling 13, 440–464.
Dunn, T. J., Baguley, T., & Brunsden, V. (2014). From alpha to omega: A practical solution to the pervasive problem of internal consistency estimation. British Journal of Psychology 105, 399–412.
Ebenrett, H. J., Hansen, D., & Puzicha, K. J. (2003). Verlust von Humankapital in Regionen mit hoher Arbeitslosigkeit. Politik und Zeitgeschichte 6–7, 25–31.
Eid, M., & Schmidt, K. (2014). Testtheorie und Testkonstruktion. Göttingen: Hogrefe.
Eid, M., Nussbeck, F. W., Geiser, C., Cole, D. A., Gollwitzer, M., & Lischetzke, T. (2008). Structural equation modeling of multitrait-multimethod data: different models for different types of methods. Psychological Methods 13, 230–253.
Eid, M., Gollwitzer, M., & Schmitt, M. (2017). Statistik und Forschungsmethoden (4. Aufl.). Weinheim: Beltz.
Entertainment and Sports Programming Network (ESPN) (2012). Serena Williams collapses in opener. ESPN.com news services vom 29. Mai 2012. Zugegriffen: 06. Juni 2020.
Ericsson, K. A., & Simon, H. A. (1984). Protocol analysis: Verbal reports as data. Cambridge, MA: Bradford Books/The MIT Press.
Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., & Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological Methods 4, 272–299.
Fahrenberg, J., Hampel, R., & Selg, H. (2010). FPI-R: Freiburger Persönlichkeitsinventar (8. Aufl.). Göttingen: Hogrefe.
Flanagan, J. C. (1954). The critical incident technique. Psychological Bulletin 51, 327–358.
Fleeson, W., & Gallagher, P. (2009). The implications of Big Five standing for the distribution of trait manifestation in behavior: Fifteen experience-sampling studies and a meta-analysis. Journal of Personality and Social Psychology 97, 1097–1114.
Frank, F., & Kanning, U. P. (2014). Lücken im Lebenslauf. Zeitschrift für Arbeits- und Organisationspsychologie 58, 155–162.
Freudenstein, J.-P., Strauch, C., Mussel, P., & Ziegler, M. (2019). Four personality types may be neither robust nor exhaustive. Nature Human Behaviour 3, 1045–1046.
Geiger, M., Pfattheicher, S., Hartung, J., Weiss, S., Schindler, S., & Wilhelm, O. (2018). Self‐compassion as a facet of neuroticism? A reply to the comments of Neff, Tóth‐Király, and Colosimo (2018). European Journal of Personality 32, 393–404.
Gerlach, M., Farb, B., Revelle, W., & Amaral, L. A. N. (2018). A robust data-driven approach identifies four personality types across four large data sets. Nature Human Behaviour 2, 735–742.
Golden, J. P., Bents, R., & Blank, R. (2004). Golden Profiler of Personality (GPOP). Deutsche Adaptation des Golden Personality Type Profiler von John P. Golden. Bern: Huber.
Gough, H. G., & Heilbrun, A. B. (1980). The adjective check list manual. Palo Alto, CA: Consulting Psychologists Press.
Gregory, R. J. (2004). Psychological testing: History, principles, and applications (4th ed.). Boston: Pearson.
Guion, R. M. (1980). On Trinitarian doctrines of validity. Professional Psychology 11, 385–398.
Gulliksen, H. (1950). Theory of mental tests. Hoboken, NJ: John Wiley & Sons Inc.
Hartmann, P., Reuter, M., & Nyborg, H. (2006). The relationship between date of birth and individual differences in personality and general intelligence: A large-scale study. Personality and Individual Differences 40, 1349–1362.
Hathaway, S. R., McKinley, J. C., & Engel, R. R. (2000). MMPI-2: Minnesota Multiphasic Personality Inventory-2 (Manual). Bern: Huber.
Hausknecht, J. P., Halpert, J. A., Di Paolo, N. T., & Gerrard, M. M. O. (2007). Retesting in selection: A meta-analysis of coaching and practice effects for tests of cognitive ability. Journal of Applied Psychology 92, 373–385.
Hautzinger, M., Keller, F., & Kühner, C. (2006). BDI-II: Beck Depressions-Inventar, Revision. Frankfurt am Main: Harcourt Test Services.
Heidenreich, K. (1993). Die Verwendung standardisierter Tests. In E. Roth (Hrsg.), Sozialwissenschaftliche Methoden. Lehr- und Handbuch für Forschung und Praxis (3. Aufl., S. 389–406). München: Oldenbourg.
Hemphill, J. F. (2003). Interpreting the magnitudes of correlation coefficients. American Psychologist 58, 78–80.
Hinkin, T. R. (1998). A brief tutorial on the development of measures for use in survey questionnaires. Organizational Research Methods 1, 104–121.
Hinkin, T. R., & Tracey, J. B. (1999). An analysis of variance approach to content validation. Organizational Research Methods 2, 175–186.
Hoekstra, R., Morey, R. D., Rouder, J. N., & Wagenmakers, E.-J. (2014). Robust misinterpretation of confidence intervals. Psychonomic Bulletin & Review 21, 1157–1164.
Höft, S., & Muck, P. M. (2009). TBS-TK Rezension: "Golden Profiler of Personality (GPOP). Deutsche Adaptation des Golden Personality Type Profiler von John P. Golden". Report Psychologie 34, 322–323.
Hogan, T. P., Benjamin, A., & Brezinski, K. L. (2000). Reliability methods: A note on the frequency of use of various types. Educational and Psychological Measurement 60, 523–531.
Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika 30, 179–185.
Hoyle, R. H. (2012). Handbook of structural equation modeling. New York, NY: Guilford Press.
Hoyle, R. H., & Duval, J. L. (2004). Determining the number of factors in exploratory and confirmatory factor analysis. In D. Kaplan (Ed.), The Sage handbook of quantitative methodology for the social sciences (pp. 302–317). Thousand Oaks, CA: Sage Publications.
International Test Commission (ITC). (2013). ITC Guidelines on Test Use. 8th October, 2013, Version 1.2. Final Version. Document reference: ITC-G-TU-20131008. https://www.intestcom.org/files/guideline_test_use.pdf. Zugegriffen: 26. März 2020.
Jacobson, N. S., & Truax, P. (1991). Clinical significance: a statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology 59, 12–19.
Kanning, U. P. (2012). Diagnostik zwischen Inkompetenz und Scharlatanerie: Phänomen, Ursachen, Perspektiven. Report Psychologie 37, 100–113.
Kaplan, R. M., & Saccuzzo, D. P. (2005). Psychological testing – Principles, applications, and issues. Berlmont, CA: Thomson Wadsworth.
Kasten, N., & Freund, P. A. (2016). A meta-analytical multilevel reliability generalization of Situational Judgment Tests (SJTs). European Journal of Psychological Assessment 32, 230–240.
Kelava, A., & Moosbrugger, H. (2020a). Deskriptivstatistische Itemanalyse und Testwertbestimmung. In H. Moosbrugger & A. Kelava (Hrsg.), Testtheorie und Fragebogenkonstruktion (3. Aufl., S. 143–158). Berlin, Heidelberg: Springer.
Kelava, A., & Moosbrugger, H. (2020b). Einführung in die Item-Response-Theorie (IRT). In H. Moosbrugger & A. Kelava (Hrsg.), Testtheorie und Fragebogenkonstruktion (3. Aufl., S. 369–410). Berlin, Heidelberg: Springer.
Kemper, C., Brähler, E., & Zenger, M. (2014). Psychologische und sozialwissenschaftliche Kurzskalen für Wissenschaft und Praxis – Eine Einführung. Berlin: Medizinisch Wissenschaftliche Verlagsgesellschaft.
Kersting, M. (2006). "DIN Screen": Leitfaden zur Kontrolle und Optimierung der Qualität von Verfahren und deren Einsatz bei beruflichen Eignungsbeurteilungen. Lengerich: Pabst.
Kramer, J. (2009). Allgemeine Intelligenz und beruflicher Erfolg in Deutschland: Vertiefende und weiterführende Metaanalysen. Psychologische Rundschau 60, 82–98.
Krauth, J. (1996). Klassische Testtheorie. In K. Pawlik (Hrsg.), Grundlagen und Methoden der Differentiellen Psychologie (Enzyklopädie der Psychologie, Serie Differentielle Psychologie und Persönlichkeitsforschung, Bd. 1, S. 647–671). Göttingen: Hogrefe.
Krosnick, J. A., & Fabrigar, L. R. (1997). Designing rating scales for effective measurement in surveys. In L. Lyberg, P. Biemer, M. Collins, E. De Leeuw, C. Dippo, N. Schwarz, & D. Trewin (Eds.), Survey Measurement and Process Quality (pp. 141–164). Hoboken, NJ: John Wiley & Sons Inc.
Krumm, S., Lievens, F., Hüffmeier, J., Lipnevich, A. A., Bendels, H., & Hertel, G. (2015). How "situational" is judgment in an situational judgment test? Journal of Applied Psychology 100, 399–416.
Krumm, S., Schäpers, P., & Göbel, A. (2016). Motive arousal without pictures? An experimental validation of a hybrid implicit motive test. Journal of Personality Assessment 98, 514–522.
Kruyen, P. M., Emons, W. H., & Sijtsma, K. (2013). On the shortcomings of shortened tests: A literature review. International Journal of Testing 13, 223–248.
Kubinger, K. D., & Holocher-Ertl, S. (2014). Adaptives Intelligenz Diagnostikum 3. Göttingen: Beltz.
Kuder, G. F., & Richardson, M. W. (1937). The theory of the estimation of test reliability. Psychometrika 2, 151–160.
Landy, F. J. (1986). Stamp collecting versus science: Validation as hypothesis testing. American Psychologist 41, 1183–1192.
Leibniz-Zentrum für Psychologische Information und Dokumentation (ZPID). (2001). Internationale Richtlinien für die Testanwendung, Version 2000. Deutsche Fassung. https://www.zpid.de/pub/tests/itc_richtlinien.pdf. Zugegriffen: 15. Apr. 2020.
Lenhard, A., Lenhard, W., Suggate, S., & Segerer, R. (2018). A continuous solution to the norming problem. Assessment 25, 112–125.
Liepmann, D., Beauducel, A., Brocke, B., & Amthauer, R. (2007). I-S-T 2000 R: Intelligenz-Struktur-Test 2000 R (2. Aufl.). Göttingen: Hogrefe.
Lienert, G. A., & Raatz, U. (1998). Testaufbau und Testanalyse (6. Aufl.). Weinheim: Psychologie Verlags Union.
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
Mariacher, H., & Neubauer, A. (2005). PAI 30: Test zur Praktischen Alltagsintelligenz. Göttingen: Hogrefe.
McCrae, R. R., & Costa, P. T. (1987). Validation of the five-factor model of personality across instruments and observers. Journal of Personality and Social Psychology 52, 81–90.
McDonald, R. P. (1999). Test theory: A unified approach. Mahwah, NJ: Laurence Erlbaum Associates.
McGrew, K. S. (2005). The Cattell-Horn-Carroll Theory of cognitive abilities. In D. P. Flanagan, & P. L. Harrison (Eds.), Contempory intellectual assessment: Theories, tests, and issues (pp. 136–181). New York, NY: Guilford Press.
McNeish, D., & Wolf, M. (2020). Thinking twice about sum scores. Behavior Research Methods 52, 2287–2305.
Mellenbergh, G. J. (1994). Generalized linear item response theory. Psychological Bulletin 115, 300–307.
Menold, N., & Bogner, K. (2015). Gestaltung von Ratingskalen in Fragebögen. Mannheim: GESIS – Leibniz-Institut für Sozialwissenschaften.
Meyer, G. J., Finn, S. E., Eyde, L. D., Kay, G. G., Moreland, K. L., Dies, R. R., Eisman, E. J., Kubiszyn, T. W., & Read, G. M. (2001). Psychological testing and psychological assessment: A review of evidence and issues. American Psychologist 56, 128–165.
Millman, J., Bishop, C. H., & Ebel, R. (1965). An analysis of test-wiseness. Educational and Psychological Measurement 25, 707–726.
Mittring, G., & Rost, D. H. (2008). Die verflixten Distraktoren. Über den Nutzen einer theoretischen Distraktorenanalyse bei Matrizentests (für besser Begabte und Hochbegabte). Diagnostica 54, 193–201.
Moosbrugger, H. (2012a). Item-Response-Theorie (IRT). In L. Schmidt-Atzert, & M. Amelang (Hrsg.), Psychologische Diagnostik (2. Aufl., S. 62–83). Berlin, Heidelberg: Springer.
Moosbrugger, H. (2012b). Klassische Testtheorie (KTT). In H. Moosbrugger & A. Kelava (Hrsg.), Testtheorie und Fragebogenkonstruktion (S. 103–117). Berlin, Heidelberg: Springer.
Moosbrugger, H., & Kelava, A. (2020). Testtheorie und Fragebogenkonstruktion (3. Aufl.). Berlin, Heidelberg: Springer.
Moosbrugger, H., & Schermelleh-Engel, K. (2012). Exploratorische (EFA) und konfirmatorische Faktorenanalyse (CFA). In H. Moosbrugger & A. Kelava (Hrsg.), Testtheorie und Fragebogenkonstruktion (2. Aufl., S. 325–343). Berlin, Heidelberg: Springer.
Naumann, J., Artelt, C., Schneider, W., & Stanat, P. (2010). Lesekompetenz von PISA 2000 bis PISA 2009. In E. Klieme, C. Artelt, J. Hartig, N. Jude, O. Köller, M. Prenzel, W. Schneider, & P. Stanat (Hrsg.), PISA 2009 – Bilanz nach einem Jahrzehnt (S. 23–71). Frankfurt am Main: Deutsches Institut für Internationale Pädagogische Forschung (DIPF).
Olaru, G., Witthöft, M., & Wilhelm, O. (2015). Methods matter: Testing competing models for designing short-scale big-five assessments. Journal of Research in Personality 59, 56–68.
Organisation for Economic Co-operation and Development (OECD). (2000). PISA 2000: Beispielaufgaben aus dem Lesekompetenztest. https://www.oecd.org/berlin/39803735.pdf. Zugegriffen: 02. April 2020.
Ostendorf, F., & Angleitner, A. (2004). NEO-PI-R: NEO-Persönlichkeitsinventar nach Costa und McCrae, revidierte Fassung. Göttingen: Hogrefe.
Oswald, W. D., & Roth, E. (1997). Der Zahlen-Verbindungs-Test. Göttingen: Hogrefe.
Pargent, F., Hilbert, S., Eichhorn, K., & Bühner, M. (2019). Can’t make it better nor worse. European Journal of Psychological Assessment 35, 891–899.
Paulhus, D. L. (1984). Two-component models of socially desirable responding. Journal of Personality and Social Psychology 46, 598–609.
Petermann, F. (2014). WAIS-IV: Wechsler Adult Intelligence Scale. Deutschsprachige Adaptation nach David Wechsler. Frankfurt am Main: Pearson Assessment.
Poropat, A. E. (2009). A meta-analysis of the five-factor model of personality and academic performance. Psychological Bulletin 135, 322–338.
Raju, N. S. (1970). New formula for estimating total test reliability from parts of unequal lengths. Proceedings of the Annual Convention of the American Psychological Association 5(Pt. 1), 143–144.
Rammstedt, B., & Krebs, D. (2007). Does response scale format affect the answering of personality scales? Assessing the Big Five dimensions of personality with different response scales in a dependent sample. European Journal of Psychological Assessment 23, 32–38.
Raven, J. G. (1965). Standard Progressive Matrices. Cambridge: University Press.
Robbins, S. B., Lauver, K., Le, H., Davis, D., Langley, R., & Carlstrom, A. (2004). Do psychosocial and study skill factors predict college outcomes? A meta-analysis. Psychological Bulletin 130, 261–288.
Rohrmann, B. (1978). Empirische Studien zur Entwicklung von Antwortskalen für die sozialwissenschaftliche Forschung: Zeitschrift für Soziologie 9, 222–245.
Rost, J. (1999). Test- und Fragebogenanalysen. In B. Strauß, H. Haag, & M. Kolb (Hrsg.), Datenanalyse in der Sportwissenschaft (S. 455–480). Schorndorf: Hofmann.
Rost, J. (2004). Lehrbuch Testtheorie – Testkonstruktion (2. Aufl.). Bern: Huber.
Salgado, J. F., Anderson, N., Moscoso, S., Bertua, C., de Fruyt, F., & Rolland, J. P. (2003a). A meta-analytic study of general mental ability validity for different occupations in the European community. Journal of Applied Psychology 88, 1068–1081.
Salgado, J. F., Anderson, N., Moscoso, S., Bertua, C., & de Fruyt, F. (2003b). International validity generalization of GMA and cognitive abilities: A European community meta-analysis. Personnel Psychology 56, 573–605.
Sargent, S. S. (1940). Thinking processes at various levels of difficulty. Archives of Psychology 249, 5–58.
Schaarschmidt, U., & Fischer, A. W. (2008). AVEM – Arbeitsbezogenes Verhaltens- und Erlebensmuster (3. Aufl.). Göttingen: Hogrefe.
Scherpenzeel, A. C., & Saris, W. E. (1997). The validity and reliability of survey questions: A meta-analysis of MTMM studies. Sociological Methods & Research 25, 341–383.
Schmalt, H. D., Sokolowski, K., & Langens, T. A. (2000). Das Multi-Motiv-Gitter für Anschluss, Leistung und Macht MMG. Frankfurt am Main: Swets.
Schmidt, F. L. (1992). What do data really mean? Research findings, meta-analysis, and cumulative knowledge in psychology. American Psychologist 47, 1173–1181.
Schmidt, F. L., & Hunter, J. E. (1998). The validity and utility of selection methods in personnel psychology: Practical and theoretical implications of 85 years of research findings. Psychological Bulletin 124, 262–274.
Schmidt-Atzert, L. (2007). Objektiver Leistungsmotivationstest OLMT – Software und Manual (2. Aufl., unter Mitarbeit von M. Sommer, M. Bühner & A. Jurecka). Mödling: Schuhfried.
Schmidt-Atzert, L., Hommers, W., & Hess, M. (1995). Der I-S-T 70: Eine Analyse und Neubewertung. Diagnostica 41, 108–130.
Schmidt-Atzert, L., Bühner, M., Rischen, S., & Warkentin, V. (2004). Erkennen von Simulation und Dissimulation im Test d2. Diagnostica 50, 124–133.
Schönbrodt, F. D., & Perugini, M. (2013). At what sample size do correlations stabilize? Journal of Research in Personality 47, 609–612.
Schuler, H., & Stehle, W. (1983). Neuere Entwicklungen des Assessment-Center-Ansatzes – beurteilt unter dem Aspekt der sozialen Validität. Zeitschrift für Arbeits-und Organisationspsychologie 27, 33–44.
Schultze, M. (2017). Constructing subtests using ant colony optimization [Dissertation]. Berlin: Freie Universität Berlin.
Schulze, J., West, S. G., Freudenstein, J.-P., Schäpers, P., Mussel, P., Eid, M., & Krumm, S. (2020). Hidden framings and hidden asymmetries in the measurement of personality – A combined lens‐model and frame‐of‐reference perspective. Journal of Personality, 89, 357–375.
Sparfeldt, J. R., Kimmel, R., Lowenkamp, L., Steingraber, A., & Rost, D. H. (2012). Not read, but nevertheless solved? Three experiments on PIRLS multiple choice reading comprehension test items. Educational Assessment 17, 214–232.
Staatsinstitut für Schulqualität und Bildungsforschung (ISB). (2004). Jahrgangsstufe 7: Mathematik. Genehmigter Lehrplan. https://www.isb-gym8-lehrplan.de/contentserv/3.1.neu/g8.de/index.php?StoryID=26298. Zugegriffen: 20. April 2020.
Stemmler, G., Hagemann, D., Amelang, M., Spinath, F. M., Hasselhorn, M., Kunde, W., & Schneider, S. (2016). Differentielle Psychologie und Persönlichkeitsforschung. Stuttgart: Kohlhammer.
Streiner, D. L. (2003). Starting at the beginning: An introduction to coefficient alpha and internal consistency. Journal of Personality Assessment 80, 99–103.
Strenze, T. (2007). Intelligence and socioeconomic success: A meta-analytic review of longitudinal research. Intelligence 35, 401–426.
Thielsch, M. T., Lenzner, T., & Melles, T. (2012). Wie gestalte ich gute Items und Interviewfragen? Praxis der Wirtschaftspsychologie II: Themen und Fallbeispiele für Studium und Praxis, 221–240.
Thissen, A., Koch, M., Becker, N., & Spinath, F. M. (2016). Construct your own response: The cube construction task asa novel format for the assessment of spatial ability. European Journal of Psychological Assessment 34, 304–311.
Thissen, A., Spinath, F. M., & Becker, N. (2019). The cube construction task allows for a better manipulation of item difficulties than current cube rotation tasks European Journal of Psychological Assessment, e-pub ahead of print. doi: https://dx.doi.org/10.1027/1015-5759/a000534.
Thorndike, R. L. (1971). Concepts of culture‐fairness. Journal of Educational Measurement 8, 63–70.
Wainer, H., Dorans, N. J., Flaugher, R., Green, B. F., & Mislevy, R. J. (2000). Computerized adaptive testing: A primer. New York, NY: Routledge.
Weijters, B., Geuens, M., & Schillewaert, N. (2010). The stability of individual response styles. Psychological Methods 15, 96–110.
Wetzel, E., & Greiff, S. (2018). The world beyond rating scales. European Journal of Psychological Assessment 34, 1–5.
Wirtz, M. A. (2013). Dorsch – Lexikon der Psychologie (16. Aufl.). Bern: Huber.
Wittchen, H.-U., Zaudig, M., & Fydrich, T. (1997). Strukturiertes Klinisches Interview für DSM-IV. Achse I: Psychische Störungen/Achse II: Persönlichkeitsstörungen. Göttingen: Hogrefe.
Wittmann, W. W. (1988). Multivariate reliability theory: Principles of symmetry and successful validation strategies. In J. R. Nesselroade, & R. B. Cattell (Eds.), Handbook of multivariate experimental psychology (2nd ed., pp. 505–560). New York: Plenum Press.
Wittmann, W. W. (2012). Principles of symmetry in evaluation research with implications for offender treatment. In T. Bliesener, A. Beelmann, & M. Stemmler (Eds.), Antisocial behavior and crime: Contributions of developmental and evaluation research to prevention and intervention (pp. 357–368). Cambridge: Hogrefe.
World Health Organization (WHO) (2018). International Statistical Classification of Diseases and Related Health Problems (ICD-11). Genf: World Health Organization.
Yoder, P., & Symons, F. (2010). Observational measurement of behavior. New York, NY: Springer.
Ziegler, M. (2014). Stop and state your intentions! European Journal of Psychological Assessment 30, 239–242.
Ziegler, M., & Reichert, A. (2017). TBS-TK Rezension: „Adaptives Intelligenz Diagnostikum 3 (AID 3)“. Psychologische Rundschau 68, 237–239.
Ziegler, M., Kemper, C. J., & Kruyen, P. (2014). Short scales–Five misunderstandings and ways to overcome them. Journal of Individual Differences 34, 185–189.
Ziegler, M., Maaß, U., Griffith, R., & Gammon, A. (2015). What is the nature of faking? Modeling distinct response patterns and quantitative differences in faking at the same time. Organizational Research Methods 18, 679–703.
Ziegler, M., MacCann, C., & Roberts, R. (2011). New perspectives on faking in personality assessment. Oxford: Oxford University Press.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer-Verlag GmbH Deutschland, ein Teil von Springer Nature
About this chapter
Cite this chapter
Krumm, S., Schmidt-Atzert, L., Amelang, M. (2021). Grundlagen diagnostischer Verfahren. In: Schmidt-Atzert, L., Krumm, S., Amelang, M. (eds) Psychologische Diagnostik. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-61643-7_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-61643-7_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-61642-0
Online ISBN: 978-3-662-61643-7
eBook Packages: Psychology (German Language)