Student Ratings of Instruction in College and University Courses

Higher Education: Handbook of Theory and Research

Abstract

Findings from research on student ratings are summarized from the 1970s to 2013. There were 1,874 references, including 564 since 1994, using the ERIC descriptors “student evaluation of teacher performance” and “higher education.” The authors address the validity of self-report data, misconceptions about student ratings, essentials of credible research, and elements of reliability and validity. Evidence of reliability includes consistency, stability, and generalizability of ratings. Validity evidence consists of relations to other variables, including achievement; instructor self-ratings; ratings by administrators, colleagues, alumni, and trained observers; and student-written comments as well as survey multidimensionality. Possible sources of bias are extraneous student, instructor, and course characteristics either unrelated or related to ratings. Few meaningful differences occur between ratings administered online versus on paper and ratings in online versus face-to-face courses. Recommendations are made for the appropriate use of student ratings and for future research.

Notes

  1. Content contained in IDEA Paper No. 50 is reprinted by permission of The IDEA Center.

  2. The authors converted various summary statistics reported in the multi-section studies into Pearson product-moment correlations.

References

  • Abrami, P. C. (2001). Improving judgments about teaching effectiveness using teacher rating forms. In M. Theall, P. C. Abrami, & L. A. Mets (Eds.), The student ratings debate: Are they valid? How can we best use them? (New directions for institutional research, no. 109, pp. 59–87). San Francisco: Jossey-Bass.

  • Abrami, P. C., & d’Apollonia, S. (1990). The dimensionality of ratings and their use in personnel decisions. In M. Theall & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice (New directions for teaching and learning, no. 43, pp. 97–111). San Francisco: Jossey-Bass.

  • Abrami, P. C., & d’Apollonia, S. (1991). Multidimensional students’ evaluations of teaching effectiveness - generalizability of “N = 1” research: Comments on Marsh (1991). Journal of Educational Psychology, 83, 411–415.

  • Abrami, P. C., Leventhal, L., & Perry, R. P. (1982a). Educational seduction. Review of Educational Research, 52, 446–464.

  • Abrami, P. C., Perry, R. P., & Leventhal, L. (1982b). The relationship between student personality characteristics, teacher ratings, and student achievement. Journal of Educational Psychology, 74, 111–125.

  • Abrami, P. C., d’Apollonia, S., & Rosenfeld, S. (2007). The dimensionality of student ratings of instruction: What we know, do not know, and need to do. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 385–445). Dordrecht: Springer.

  • Adams, M. J., & Umbach, P. D. (2012). Nonresponse and online student evaluations of teaching: Understanding the influence of salience, fatigue, and academic environments. Research in Higher Education, 53(5), 576–591.

  • Addison, W. E., & Stowell, J. R. (2012). Conducting research on student evaluations of teaching. In M. E. Kite (Ed.), Effective evaluations of teaching: A guide for faculty and administrators (pp. 5–12). Retrieved from the Society for the Teaching of Psychology website: http://teachpsych.org/ebooks/evals2012/index.php

  • Aleamoni, L. M. (1978). The usefulness of student evaluations in improving college teaching. Instructional Science, 7, 95–105.

  • Aleamoni, L. M. (1981). Student ratings of instruction. In J. Millman (Ed.), Handbook of teacher evaluation (pp. 110–145). Beverly Hills: Sage.

  • Aleamoni, L. M. (1987). Student rating myths versus research facts. Journal of Personnel Evaluation in Education, 1, 111–119.

  • Alhija, F. N. A., & Fresko, B. (2009). Student evaluation of instruction: What can be learned from students’ written comments? Studies in Educational Evaluation, 35, 37–44.

  • Anderson, H. M., Cain, J., & Bird, E. (2005). Online student course evaluations: Review of literature and a pilot study. American Journal of Pharmaceutical Education, 69, 34–43.

  • Apodaca, P., & Grad, H. (2005). The dimensionality of student ratings of teaching: Integration of uni- and multidimensional models. Studies in Higher Education, 30(6), 723–748.

  • d’Apollonia, S., & Abrami, P. C. (1997). Navigating student ratings of instruction. American Psychologist, 52, 1198–1208.

  • Arreola, R. A. (2006). Developing a comprehensive faculty evaluation system (2nd ed.). Bolton: Anker Publishing.

  • Avery, R. J., Bryant, W. K., & Mathios, A. (2006). Electronic course evaluations: Does an online delivery system influence student evaluations? The Journal of Economic Education, 37, 21–37.

  • Babor, T. F., & Del Boca, F. K. (1992). Just the facts: Enhancing measurements of alcohol consumption using self-report methods. In R. Litten & J. Allen (Eds.), Measuring alcohol consumption: Psychosocial and biochemical methods (pp. 3–19). Totowa: Humana Press.

  • Babor, T. F., Steinberg, K., Anton, R., & Del Boca, F. K. (2000). Talk is cheap: Measuring drinking outcomes in clinical trials. Journal of Studies on Alcohol, 61(1), 55–63.

  • Baird, J. S. (1987). Perceived learning in relation to student evaluation of university instruction. Journal of Educational Psychology, 79, 90–91.

  • Basow, S. A. (2000). Best and worst professors: Gender patterns in students’ choices. Sex Roles, 43(5/6), 407–417.

  • Beattie, J., Spooner, F., Jordan, L., Algozzine, B., & Spooner, M. (2002). Evaluating instruction in distance learning classes. Teacher Education and Special Education, 25, 124–132.

  • Beleche, T., Fairris, D., & Marks, M. (2012). Do course evaluations truly reflect student learning? Evidence from an objectively graded post-test. Economics of Education Review, 31(5), 709–719.

  • Benton, S. L., & Cashin, W. E. (2012). Student ratings of teaching: A summary of research and literature (IDEA Paper No. 50). Manhattan: The IDEA Center.

  • Benton, S. L., & Pallett, W. H. (2013). Class size matters. Inside Higher Education. http://www.insidehighered.com/views/2013/01/29/essay-importance-class-size-higher-education. 28 Jan 2013.

  • Benton, S. L., Webster, R., Gross, A. B., & Pallett, W. (2010a). IDEA technical report no. 16: An analysis of IDEA student ratings of instruction using paper versus online survey methods. Manhattan: The IDEA Center.

  • Benton, S. L., Webster, R., Gross, A. B., & Pallett, W. (2010b). IDEA technical report no. 15: An analysis of IDEA student ratings of instruction in traditional versus online courses. Manhattan: The IDEA Center.

  • Benton, S. L., Duchon, D., & Pallett, W. H. (2011). Validity of self-report student ratings of instruction. Assessment and Evaluation in Higher Education, 38, 377–389.

  • Benton, S. L., Brown, R., & Li, D. (2012a). Replication of IDEA Technical Report No. 12 tables: 2012 IDEA student ratings dataset. Unpublished manuscript.

  • Benton, S. L., Gross, A., & Brown, R. (2012b, October). Which learning outcomes and teaching methods are instructors really emphasizing in STEM courses? Presentation at American Association of Colleges and Universities Network for Academic Renewal, Kansas City.

  • Benton, S. L., Guo, M., Li, D., & Gross, A. (2013, April). Student ratings, teacher standards, and critical thinking skills. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

  • Beran, T., Violato, C., Kline, D., & Frideres, J. (2007). What’s the “use” of student ratings of instruction for administrators? Canadian Journal of Higher Education, 37, 27–43.

  • Berk, R. A. (2005). Survey of 12 strategies to measure teaching effectiveness. International Journal of Teaching and Learning in Higher Education, 17, 48–62.

  • Biglan, A. (1973). The characteristics of subject matter in different academic areas. Journal of Applied Psychology, 57, 195–203.

  • Braskamp, L. A., & Ory, J. C. (1994). Assessing faculty work: Enhancing individual and institutional performance. San Francisco: Jossey-Bass.

  • Braskamp, L. A., Ory, J. C., & Pieper, D. M. (1981). Student written comments: Dimensions of instructional quality. Journal of Educational Psychology, 73, 65–70.

  • Brener, N. D., Billy, J. O. G., & Grady, W. R. (2003). Assessment of factors affecting the validity of self-reported health-risk behavior among adolescents: Evidence from the scientific literature. Journal of Adolescent Health, 33, 436–457.

  • Brinko, K. T. (1990). Instructional consultation with feedback in higher education. Journal of Higher Education, 61, 65–83.

  • Brown, M. (2012, July). Learning analytics: Moving from concept to practice. In Educause Learning Initiative Brief. Retrieved from the EDUCAUSE website: http://net.educause.edu/ir/library/pdf/ELIB1203.pdf

  • Buchert, S., Laws, E. L., Epperson, J. M., & Bregman, N. J. (2008). First impressions and professor reputation: Influence on student evaluations of instruction. Social Psychology of Education, 11, 397–408.

  • Burdsal, C. A., & Harrison, P. D. (2008). Further evidence supporting the validity of both a multidimensional profile and an overall evaluation of teaching effectiveness. Assessment and Evaluation in Higher Education, 33, 567–576.

  • Burgoon, J. K., Birk, T., & Pfau, M. (1990). Nonverbal behaviors, persuasion, and credibility. Human Communication Research, 17, 140–169.

  • Campbell, J., & Bozeman, W. (2007). The value of student ratings: Perceptions of students, teachers, and administrators. Community College Journal of Research and Practice, 32(1), 13–24.

  • Campbell, D. T., & Stanley, J. C. (1963). Experimental and quasi-experimental designs for research. Boston: Houghton Mifflin.

  • Carrell, S., & West, J. (2010). Does professor quality matter? Evidence from random assignments of students to professors. Journal of Political Economy, 118(3), 409–432.

  • Carrier, N. A., Howard, G. S., & Miller, W. G. (1974). Course evaluations: When? Journal of Educational Psychology, 66, 609–613.

  • Cashin, W. E. (1989). Defining and evaluating college teaching (IDEA Paper No. 21). Manhattan: Kansas State University, Center for Faculty Evaluation and Development.

  • Cashin, W. E. (1990). Students do rate different academic fields differently. In M. Theall & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice (New directions for teaching and learning, no. 43, pp. 113–121). San Francisco: Jossey-Bass.

  • Cashin, W. E. (1996). Developing an Effective Faculty Evaluation System (IDEA Paper No. 33). Manhattan: Kansas State University, Center for Faculty Evaluation and Development.

  • Cashin, W. E. (2003). Evaluating college and university teaching: Reflections of a practitioner. In J. C. Smart (Ed.), Higher education: Handbook of theory and research (pp. 531–593). Dordrecht: Kluwer Academic Publishers.

  • Cashin, W. E., & Downey, R. G. (1992). Using global student ratings for summative evaluation. Journal of Educational Psychology, 84, 563–572.

  • Centra, J. A. (1976). The influence of different directions on student ratings of instruction. Journal of Educational Measurement, 13, 277–282.

  • Centra, J. A. (1979). Determining faculty effectiveness: Assessing teaching, research, and service for personnel decisions and improvement. San Francisco: Jossey-Bass.

  • Centra, J. A. (1993). Reflective faculty evaluation: Enhancing teaching and determining faculty effectiveness. San Francisco: Jossey-Bass.

  • Centra, J. A. (2003). Will teachers receive higher student evaluations by giving higher grades and less course work? Research in Higher Education, 44, 495–518.

  • Centra, J. A. (2009). Differences in responses to the student instructional report: Is it bias? Princeton: Educational Testing Service.

  • Centra, J. A., & Gaubatz, N. B. (2000). Is there a gender bias in student evaluations of teaching? Journal of Higher Education, 70, 17–33.

  • Clayson, D. E. (2009). Student evaluation of teaching: Are they related to what students learn? Journal of Marketing Education, 31, 16–30.

  • Clayson, D. E., & Sheffet, M. J. (2006). Personality and the student evaluation of teaching. Journal of Marketing Education, 28(2), 149–160.

  • Cohen, P. A. (1980). Effectiveness of student-rating feedback for improving college instruction: A meta-analysis of findings. Research in Higher Education, 13, 321–341.

  • Cohen, P. A. (1981). Student ratings of instruction and student achievement: A meta-analysis of multisection validity studies. Review of Educational Research, 51, 281–309.

  • Cohen, P. A. (1987, April). A critical analysis and reanalysis of the multisection validity meta-analysis. Paper presented at the annual meeting of the American Educational Research Association, Washington, DC.

  • Cooper, A. M., Sobell, M. B., Sobell, L. C., & Maisto, S. A. (1981). Validity of alcoholics’ self-reports: Duration data. International Journal of Addiction, 16, 401–406.

  • Costin, F. (1968). A graduate course in the teaching of psychology: Description and evaluation. Journal of Teacher Education, 19, 425–432.

  • Costin, F., Greenough, W. T., & Menges, R. J. (1971). Student ratings of college teaching: reliability, validity, and usefulness. Review of Educational Research, 41, 511–535.

  • Creasman, P. A. (n.d.). IDEA Paper No. 52: Considerations in online course design. Manhattan: The IDEA Center.

  • Davis, B. G. (2009). Tools for teaching (2nd ed.). San Francisco: Jossey-Bass.

  • Del Boca, F. K., & Noll, J. A. (2002). Truth or consequences: The validity of self-report data in health services research on addictions. Addiction, 95, 347–360.

  • Dommeyer, C. J., Baum, P., Hanna, R. W., & Chapman, K. S. (2004). Gathering faculty teaching evaluations by in-class and online surveys: Their effects on response rates and evaluations. Assessment and Evaluation in Higher Education, 29, 611–623.

  • Dorn, D. S. (1987). The first day of class: Problems and strategies. Teaching Sociology, 15, 61–72.

  • Dyckhoff, A. L. (2011). Implications for learning analytics tools: A meta-analysis of applied research questions. International Journal of Computer Information Systems and Industrial Management Application, 3, 594–601.

  • Erdle, S., Murray, H. G., & Rushton, J. P. (1985). Personality, classroom behavior, and student ratings of college teaching effectiveness: A path analysis. Journal of Educational Psychology, 77, 394–407.

  • Feeley, T. H. (2002). Evidence of halo effects in student evaluations of communication instruction. Communication Education, 51(3), 225–236.

  • Feldman, K. A. (1976a). Grades and college students’ evaluations of their courses and teachers. Research in Higher Education, 4, 69–111.

  • Feldman, K. A. (1976b). The superior college teacher from the students’ view. Research in Higher Education, 5, 243–288.

  • Feldman, K. A. (1977). Consistency and variability among college students in rating their teachers and courses: A review and analysis. Research in Higher Education, 6, 233–274.

  • Feldman, K. A. (1978). Course characteristics and college students’ ratings of their teachers: What we know and what we don’t. Research in Higher Education, 9, 199–242.

  • Feldman, K. A. (1979). The significance of circumstances for college students’ ratings of their teachers and courses. Research in Higher Education, 10, 149–172.

  • Feldman, K. A. (1983). Seniority and experience of college teachers as related to evaluations they receive from students. Research in Higher Education, 18, 3–124.

  • Feldman, K. A. (1984). Class size and college students’ evaluations of teachers and courses: A closer look. Research in Higher Education, 21, 45–116.

  • Feldman, K. A. (1986). The perceived instructional effectiveness of college teachers as related to their personality and attitudinal characteristics: A review and synthesis. Research in Higher Education, 24, 129–213.

  • Feldman, K. A. (1987). Research productivity and scholarly accomplishment of college teachers as related to their instructional effectiveness: A review and exploration. Research in Higher Education, 26, 227–298.

  • Feldman, K. A. (1988). Effective college teaching from the students’ and faculty’s view: Matched or mismatched priorities. Research in Higher Education, 28, 291–344.

  • Feldman, K. A. (1989a). Instructional effectiveness of college teachers as judged by teachers themselves, current and former students, colleagues, administrators and external (neutral) observers. Research in Higher Education, 30, 137–194.

  • Feldman, K. A. (1989b). The association between student ratings of specific instructional dimensions and student achievement: Refining and extending the synthesis of data from multisection validity studies. Research in Higher Education, 30, 583–645.

  • Feldman, K. A. (1992). College students’ views of male and female college teachers: Part I–Evidence from the social laboratory and experiments. Research in Higher Education, 33, 317–375.

  • Feldman, K. A. (1993). College students’ views of male and female college teachers: Part II–Evidence from students’ evaluations of their classroom teachers. Research in Higher Education, 34, 151–211.

  • Feldman, K. A. (2007). Identifying exemplary teachers and teaching: Evidence from student ratings. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 93–129). Dordrecht: Springer.

  • Forsyth, D. R. (2003). Professor’s guide to teaching: Psychological principles and practices. Washington, DC: American Psychological Association.

  • Franklin, J., & Theall, M. (1992). Disciplinary differences: Instructional goals and activities, measures of student performance, and student ratings of instruction. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

  • Freier, M. C., Bell, R. M., & Ellickson, P. (1991). Do teens tell the truth? The validity of self-reported tobacco use by adolescents. Santa Monica: The Rand Corporation.

  • Frey, P. W. (1976). Validity of student instructional ratings as a function of their timing. Journal of Higher Education, 47, 327–336.

  • Galbraith, C. S., Merrill, G. B., & Kline, D. M. (2012). Are student evaluations of teaching effectiveness valid for measuring student learning outcomes in business related courses? A neural network and Bayesian analyses. Research in Higher Education, 53, 353–374.

  • Grant, D. (2007). Grades as information. Economics of Education Review, 26, 201–214.

  • Greenwald, A. G., & Gillmore, G. M. (1997). No pain, no gain? The importance of measuring course workload in student ratings of instruction. Journal of Educational Psychology, 89, 743–751.

  • Hampton, S. E., & Reiser, R. A. (2004). Effects of a theory-based feedback and consultation process on instruction and learning in college classrooms. Research in Higher Education, 45, 497–527.

  • Hardy, N. (2003). Online ratings: Fact and fiction. In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 31–38). San Francisco: Jossey-Bass.

  • Harrison, P. D., Douglas, D. K., & Burdsal, C. A. (2004). The relative merits of different types of overall evaluations of teaching effectiveness. Research in Higher Education, 45, 311–323.

  • Hativa, N. (2013a). Student ratings of instruction: A practical approach to designing, operating, and reporting. Oron Publications. Nira@me.com

  • Hativa, N. (2013b). Student ratings of instruction: Recognizing effective teaching. Oron Publications. Nira@me.com

  • Hativa, N., & Raviv, A. (1996). University instructors’ ratings profiles: Stability over time, and disciplinary differences. New Directions for Teaching and Learning No. 64. San Francisco: Jossey-Bass.

  • Hativa, N., Barak, R., & Simhi, E. (2001). Exemplary university teachers: Knowledge and beliefs regarding effective teaching dimensions and strategies. Journal of Higher Education, 72, 699–729.

  • Hativa, N., Many, A., & Dayagi, R. (2010). The whys and wherefores of teaching evaluation by their students [Hebrew]. Al Hagova, 9, 30–37.

  • Hobson, S. M., & Talbot, D. M. (2001). Understanding student evaluations: What all faculty should know. College Teaching, 49, 26–31.

  • Hornbeak, J. L. (2009). Teaching methods and course characteristics related to college students’ desire to take a course. K-State electronic theses, dissertations, and reports: 2004. http://hdl.handle.net/2097/1367

  • Howard, G. S., & Maxwell, S. E. (1980). The correlation between student satisfaction and grades: A case of mistaken causation? Journal of Educational Psychology, 72, 810–820.

  • Howard, G. S., & Maxwell, S. E. (1982). Do grades contaminate student evaluations of instruction? Research in Higher Education, 16, 175–188.

  • Hoyt, D. P., & Cashin, W. E. (1977). IDEA technical report no. 1: Development of the IDEA system. Manhattan: Kansas State University, Center for Faculty Evaluation and Development.

  • Hoyt, D. P., & Lee, E. (2002a). Technical report no. 12: Basic data for the revised IDEA system. Manhattan: The IDEA Center.

  • Hoyt, D. P., & Lee, E. J. (2002b). Technical report #13: Disciplinary differences in student ratings. Manhattan: Kansas State University, IDEA Center.

  • Hoyt, D. P., & Pallett, W. H. (n.d.). IDEA paper no. 36, appraising teaching effectiveness: Beyond student ratings. Manhattan: The IDEA Center.

  • Huston, T. (2005). Research report: Race and gender bias in student evaluations of teaching. Retrieved April 16, 2013, from http://sun.skidmore.union.edu/sunNET/ResourceFiles/Huston_Race_Gender_TeachingEvals.pdf

  • Jenkins, S. J., & Downs, E. (2001). Relationship between faculty personality and student evaluation of courses. College Student Journal, 35(4), 636–640.

  • Johnson, T. D. (2003). Online student ratings: Will students respond? In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 49–59). San Francisco: Jossey-Bass.

  • Kember, D., & Leung, D. Y. P. (2011). Disciplinary differences in student ratings of teaching quality. Research in Higher Education, 52, 278–299.

  • Kember, D., McKay, J., Sinclair, K., & Wong, F. K. Y. (2008). A four-category scheme for coding and assessing the level of reflection in written work. Assessment and Evaluation in Higher Education, 33(4), 363–379.

  • Knol, M. (2013). Improving university lectures with feedback and consultation. Academisch Proefschrift. Ipskamp Drukkers, B.V.

  • Krathwohl, D. R. (1998). Methods of educational and social science research. New York: Longman.

  • Kulik, J. A. (2001). Student ratings: Validity, utility, and controversy. In M. Theall, P. C. Abrami, & L. A. Mets (Eds.), The student ratings debate: Are they valid? How can we best use them? (New directions for institutional research, no. 109, pp. 9–25). San Francisco: Jossey-Bass.

  • Kulik, J. A., & McKeachie, W. J. (1975). The evaluation of teachers in higher education. In F. N. Kerlinger (Ed.), Review of research in education (Vol. 3, pp. 210–240). Itasca: F. E. Peacock.

  • Kuncel, N. R., Crede, M., & Thomas, L. L. (2005). The validity of self-reported grade point averages, class ranks, and test scores: A meta-analysis and review of the literature. Review of Educational Research, 75, 63–82.

  • Layne, B. H., DeCristoforo, J. R., & McGinty, D. (1999). Electronic versus traditional student ratings of instruction (electronic version). Research in Higher Education, 40(2), 221–232.

  • Leung, D. Y. P., & Kember, D. (2005). Comparability of data gathered from evaluation questionnaires on paper and through the Internet. Research in Higher Education, 46, 571–591.

  • Leventhal, L., Abrami, P. C., Perry, R. P., & Breen, L. J. (1975). Section selection in multi-section courses: Implications for the validation and use of teacher rating forms. Educational and Psychological Measurement, 35, 885–895.

  • Li, Y. (1993). A comparative study of Asian and American students’ perceptions of faculty teaching effectiveness at Ohio University. Unpublished doctoral dissertation, Ohio University, Athens.

  • Linse, A. R. (2012). Faculty strategies for encouraging their students to fill out the SRTEs. Retrieved April 16, 2013, from http://www.schreyerinstitute.psu.edu/IncreaseSRTERespRate/

  • Marincovich, M. (1999). Using student feedback to improve teaching. In P. Seldin & Associates (Eds.), Changing practices in evaluating teaching: A practical guide to improved faculty performance and promotion/tenure decisions (pp. 45–69). Bolton: Anker.

  • Marks, R. B. (2000). Determinants of student evaluation of global measures of instructor and course value. Journal of Marketing Education, 22(2), 108–119.

  • Marsh, H. W. (1982). Validity of students’ evaluations of college teaching: A multitrait-multimethod analysis. Journal of Educational Psychology, 74, 264–279.

  • Marsh, H. W. (1984). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases, and utility. Journal of Educational Psychology, 76, 707–754.

  • Marsh, H. W. (1987). Students’ evaluations of university teaching: Research findings, methodological issues, and directions for future research. International Journal of Educational Research, 11, 253–388.

  • Marsh, H. W. (2001). Distinguishing between good (useful) and bad workloads on student evaluations of teaching. American Educational Research Journal, 38, 183–212.

  • Marsh, H. W. (2007). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases and usefulness. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 319–383). Dordrecht: Springer.

  • Marsh, H. W., & Bailey, M. (1993). Multidimensional students’ evaluations of teaching effectiveness: A profile analysis. Journal of Higher Education, 64, 1–17.

  • Marsh, H. W., & Dunkin, M. J. (1997). Students’ evaluations of university teaching: A multidimensional perspective. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice (pp. 241–320). New York: Agathon Press.

  • Marsh, H. W., & Hattie, J. (2002). The relation between research productivity and teaching effectiveness. Journal of Higher Education, 73, 603–641.

  • Marsh, H. W., & Hocevar, D. (1991). Students’ evaluations of teaching effectiveness: The stability of mean ratings of the same teachers over a 13-year period. Teaching & Teacher Education, 7, 303–314.

  • Marsh, H. W., & Overall, J. U. (1979). Long-term stability of students’ evaluations: A note on Feldman’s consistency and variability among college students in rating their teachers and courses. Research in Higher Education, 10, 139–147.

  • Marsh, H. W., & Roche, L. A. (1993). The use of students’ evaluations and an individually structured intervention to enhance university teaching effectiveness. American Educational Research Journal, 30, 217–251.

  • Marsh, H. W., & Roche, L. A. (2000). Effects of grading leniency and low workload on students’ evaluations of teaching: Popular myth, bias, validity, and innocent bystanders. Journal of Educational Psychology, 92, 202–22.

  • Marsh, H. W., & Ware, J. E. (1982). Effects of expressiveness, content coverage, and incentive on multidimensional student rating scales: New interpretations of the Dr. Fox effect. Journal of Educational Psychology, 74, 126–134.

  • Marsh, H. W., Overall, J. U., & Kesler, S. P. (1979). Validity of student evaluations of instructional effectiveness: A comparison of faculty self-evaluations and evaluation by their students. Journal of Educational Psychology, 71, 149–160.

  • McCarthy, M. A., Niederjohn, D. M., & Bosack, T. N. (2011). Embedded assessment: A measure of student learning and teaching effectiveness. Teaching of Psychology, 38(2), 78–82.

  • McGhee, D. E., & Lowell, N. (2003). Psychometric properties of student ratings of instruction in online and on-campus courses. In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 39–48). San Francisco: Jossey-Bass.

  • McGowan, W. R., & Graham, C. R. (2009). Factors contributing to improved teaching performance. Innovative Higher Education, 34, 161–171.

  • McKeachie, W. J. (1979). Student ratings of faculty: A reprise. Academe, 65, 384–397.

  • McKeachie, W. J. (1997). Student ratings: The validity of use. American Psychologist, 52, 1218–1225.

  • McPherson, M. A., & Jewell, R. T. (2007). Leveling the playing field: Should student evaluation scores be adjusted? Social Science Quarterly, 88(3), 868–881.

  • Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed.). Old Tappan: Macmillan.

  • Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50, 741–749.

  • Midanik, L. (1988). Validity of self-report alcohol use: A literature review and assessment. British Journal of Addiction, 83, 1019–1030.

  • Murray, H. G. (1983). Low-inference classroom teaching behaviors and student ratings of college teaching effectiveness. Journal of Educational Psychology, 75, 138–149.

  • Murray, H. G. (1997). Effective teaching behaviors in the college classroom. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice (pp. 171–204). New York: Agathon Press.

  • Murray, H. G. (2005, June). Student evaluation of teaching: Has it made a difference? Paper presented at the annual meeting for the Society of Teaching and Learning in Higher Education, Charlottetown, Prince Edward Island.

  • Murray, H. G. (2007). Low-inference teaching behaviors and college teaching effectiveness: Recent developments and controversies. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 145–200). Dordrecht: Springer.

  • Murray, H. G., Rushton, J. P., & Paunonen, S. V. (1990). Teacher personality traits and student instructional ratings in six types of university courses. Journal of Educational Psychology, 82, 250–261.

  • Naftulin, D. H., Ware, J. E., & Donnelly, F. A. (1973). The Doctor Fox lecture: A paradigm of educational seduction. Journal of Medical Education, 48, 630–635.

  • Nilson, L. B. (2013). Time to raise questions about student ratings. In J. E. Groccia & L. Cruz (Eds.), To improve the academy: Resources for faculty, instructional, and organizational development (Vol. 31, pp. 213–227). San Francisco: Jossey-Bass.

  • Ory, J. C., & Ryan, K. (2001). How do student ratings measure up to a new validity framework? In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 5, pp. 27–44). San Francisco: Jossey-Bass.

  • Ory, J. C., Braskamp, L. A., & Pieper, D. M. (1980). Congruency of student evaluative information collected by three methods. Journal of Educational Psychology, 72, 181–185.

  • Overall, J. U., & Marsh, H. W. (1980). Students’ evaluations of instruction: A longitudinal study of their stability. Journal of Educational Psychology, 72, 321–325.

  • Pallett, W. H. (2006). Uses and abuses of student ratings. In P. Seldin (Ed.), Evaluating faculty performance (pp. 50–65). Bolton: Anker Publishing Company, Inc.

  • Patrick, C. L. (2011). Student evaluations of teaching: Effects of the Big Five personality traits, grades and validity hypothesis. Assessment and Evaluation in Higher Education, 36(2), 239–249.

  • Patrick, D. L., Cheadle, A., Thompson, D. C., Diehr, P., Koepsell, T., & Kinne, S. (1994). The validity of self-reported smoking: A review and meta-analysis. American Journal of Public Health, 84(7), 1086–1093.

  • Penny, A. R., & Coe, R. (2004). Effectiveness of consultation on student ratings feedback: Meta-analysis. Review of Educational Research, 74, 215–253.

  • Perry, R. P., & Smart, J. C. (Eds.). (1997). Effective teaching in higher education: Research and practice. New York: Agathon Press.

  • Perry, R. P., & Smart, J. C. (Eds.). (2007). The scholarship of teaching and learning in higher education: An evidence-based perspective. Dordrecht: Springer.

  • Perry, R. P., Niemi, R. R., & Jones, K. (1974). Effect of prior teaching evaluations and lecture presentation on ratings of teaching performance. Journal of Educational Psychology, 66, 851–856.

  • Perry, R. P., Abrami, P. C., & Leventhal, L. (1979a). Educational seduction: The effect of instructor expressiveness and lecture content on student ratings and achievement. Journal of Educational Psychology, 71, 107–116.

  • Perry, R. P., Abrami, P. C., Leventhal, L., & Check, J. (1979b). Instructor reputation: An expectancy relationship involving student ratings and achievement. Journal of Educational Psychology, 71, 776–787.

  • Ray, J. J. (1987). The validity of self-reports. Personality Study and Group Behaviour, 1, 68–70.

  • Renaud, R. D., & Murray, H. G. (1996). Aging, personality, and teaching effectiveness in academic psychologists. Research in Higher Education, 37, 323–340.

  • Renaud, R. D., & Murray, H. G. (2005). Factorial validity of student ratings of instruction. Research in Higher Education, 46, 929–953.

  • Schmelkin, L. P., Spencer, K. J., & Gellman, E. S. (1997). Faculty perspectives on course and teacher evaluations. Research in Higher Education, 38, 575–592.

  • Schulze, E., & Tomal, A. (2006). The chilly classroom: Beyond gender. College Teaching, 54(3), 263–270.

  • Sixbury, G. R., & Cashin, W. E. (1995). IDEA technical report no. 10: Comparative data by academic field. Manhattan: Kansas State University, Center for Faculty Evaluation and Development.

  • Smith, S. B., Smith, S. J., & Boone, R. (2000). Increasing access to teacher preparation: The effectiveness of traditional instructional methods in an online learning environment. Journal of Special Education Technology, 15(2), 37–46.

  • Sudkamp, A., Kaiser, J., & Moller, J. (2012). Accuracy of teachers’ judgments of students’ academic achievement: A meta-analysis. Journal of Educational Psychology, 104, 743–762.

  • Sullivan, A. M., & Skanes, G. R. (1974). Validity of student evaluation of teaching and the characteristics of successful instructors. Journal of Educational Psychology, 66(4), 584–590.

  • Svinicki, M., & McKeachie, W. J. (2011). McKeachie’s teaching tips: Strategies, research, and theory for college and university teachers (13th ed.). Belmont: Wadsworth.

  • The IDEA Center. (2008). Best practices for online response rates. Retrieved April 16, 2013, from http://www.theideacenter.org/OnlineResponseRates

  • Theall, M., & Feldman, K. A. (2007). Commentary and update on Feldman’s (1997) “Identifying exemplary teachers and teaching: Evidence from student ratings”. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 130–143). Dordrecht: Springer.

  • Venette, S., Sellnow, D., & McIntire, K. (2010). Charting new territory: Assessing the online frontier of student ratings of instruction. Assessment and Evaluation in Higher Education, 35, 101–115.

  • Wachtel, H. K. (1998). Student evaluation of college teaching effectiveness: A brief review. Assessment and Evaluation in Higher Education, 23, 191–211.

  • Wang, A. Y., & Newlin, M. H. (2000). Characteristics of students who enroll and succeed in psychology web-based classes. Journal of Educational Psychology, 92, 137–143.

  • Ware, J. E., & Williams, R. G. (1975). The Dr. Fox effect: A study of lecture effectiveness and ratings of instruction. Journal of Medical Education, 50, 149–156.

  • Weimer, M. (2009). Teachers who improved. The Teaching Professor, 23, 2.

  • Weinberg, B. A., Hashimoto, M., & Fleisher, B. M. (2009). Evaluating teaching in higher education. Journal of Economic Education, 40(3), 227–261.

  • Williams, R. G., & Ware, J. E. (1976). Validity of student ratings of instruction under different incentive conditions: A further study of the Dr. Fox effect. Journal of Educational Psychology, 68, 48–56.

  • Williams, R. G., & Ware, J. E. (1977). An extended visit with Dr. Fox: Validity of student ratings of instruction after repeated exposure to a lecturer. American Educational Research Journal, 14, 449–457.

  • Yunker, P. J., & Yunker, J. A. (2003). Are student evaluations of teaching valid? Evidence from an analytical business core course. Journal of Education for Business, 78, 313–317.

  • Zhao, J., & Gallant, D. J. (2012). Student evaluation of instruction in higher education: Exploring issues of validity and reliability. Assessment and Evaluation in Higher Education, 37, 227–235.

Acknowledgment

The authors wish to thank The IDEA Center for granting permission to use portions of IDEA Paper No. 50, Student Ratings of Teaching: A Summary of Research and Literature, in this chapter.

Author information

Corresponding author

Correspondence to Stephen L. Benton, Ph.D.

Copyright information

© 2014 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Benton, S.L., Cashin, W.E. (2014). Student Ratings of Instruction in College and University Courses. In: Paulsen, M. (eds) Higher Education: Handbook of Theory and Research. Higher Education: Handbook of Theory and Research, vol 29. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-8005-6_7
