Student Ratings of Instruction in College and University Courses

Benton, Stephen L.; Cashin, William E.

doi:10.1007/978-94-017-8005-6_7

Stephen L. Benton Ph.D.³ &
William E. Cashin Ph.D.⁴

Part of the book series: Higher Education: Handbook of Theory and Research ((HATR,volume 29))

4938 Accesses
33 Citations

Abstract

Findings from research on student ratings are summarized from the 1970s to 2013. There were 1,874 references, including 564 since 1994, using the ERIC descriptors “student evaluation of teacher performance” and “higher education.” The authors address the validity of self-report data, misconceptions about student ratings, essentials of credible research, and elements of reliability and validity. Evidence of reliability includes consistency, stability, and generalizability of ratings. Validity evidence consists of relations to other variables, including achievement; instructor self-ratings; ratings by administrators, colleagues, alumni, and trained observers; and student-written comments as well as survey multidimensionality. Possible sources of bias are extraneous student, instructor, and course characteristics either unrelated or related to ratings. Few meaningful differences occur between ratings administered online versus on paper and ratings in online versus face-to-face courses. Recommendations are made for the appropriate use of student ratings and for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Content contained in IDEA Paper No. 50 is reprinted by permission of The IDEA Center.
2.
The authors converted various summary statistics reported in the multi-section studies into Pearson product-moment correlations.

References

Abrami, P. C. (2001). Improving judgments about teaching effectiveness using teacher rating forms. In M. Theall, P. C. Abrami, & L. A. Mets (Eds.), The student ratings debate: Are they valid? How can we best use them? (New directions for institutional research, no. 109, pp. 59–87). San Francisco: Jossey-Bass.
Google Scholar
Abrami, P. C., & d’Apollonia, S. (1990). The dimensionality of ratings and their use in personnel decisions. In M. Theall & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice (New directions for teaching and learning, no. 43, pp. 97–111). San Francisco: Jossey-Bass.
Google Scholar
Abrami, P. C., & d’Apollonia, S. (1991). Multidimensional students’ evaluations of teaching effectiveness- generalizability of “N = 1”research: Comments on Marsh (1991). Journal of Educational Psychology, 83, 411–415.
Google Scholar
Abrami, P. C., Leventhal, L., & Perry, R. P. (1982a). Educational seduction. Review of Educational Research, 52, 446–464.
Google Scholar
Abrami, P. C., Perry, R. P., & Leventhal, L. (1982b). The relationship between student personality characteristics, teacher ratings, and student achievement. Journal of Educational Psychology, 74, 111–125.
Google Scholar
Abrami, P. C., d’Apollonia, S., & Rosenfeld, S. (2007). The dimensionality of student ratings of instruction: What we know, do not know, and need to do. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 385–445). Dordrecht: Springer.
Google Scholar
Adams, M. J., & Umbach, P. D. (2012). Nonresponse and online student evaluations of teaching: Understanding the influence of salience, fatigue, and academic environments. Research in Higher Education, 53(5), 576–591.
Google Scholar
Addison, W. E., & Stowell, J. R. (2012). Conducting research on student evaluations of teaching. In M. E. Kite (Ed.), Effective evaluations of teaching: A guide for faculty and administrators (pp. 5–12). Retrieved from the Society for the Teaching of Psychology website:http://teachpsych.org/ebooks/evals2012/index.php
Aleamoni, L. M. (1978). The usefulness of student evaluations in improving college teaching. Instructional Science, 7, 95–105.
Google Scholar
Aleamoni, L. M. (1981). Student ratings of instruction. In J. Millman (Ed.), Handbook of teacher evaluation (pp. 110–145). Beverly Hills: Sage.
Google Scholar
Aleamoni, L. M. (1987). Student rating myths versus research facts. Journal of Personnel Evaluation in Education, 1, 111–119.
Google Scholar
Alhija, F. N. A., & Fresko, B. (2009). Student evaluation of instruction: What can be learned from students’ written comments? Studies in Educational Evaluation, 35, 37–44.
Google Scholar
Anderson, H. M., Cain, J., & Bird, E. (2005). Online student course evaluations: Review of literature and a pilot study. American Journal of Pharmaceutical Education, 69, 34–43.
Google Scholar
Apodaca, P., & Grad, H. (2005). The dimensionality of student ratings of teaching: Integration of uni- and multidimensional models. Studies in Higher Education, 30(6), 723–748.
Google Scholar
d’Apollonia, S., & Abrami, P. C. (1997). Navigating student ratings of instruction. American Psychologist, 52, 1198–1208.
Google Scholar
Arreola, R. A. (2006). Developing a comprehensive faculty evaluation system (2nd ed.). Bolton: Anker Publishing.
Google Scholar
Avery, R. J., Bryant, W. K., & Mathios, A. (2006). Electronic course evaluations: Does an online delivery system influence student evaluations. The Journal of Economic Education, 37, 21–37.
Google Scholar
Babor, T. F., & Del Boca, F. K. (1992). Just the facts: Enhancing measurements of alcohol consumption using self-report methods. In R. Litten & J. Allen (Eds.), Measuring alcohol consumption: Psychosocial and biochemical methods (pp. 3–19). Totowa: Humana Press.
Google Scholar
Babor, T. F., Steinberg, K., Anton, R., & Del Boca, F. K. (2000). Talk is cheap: Measuring drinking outcomes in clinical trials. Journal of Studies on Alcohol, 61(1), 55–63.
Google Scholar
Baird, J. S. (1987). Perceived learning in relation to student evaluation of university instruction. Journal of Educational Psychology, 79, 90–91.
Google Scholar
Basow, S. A. (2000). Best and worst professors: Gender patterns in students’ choices. Sex Roles, 43(5/6), 407–417.
Google Scholar
Beattie, J., Spooner, F., Jordan, L., Algozzine, B., & Spooner, M. (2002). Evaluating instruction in distance learning classes. Teacher Education and Special Education, 25, 124–132.
Google Scholar
Beleche, T., Fairris, D., & Marks, M. (2012). Do course evaluations truly reflect student learning? Evidence from an objectively graded post-test. Economics of Education Review, 31(5), 709–719.
Google Scholar
Benton, S. L., & Cashin, W. E. (2012). Student ratings of teaching: A summary of research and literature (IDEA Paper No. 50). Manhattan: The IDEA Center.
Google Scholar
Benton, S. L., & Pallett, W. H. (2013). Class size matters. Inside Higher Education. http://www.insidehighered.com/views/2013/01/29/essay-importance-class-size-higher-education. 28 Jan 2013.
Benton, S. L., Webster, R., Gross, A. B., & Pallett, W. (2010a). IDEA technical report no. 16: An analysis of IDEA student ratings of instruction using paper versus online survey methods. Manhattan: The IDEA Center.
Google Scholar
Benton, S. L., Webster, R., Gross, A. B., & Pallett, W. (2010b). IDEA technical report no. 15: An analysis of IDEA student ratings of instruction in traditional versus online courses. Manhattan: The IDEA Center.
Google Scholar
Benton, S. L., Duchon, D., & Pallett, W. H. (2011). Validity of self-report student ratings of instruction. Assessment and Evaluation in Higher Education., 38, 377–389.
Google Scholar
Benton, S. L., Brown, R., & Li., D. (2012a). Replication of IDEA Technical Report No. 12 tables: 2012 IDEA student ratings dataset. Unpublished manuscript.
Google Scholar
Benton, S. L., Gross, A., & Brown, R. (2012b, October). Which learning outcomes and teaching methods are instructors really emphasizing in STEM courses? Presentation at American Association of Colleges and Universities Network for Academic Renewal, Kansas City.
Google Scholar
Benton, S. L., Guo, M., Li, D., & Gross, A. (2013, April). Student ratings, teacher standards, and critical thinking skills. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.
Google Scholar
Beran, T., Violato, C., Kline, D., & Frideres, J. (2007). What’s the “use” of student ratings of instruction for administrators? Canadian Journal of Higher Education, 37, 27–43.
Google Scholar
Berk, R. A. (2005). Survey of 12 strategies to measure teaching effectiveness. International Journal of Teaching and Learning in Higher Education, 17, 48–62.
Google Scholar
Biglan, A. (1973). The characteristics of subject matter in different academic areas. Journal of Applied Psychology, 57, 195–203.
Google Scholar
Braskamp, L. A., & Ory, J. C. (1994). Assessing faculty work: Enhancing individual and institutional performance. San Francisco: Jossey-Bass.
Google Scholar
Braskamp, L. A., Ory, J. C., & Pieper, D. M. (1981). Student written comments: Dimensions of instructional quality. Journal of Educational Psychology, 73, 65–70.
Google Scholar
Brener, N. D., Billy, J. O. G., & Grady, W. R. (2003). Assessment of factors affecting the validity of self-reported health-risk behavior among adolescents: Evidence from the scientific literature. Journal of Adolescent Health, 33, 436–457.
Google Scholar
Brinko, K. T. (1990). Instructional consultation with feedback in higher education. Journal of Higher Education, 61, 65–83.
Google Scholar
Brown, M. (2012, July). Learning analytics: Moving from concept to practice. In Educause Learning Initiative Brief. Retrieved from the EDUCAUSE website: http://net.educause.edu/ir/library/pdf/ELIB1203.pdf
Buchert, S., Laws, E. L., Epperson, J. M., & Bregman, N. J. (2008). First impressions and professor reputation: Influence on student evaluations of instruction. Social Psychology of Education, 11, 397–408.
Google Scholar
Burdsal, C. A., & Harrison, P. D. (2008). Further evidence supporting the validity of both a multidimensional profile and an overall evaluation of teaching effectiveness. Assessment and Evaluation in Higher Education, 33, 567–576.
Google Scholar
Burgoon, J. K., Birk, T., & Pfau, M. (1990). Nonverbal behaviors, persuasion, and credibility. Human Communication Research, 17, 140–169.
Google Scholar
Campbell, J., & Bozeman, W. (2007). The value of student ratings: Perceptions of students, teachers, and administrators. Community College of Research and Practice, 32(1), 13–24.
Google Scholar
Campbell, D. T., & Stanley, J. C. (1963). Experimental and quasi-experimental designs for research. Boston: Houghton Mifflin.
Google Scholar
Carrell, S., & West, J. (2010). Does professor quality matter? Evidence from random assignments of students to professors. Journal of Political Economy, 118(3), 409–432.
Google Scholar
Carrier, N. A., Howard, G. S., & Miller, W. G. (1974). Course evaluations: When? Journal of Educational Psychology, 66, 609–613.
Google Scholar
Cashin, W. E (1989). Defining and evaluating college teaching (IDEA Paper No. 21). Manhattan: Kansas State University, Center for Faculty Evaluation and Development.
Google Scholar
Cashin, W. E. (1990). Students do rate different academic fields differently. In M. Theall & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice (New directions for teaching and learning, no. 43, pp. 113–121). San Francisco: Jossey-Bass.
Google Scholar
Cashin, W. E. (1996). Developing an Effective Faculty Evaluation System (IDEA Paper No. 33). Manhattan: Kansas State University, Center for Faculty Evaluation and Development.
Google Scholar
Cashin, W. E. (2003). Evaluating college and university teaching: Reflections of a practitioner. In J. C. Smart (Ed.), Higher education: Handbook of theory and research (pp. 531–593). Dordrecht: Kluwer Academic Publishers.
Google Scholar
Cashin, W. E., & Downey, R. G. (1992). Using global student ratings for summative evaluation. Journal of Educational Psychology, 84, 563–572.
Google Scholar
Centra, J. A. (1976). The influence of different directions on student ratings of instruction. Journal of Educational Measurement, 13, 277–282.
Google Scholar
Centra, J. A. (1979). Determining faculty effectiveness: Assessing teaching, research, and service for personnel decisions and improvement. San Francisco: Jossey-Bass.
Google Scholar
Centra, J. A. (1993). Reflective faculty evaluation: Enhancing teaching and determining faculty effectiveness. San Francisco: Jossey-Bass.
Google Scholar
Centra, J. A. (2003). Will teachers receive higher student evaluations by giving higher grades and less course work? Research in Higher Education, 44, 495–518.
Google Scholar
Centra, J. A. (2009). Differences in responses to the student instructional report: Is it bias? Princeton: Educational Testing Service.
Google Scholar
Centra, J. A., & Gaubatz, N. B. (2000). Is there a gender bias in student evaluations of teaching? Journal of Higher Education, 70, 17–33.
Google Scholar
Clayson, D. E. (2009). Student evaluation of teaching: Are they related to what students learn? Journal of Marketing Education, 31, 16–30.
Google Scholar
Clayson, D. E., & Sheffet, M. J. (2006). Personality and the student evaluation of teaching. Journal of Marketing Education, 28(2), 149–160.
Google Scholar
Cohen, P. A. (1980). Effectiveness of student-rating feedback for improving college instruction: A meta-analysis of findings. Research in Higher Education, 13, 321–341.
Google Scholar
Cohen, P. A. (1981). Student ratings of instruction and student achievement: A meta-analysis of multisection validity studies. Review of Educational Research, 51, 281–309.
Google Scholar
Cohen, P. A. (1987, April). A critical analysis and reanalysis of the multisection validity meta-analysis. Paper presented at the annual meeting of the American Educational Research Association, Washington, DC.
Google Scholar
Cooper, A. M., Sobell, M. B., Sobell, L. C., & Maisto, S. A. (1981). Validity of alcoholics’ self-reports: Duration data. International Journal of Addiction, 16, 401–406.
Google Scholar
Costin, F. (1968). A graduate course in the teaching of psychology: Description and evaluation. Journal of Teacher Education, 19, 425–432.
Google Scholar
Costin, F., Greenough, W. T., & Menges, R. J. (1971). Student ratings of college teaching: reliability, validity, and usefulness. Review of Educational Research, 41, 511–535.
Google Scholar
Creasman, P. A. (n.d.). IDEA Paper No. 52: Considerations in online course design. Manhattan: The IDEA Center.
Google Scholar
Davis, B. G. (2009). Tools for teaching (2nd ed.). San Francisco: Jossey-Bass.
Google Scholar
Del Boca, F. K., & Noll, J. A. (2002). Truth or consequences: The validity of self-report data in health services research on addictions. Addiction, 95, 347–360.
Google Scholar
Dommeyer, C. J., Baum, P., Hanna, R. W., & Chapman, K. S. (2004). Gathering faculty teaching evaluations by in-class and online surveys: Their effects on response rates and evaluations. Assessment and Evaluation in Higher Education, 29, 611–623.
Google Scholar
Dorn, D. S. (1987). The first day of class: Problems and strategies. Teaching Sociology, 15, 61–72.
Google Scholar
Dyckhoff, A. L. (2011). Implications for learning analytics tools: A meta-analysis of applied research questions. International Journal of Computer Information Systems and Industrial Management Application, 3, 594–601.
Google Scholar
Erdle, S., Murray, H. G., & Rushton, J. P. (1985). Personality, classroom behavior, and student ratings of college teaching effectiveness: A path analysis. Journal of Educational Psychology, 77, 394–407.
Google Scholar
Feeley, T. H. (2002). Evidence of halo effects in student evaluations of communication instruction. Communication Education, 51(3), 225–236.
Google Scholar
Feldman, K. A. (1976a). Grades and college students’ evaluations of their courses and teachers. Research in Higher Education, 4, 69–111.
Google Scholar
Feldman, K. A. (1976b). The superior college teacher from the students’ view. Research in Higher Education, 5, 243–288.
Google Scholar
Feldman, K. A. (1977). Consistency and variability among college students in rating their teachers and courses: A review and analysis. Research in Higher Education, 6, 233–274.
Google Scholar
Feldman, K. A. (1978). Course characteristics and college students’ ratings of their teachers: What we know and what we don’t. Research in Higher Education, 9, 199–242.
Google Scholar
Feldman, K. A. (1979). The significance of circumstances for college students’ ratings of their teachers and courses. Research in Higher Education, 10, 149–172.
Google Scholar
Feldman, K. A. (1983). Seniority and experience of college teachers as related to evaluations they receive from students. Research in Higher Education, 18, 3–124.
Google Scholar
Feldman, K. A. (1984). Class size and college students’ evaluations of teachers and courses: A closer look. Research in Higher Education, 21, 45–116.
Google Scholar
Feldman, K. A. (1986). The perceived instructional effectiveness of college teachers as related to their personality and attitudinal characteristics: A review and synthesis. Research in Higher Education, 24, 129–213.
Google Scholar
Feldman, K. A. (1987). Research productivity and scholarly accomplishment of college teachers as related to their instructional effectiveness: A review and exploration. Research in Higher Education, 26, 227–298.
Google Scholar
Feldman, K. A. (1988). Effective college teaching from the students’ and faculty’s view: Matched or mismatched priorities. Research in Higher Education, 28, 291–344.
Google Scholar
Feldman, K. A. (1989a). Instructional effectiveness of college teachers as judged by teachers themselves, current and former students, colleagues, administrators and external (neutral) observers. Research in Higher Education, 30, 137–194.
Google Scholar
Feldman, K. A. (1989b). The association between student ratings of specific instructional dimensions and student achievement: Refining and extending the synthesis of data from multisection validity studies. Research in Higher Education, 30, 583–645.
Google Scholar
Feldman, K. A. (1992). College students’ views of male and female college teachers: Part I–Evidence from the social laboratory and experiments. Research in Higher Education, 33, 317–375.
Google Scholar
Feldman, K. A. (1993). College students’ views of male and female college teachers: Part II–Evidence from students’ evaluations of their classroom teachers. Research in Higher Education, 34, 151–211.
Google Scholar
Feldman, K. A. (2007). Identifying exemplary teachers and teaching: Evidence from student ratings. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 93–129). Dordrecht: Springer.
Google Scholar
Forsyth, D. R. (2003). Professor’s guide to teaching: Psychological principles and practices. Washington, DC: American Psychological Association.
Google Scholar
Franklin, J., & Theall, M. (1992). Disciplinary differences: Instructional goals and activities, measures of student performance, and student ratings of instruction. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.
Google Scholar
Freier, M. C., Bell, R. M., & Ellickson, P. (1991). Do teens tell the truth? The validity of self-reported tobacco use by adolescents. Santa Monica: The Rand Corporation.
Google Scholar
Frey, P. W. (1976). Validity of student instructional ratings as a function of their timing. Journal of Higher Education, 47, 327–336.
Google Scholar
Galbraith, C. S., Merrill, G. B., & Kline, D. M. (2012). Are student evaluations of teaching effectiveness valid for measuring student learning outcomes in business related courses? A neural network and Bayesian analyses. Research in Higher Education, 53, 353–374.
Google Scholar
Grant, D. (2007). Grades as information. Economics of Education Review, 26, 201–214.
Google Scholar
Greenwald, A. G., & Gillmore, G. M. (1997). No pain, no gain? The importance of measuring course workload in student ratings of instruction. Journal of Educational Psychology, 89, 743–751.
Google Scholar
Hampton, S. E., & Reiser, R. A. (2004). Effects of a theory-based feedback and consultation process on instruction and learning in college classrooms. Research in Higher Education, 45, 497–527.
Google Scholar
Hardy, N. (2003). Online ratings: Fact and fiction. In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 31–38). San Francisco: Jossey-Bass.
Google Scholar
Harrison, P. D., Douglas, D. K., & Burdsal, C. A. (2004). The relative merits of different types of overall evaluations of teaching effectiveness. Research in Higher Education, 45, 311–323.
Google Scholar
Hativa, N. (2013a). Student ratings of instruction: A practical approach to designing, operating, and reporting. Oron Publications. Nira@me.com
Google Scholar
Hativa, N. (2013b). Student ratings of instruction: Recognizing effective teaching. Oron Publications. Nira@me.com
Google Scholar
Hativa, N., & Raviv, A. (1996). University instructors’ ratings profiles: Stability over time, and disciplinary differences. New Directions for Teaching and Learning No. 64. San Francisco: Jossey-Bass.
Google Scholar
Hativa, N., Barak, R., & Simhi, E. (2001). Exemplary university teachers: Knowledge and beliefs regarding effective teaching dimensions and strategies. Journal of Higher Education, 72, 699–729.
Google Scholar
Hativa, N., Many, A., & Dayagi, R. (2010). The whys and wherefores of teaching evaluation by their students [Hebrew]. Al Hagova, 9, 30–37.
Google Scholar
Hobson, S. M., & Talbot, D. M. (2001). Understanding student evaluations: What all faculty should know. College Teaching, 49, 26–31.
Google Scholar
Hornbeak, J. L. (2009). Teaching methods and course characteristics related to college students’ desire to take a course. K-State electronic theses, dissertations, and reports: 2004. http://hdl.handle.net/2097/1367
Howard, G. S., & Maxwell, S. E. (1980). The correlation between student satisfaction and grades: A case of mistaken causation? Journal of Educational Psychology, 72, 810–820.
Google Scholar
Howard, G. S., & Maxwell, S. E. (1982). Do grades contaminate student evaluations of instruction? Research in Higher Education, 16, 175–188.
Google Scholar
Hoyt, D. P., & Cashin, W. E. (1977). IDEA technical report no. 1: Development of the IDEA system. Manhattan: Kansas State University, Center for Faculty Evaluation and Development.
Google Scholar
Hoyt, D. P., & Lee, E. (2002a). Technical report no. 12: Basic data for the revised IDEA system. Manhattan: The IDEA Center.
Google Scholar
Hoyt, D. P., & Lee, E. J. (2002b). Technical report #13: Disciplinary differences in student ratings. Manhattan: Kansas State University, IDEA Center.
Google Scholar
Hoyt, D. P., & Pallett, W. H. (n.d.). IDEA paper no. 36, appraising teaching effectiveness: Beyond student ratings. Manhattan: The IDEA Center.
Google Scholar
Huston, T. (2005). Research report: Race and gender bias in student evaluations of teaching. Retrieved April 16, 2013, from http://sun.skidmore.union.edu/sunNET/ResourceFiles/Huston_Race_Gender_TeachingEvals.pdf
Jenkins, S. J., & Downs, E. (2001). Relationship between faculty personality and student evaluation of courses. College Student Journal, 35(4), 636–640.
Google Scholar
Johnson, T. D. (2003). Online student ratings: Will students respond? In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 49–59). San Francisco: Jossey-Bass.
Google Scholar
Kember, D., & Leung, D. Y. P. (2011). Disciplinary differences in student ratings of teaching quality. Research in Higher Education, 52, 278–299.
Google Scholar
Kember, D., McKay, J., Sinclair, K., & Wong, F. K. Y. (2008). A four-category scheme for coding and assessing the level of reflection in written work. Assessment and Evaluation in Higher Education, 33(4), 363–379.
Google Scholar
Knol, M. (2013). Improving university lectures with feedback and consultation. Academisch Proefschrift. Ipskamp Drukkers, B.V.
Google Scholar
Krathwohl, D. R. (1998). Methods of educational and social science research. New York: Longman.
Google Scholar
Kulik, J. A. (2001). Student ratings: Validity, utility, and controversy. In M. Theall, P. C. Abrami, & L. A. Mets (Eds.), The student ratings debate: Are they valid? How can we best use them? (New directions for institutional research, no. 109, pp. 9–25). San Francisco: Jossey-Bass.
Google Scholar
Kulik, J. A., & McKeachie, W. J. (1975). The evaluation of teachers in higher education. In F. N. Kerlinger (Ed.), Review of research in education (Vol. 3, pp. 210–240). Itasca: F. E. Peacock.
Google Scholar
Kuncel, N. R., Crede, M., & Thomas, L. L. (2005). The validity of self-reported grade point averages, class ranks, and test scores: A meta-analysis and review of the literature. Review of Educational Research, 75, 63–82.
Google Scholar
Layne, B. H., DeCristoforo, J. R., & McGinty, D. (1999). Electronic versus traditional student ratings of instruction (electronic version). Research in Higher Education, 40(2), 221–232.
Google Scholar
Leung, D. Y. P., & Kember, D. (2005). Comparability of data gathered from evaluation questionnaires on paper through the Internet. Research in Higher Education, 46, 571–591.
Google Scholar
Leventhal, L., Abrami, P. C., Perry, R. P., & Breen, L. J. (1975). Section selection in multi-section courses: Implications for the validation and use of teacher rating forms. Educational and Psychological Measurement, 35, 885–895.
Google Scholar
Li, Y. (1993). A comparative study of Asian and American students’ perceptions of faculty teaching effectiveness at Ohio University. Unpublished doctoral dissertation, Ohio University, Athens.
Google Scholar
Linse, A. R. (2012). Faculty strategies for encouraging their students to fill out the SRTEs. Retrieved April 16, 2013, from http://www.schreyerinstitute.psu.edu/IncreaseSRTERespRate/
Marincovich, M. (1999). Using student feedback to improve teaching. In P. Seldin & Associates (Eds.), Changing practices in evaluating teaching: A practical guide to improved faculty performance and promotion/tenure decisions (pp. 45–69). Bolton: Anker.
Google Scholar
Marks, R. B. (2000). Determinants of student evaluation of global measures of instructor and course value. Journal of Marketing Education, 22(2), 108–119.
Google Scholar
Marsh, H. W. (1982). Validity of students’ evaluations of college teaching: A multitrait-multimethod analysis. Journal of Educational Psychology, 74, 264–279.1.
Google Scholar
Marsh, H. W. (1984). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases, and utility. Journal of Educational Psychology, 76, 707–754.
Google Scholar
Marsh, H. W. (1987). Students’ evaluations of university teaching: Research findings, methodological issues, and directions for future research. International Journal of Educational Research, 11, 253–388.
Google Scholar
Marsh, H. W. (2001). Distinguishing between good (useful) and bad workloads on student evaluations of teaching. American Educational Research Journal, 38, 183–212.
Google Scholar
Marsh, H. W. (2007). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases and usefulness. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 319–383). Dordrecht: Springer.
Google Scholar
Marsh, H. W., & Bailey, M. (1993). Multidimensional students’ evaluations of teaching effectiveness: A profile analysis. Journal of Higher Education, 64, 1–17.
Google Scholar
Marsh, H. W., & Dunkin, M. J. (1997). Students’ evaluations of university teaching: A multidimensional perspective. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice (pp. 241–320). New York: Agathon Press.
Google Scholar
Marsh, H. W., & Hattie, J. (2002). The relation between research productivity and teaching effectiveness. Journal of Higher Education, 73, 603–641.
Google Scholar
Marsh, H. W., & Hocevar, D. (1991). Students’ evaluations of teaching effectiveness: The stability of mean ratings of the same teachers over a 13-year period. Teaching & Teacher Education, 7, 303–314.
Google Scholar
Marsh, H. W., & Overall, J. U. (1979). Long-term stability of students’ evaluations: A note on Feldman’s consistency and variability among college students in rating their teachers and courses. Research in Higher Education, 10, 139–147.
Google Scholar
Marsh, H. W., & Roche, L. A. (1993). The use of students’ evaluations and an individually structured intervention to enhance university teaching effectiveness. American Educational Research Journal, 30, 217–251.
Google Scholar
Marsh, H. W., & Roche, L. A. (2000). Effects of grading leniency and low workload on students’ evaluations of teaching: Popular myth, bias, validity, and innocent bystanders. Journal of Educational Psychology, 92, 202–22.
Google Scholar
Marsh, H. W., & Ware, J. E. (1982). Effects of expressiveness, content coverage, and incentive on multidimensional student rating scales: New interpretations of the Dr. Fox effect. Journal of Educational Psychology, 74, 126–134.
Google Scholar
Marsh, H. W., Overall, J. U., & Kesler, S. P. (1979). Validity of student evaluations of instructional effectiveness: A comparison of faculty self-evaluations and evaluation by their students. Journal of Educational Psychology, 71, 149–160.
Google Scholar
McCarthy, M. A., Niederjohn, D. M., & Bosack, T. N. (2011). Embedded assessment: A measure of student learning and teaching effectiveness. Teaching of Psychology, 38(2), 78–82.
Google Scholar
McGhee, D. E., & Lowell, N. (2003). Psychometric properties of student ratings of instruction in online and on-campus courses. In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 96, pp. 39–48). San Francisco: Jossey-Bass.
Google Scholar
McGowan, W. R., & Graham, C. R. (2009). Factors contributing to improved teaching performance. Innovative Higher Education, 34, 161–171.
Google Scholar
McKeachie, W. J. (1979). Student ratings of faculty: A reprise. Academe, 65, 384–397.
Google Scholar
McKeachie, W. J. (1997). Student ratings: The validity of use. American Psychologist, 52, 1218–1225.
Google Scholar
McPherson, M. A., & Todd Jewell, R. (2007). Leveling the playing field: Should student evaluation scores be adjusted? Social Science Quarterly, 88(3), 868–881.
Google Scholar
Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed.). Old Tappan: Macmillan.
Google Scholar
Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 1995(50), 741–749.
Google Scholar
Midanik, L. (1988). Validity of self-report alcohol use: A literature review and assessment. British Journal of Addictions, 83, 1019–1030.
Google Scholar
Murray, H. G. (1983). Low-inference classroom teaching behaviors and student ratings of college teaching effectiveness. Journal of Educational Psychology, 75, 138–149.
Google Scholar
Murray, H. G. (1997). Effective teaching behaviors in the college classroom. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice (pp. 171–204). New York: Agathon Press.
Google Scholar
Murray, H. G. (2005, June). Student evaluation of teaching: Has it made a difference? Paper presented at the annual meeting for the Society of Teaching and Learning in Higher Education, Charlottetown, Prince Edward Island.
Google Scholar
Murray, H. G. (2007). Low-inference teaching behaviors and college teaching effectiveness: Recent developments and controversies. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 145–200). Dordrecht: Springer.
Google Scholar
Murray, H. G., Rushton, J. P., & Paunonen, S. V. (1990). Teacher personality traits and student instructional ratings in six types of university courses. Journal of Educational Psychology, 82, 250–261.
Google Scholar
Naftulin, D. H., Ware, J. E., & Donnelly, F. A. (1973). The Doctor Fox lecture: A paradigm of educational seduction. Journal of Medical Education, 48, 630–635.
Google Scholar
Nilson, L. B. (2013). Time to raise questions about student ratings. In J. E. Groccia & L. Cruz (Eds.), To improve the academy: Resources for faculty, instructional, and organizational development (Vol. 31, pp. 213-–227). San Francisco: Jossey-Bass.
Google Scholar
Ory, J. C., & Ryan, K. (2001). How do student ratings measure up to a new validity framework? In T. D. Johnson & D. L. Sorenson (Eds.), Online student ratings of instruction (New directions for teaching and learning, no. 5, pp. 27–44). San Francisco: Jossey-Bass.
Google Scholar
Ory, J. C., Braskamp, L. A., & Pieper, D. M. (1980). Congruency of student evaluative information collected by three methods. Journal of Educational Psychology, 72, 181–185.
Google Scholar
Overall, J. U., & Marsh, H. W. (1980). Students’ evaluations of instruction: A longitudinal study of their stability. Journal of Educational Psychology, 72, 321–325.
Google Scholar
Pallett, W. H. (2006). Uses and abuses of student ratings. In P. Seldin (Ed.), Evaluating faculty performance (pp. 50–65). Bolton: Anker Publishing Company, Inc.
Google Scholar
Patrick, C. L. (2011). Student evaluations of teaching: Effects of the Big Five personality traits, grades and validity hypothesis. Assessment and Evaluation in Higher Education, 36(2), 239–249.
Google Scholar
Patrick, D. L., Cheadle, A., Thompson, D. C., Diehr, P., Koepsell, T., & Kinne, S. (1994). The validity of self-reported smoking: A review and meta-analysis. American Journal of Public Health, 84(7), 1086–1093.
Google Scholar
Penny, A. R., & Coe, R. (2004). Effectiveness of consultation on student ratings feedback: Meta-analysis. Review of Educational Research, 74, 215–253.
Google Scholar
Perry, R. P., & Smart, J. C. (Eds.). (1997). Effective teaching in higher education: Research and practice. New York: Agathon Press.
Google Scholar
Perry, R. P., & Smart, J. C. (Eds.). (2007). The Scholarship of teaching and learning in higher education: An evidence-based perspective. Dordrecht: Springer.
Google Scholar
Perry, R. P., Niemi, R. R., & Jones, K. (1974). Effect of prior teaching evaluations and lecture presentation on ratings of teaching performance. Journal of Educational Psychology, 66, 851–856.
Google Scholar
Perry, R. P., Abrami, P. C., & Leventhal, L. (1979a). Educational seduction: The effect of instructor expressiveness and lecture content on student ratings and achievement. Journal of Educational Psychology, 71, 107–116.
Google Scholar
Perry, R. P., Abrami, P. C., Leventhal, L., & Check, J. (1979b). Instructor reputation: An expectancy relationship involving student ratings and achievement. Journal of Educational Psychology, 71, 776–787.
Google Scholar
Ray, J. J. (1987). The validity of self-reports. Personality Study and Group Behaviour, 1, 68–70.
Google Scholar
Renaud, R. D., & Murray, H. G. (1996). Aging, personality, and teaching effectiveness in academic psychologists. Research in Higher Education, 37, 323–340.
Google Scholar
Renaud, R. D., & Murray, H. G. (2005). Factorial validity of student ratings of instruction. Research in Higher Education, 46, 929–953.
Google Scholar
Schmelkin, L. P., Spencer, K. J., & Gellman, E. S. (1997). Faculty perspectives on course and teacher evaluations. Research in Higher Education, 38, 575–592.
Google Scholar
Schulze, E., & Tomal, A. (2006). The chilly classroom: Beyond gender. College Teaching, 54(3), 263–270.
Google Scholar
Sixbury, G. R., & Cashin, W. E. (1995). IDEA technical report no. 10: Comparative data by academic field. Manhattan: Kansas State University, Center for Faculty Evaluation and Development.
Google Scholar
Smith, S. B., Smith, S. J., & Boone, R. (2000). Increasing access to teacher preparation: The effectiveness of traditional instructional methods in an online learning environment. Journal of Special Education Technology, 15(2), 37–46.
Google Scholar
Sudkamp, A., Kaiser, J., & Moller, J. (2012). Accuracy of teachers’ judgments of students’ academic achievement: A meta-analysis. Journal of Educational Psychology, 104, 743–762.
Google Scholar
Sullivan, A. M., & Skanes, G. R. (1974). Validity of student evaluation of teaching and the characteristics of successful instructors. Journal of Educational Psychology, 66(4), 584–590.
Google Scholar
Svinicki, M., & McKeachie, W. J. (2011). McKeachie’s teaching tips: Strategies, research, and theory for college and university teachers (13th ed.). Belmont: Wadsworth.
Google Scholar
The IDEA Center. (2008). Best practices for online response rates. Retrieved April 16, 2013, from http://www.theideacenter.org/OnlineResponseRates
Theall, M., & Feldman, K. A. (2007). Commentary and update on Feldman’s (1997) “Identifying exemplary teachers and teaching: Evidence from student ratings”. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 130–143). Dordrecht: Springer.
Google Scholar
Venette, S., Sellnow, D., & McIntire, K. (2010). Charting new territory: Assessing the online frontier of student ratings of instruction. Assessment and Evaluation in Higher Education, 35, 101–115.
Google Scholar
Wachtel, H. K. (1998). Student evaluation of college teaching effectiveness: A brief review. Assessment and Evaluation in Higher Education, 23, 191–211.
Google Scholar
Wang, A. Y., & Newlin, M. H. (2000). Characteristics of students who enroll and succeed in psychology web-based classes. Journal of Educational Psychology, 92, 137–143.
Google Scholar
Ware, J. E., & Williams, R. G. (1975). The Dr. Fox effect: A study of lecture effectiveness and ratings of instruction. Journal of Medical Education, 50, 149–156.
Google Scholar
Weimer, M. (2009). Teachers who improved. The Teaching Professor, 23, 2.
Google Scholar
Weinberg, B. A., Hashimoto, M., & Fleisher, B. M. (2009). Evaluating teaching in higher education. Journal of Economic Education, 40(3), 227–261.
Google Scholar
Williams, R. G., & Ware, J. E. (1976). Validity of student ratings of instruction under different incentive conditions: A further study of the Dr. Fox effect. Journal of Educational Psychology, 68, 48–56.
Google Scholar
Williams, R. G., & Ware, J. E. (1977). An extended visit with Dr. Fox: Validity of student ratings of instruction after repeated exposure to a lecturer. American Educational Research Journal, 14, 449–457.
Google Scholar
Yunker, P. J., & Yunker, J. A. (2003). Are student evaluations of teaching valid? Evidence from an analytical business core course. Journal of Education for Business, 78, 313–317.
Google Scholar
Zhao, J., & Gallant, D. J. (2012). Student evaluation of instruction in higher education: Exploring issues of validity and reliability. Assessment and Evaluation in Higher Education, 37, 227–235.
Google Scholar

Download references

Acknowledgment

The authors wish to thank The IDEA Center for granting permission to use chapter portions of IDEA Paper No. 50, Student Ratings of Teaching: A Summary of Research and Literature.

Author information

Authors and Affiliations

The IDEA Center, 301 South Fourth St., Suite 200, Manhattan, KS, 66502, USA
Stephen L. Benton Ph.D.
Kansas State University, Manhattan, KS, USA
William E. Cashin Ph.D.

Authors

Stephen L. Benton Ph.D.
View author publications
You can also search for this author in PubMed Google Scholar
William E. Cashin Ph.D.
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Stephen L. Benton Ph.D. .

Editor information

Editors and Affiliations

Department of Educational Policy and Leadership Studies, University of Iowa, Iowa City, Iowa, USA
Michael B. Paulsen

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Benton, S.L., Cashin, W.E. (2014). Student Ratings of Instruction in College and University Courses. In: Paulsen, M. (eds) Higher Education: Handbook of Theory and Research. Higher Education: Handbook of Theory and Research, vol 29. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-8005-6_7

Download citation

DOI: https://doi.org/10.1007/978-94-017-8005-6_7
Published: 30 December 2013
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-017-8004-9
Online ISBN: 978-94-017-8005-6
eBook Packages: Humanities, Social Sciences and LawEducation (R0)

Publish with us

Policies and ethics