Bibliography on SET Prepared by UD SET Committee (2013)

Abel, M. H., & Meltzer, A. L. (2007).  Student ratings of a male and female professors’ lecture on sex discrimination in the workforce. Sex Roles, 57, 173-180.

Abrami, P. C. (2001). Improving judgments about teaching effectiveness using teacher rating forms. In M. Theall, P. C. Abrami, & L. A. Mets (Eds.), The student ratings debate: Are they valid? How can we best use them? New Directions for Institutional Research, No. 109 (pp. 59-87). San Francisco: Jossey-Bass.

Abrami, P. C., Dickens, W. J., Perry, R. P., & Leventhal, L. (1980). Do teacher standards for assignment grades affect student evaluations of instruction? Journal of Educational Psychology, 72, 107-118.

Abrami, P. C., d’Apollonia, S., & Rosenfield, S. (2007). The dimensionality of student ratings of instruction: What we know and what we do not. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 385-446).  Dordecht, The Netherlands: Springer.

Abrami, P. C., Rosenfield, S., & Dedic, H. (2007). The dimensionality of student ratings of instruction: An update on what we know, do not know, and need to do.  In R. P. Perry & J. C. Smart, (Eds.) The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 446-456). New York: Springer.

Acker, J. R. (2003). Class acts: Outstanding college teachers and the difference they make.  Criminal Justice Review, 28, 215-231.

Albanese, M. A. (1991). The validity of lecturer ratings by students and trained observers.  Academic Medicine, 66, 26-28.

Aleamoni, L. M. (1981). Student ratings of instruction. In J. Millman (Ed.), Handbook of teacher evaluation (pp. 110-145). Beverly Hill: Sage.

Aleamoni, L. M. (1997). Issues in linking instructional improvement research to faculty development in higher education. Journal of Personnel Evaluation in Education, 11, 31-37.

Aleamoni, L.M., & Hexner, P. Z. (1980). A review of the research on student evaluation and a report on the effect of different sets of instructions on student course and instructor evaluation. Instructional Science, 9, 67-84.

Aleamoni, L. M., & Thomas G. S. (1980). Differential relationships of student, instructor, and course characteristics to general and specific items on a course evaluation questionnaire. Teaching of Psychology, 7, 233-235.

Algozzine, B., Beattie, J., Bray, M., Flowers, C., Gretes, J., Howley, L., Mohanty, G., & Spooner, F. (2004). Student evaluation of college teaching: A practice in search of principles. College Teaching, 52, 134-141.

Ambady, N. & Rosenthal, R. (1992).Half a minute: Predicting teacher evaluations from thin slices of nonverbal behavior and physical attractiveness. Journal of Personality and Social Psychology, 64, 432-441.

Amin, M. E. (1994). Gender as a discriminating factor in the evaluation of teaching. Assessment and Evaluation in Higher Education, 16, 260-278.

Arreola, R. A. (2000). Developing a comprehensive faculty evaluation system (2nd ed.). Bolton, MA: Anker.

Atamian, R., & Ganguli, G. (1993). Teacher popularity and teaching effectiveness: Viewpoint of accounting students. Journal of Education for Business, 68, 163-169,

Aulls, M. W. (2004). Students’ experiences with good and poor university courses. Educational Research and Evaluation, 10, 303-335.

Bachen, C. M., McLoughlin, M. M., & Garcia, S. S. (1999). Assessing the role of gender in college students’ evaluations of faculty. Communication Education, 48, 193-210.

Baird, J. S., Jr. (1987). Perceived learning in relation to student evaluation to university instruction. Journal of Educational Psychology, 79, 90-91.

Basow, S. A. (2000). Best and worst professors: Gender patterns in students’ choices. Sex Roles, 43, 407-417.

Basow, S. A., & Distenfeld, M. A.. (1985). Teacher expressiveness: More important for male teacher than female teachers? Journal of Educational Psychology, 77, 45-52.

Basow, S. A., Phelan, J. E., & Capotosto, L. (2006). Gender patterns in college students’ choices of their best and worst professors. Psychology of Women Quarterly, 30, 25-35.

Basow, S. A., & Silberg, N. T. (1987). Student evaluations of college professors: Are female and male professors rated differently? Journal of Educational Psychology, 79, 308-314.

Bendig, A. W. (1952). A preliminary study of the effect of academic level, sex, and course variables on student rating of psychology instructors. Journal of Psychology, 34, 2-126.

Benton, S. E., & Scott, O. (1976). A comparison of criterion validity of two types of student response inventories for appraising instruction. Paper presented at the annual meeting of the National Council on Measurement in Education.

Benton, S. L., & Cahin, W. E. (2012 ). IDEA paper no. 50: Student ratings of teaching: A Summary of research and literature. Manhattan: Kansas State University Center for Faculty Evaluation and Development.

Blackburn, R. T., & Lawrence, J. H. (1986). Aging and the quality of faculty job-performance.  Review of Educational Research, 56, 265-290.

Blunt, A. (1991). The effects of anonymity and manipulated grades on student ratings of instructors. Community College Review, 18, 48-54.

Bolton, B., Bonge, D., & Marr, J. (1979). Ratings of instruction, examination performance, and subsequent enrollment in psychology courses. Teaching of Psychology, 6, 82-85.

Boretz, E. (2004). Grade inflation and the myth of student consumerism. College Teaching, 52, 42-52.

Boring, A, Ottoboni, K, & Stark P. (2016). Student evaluations of teaching (mostly) do not measure teaching effectiveness. ScienceOpen.com. 

Brandenburg, D. C., Slinde, J. A., & Batista, E. E. (1977). Student ratings of instruction: Validity and normative interpretations. Research in Higher Education, 7, 67-78.

Braskamp, L. A., Brandenberg, D. C., & Ory, J. C. (1984). Evaluating teaching effectiveness: A practical guide. Beverly Hills: Sage.

Braskamp, L. A., Caulley, D., & Costin, F. (1979). Studetn ratings and instructor self-ratings and their relationships to student achievement. American Educational Research Journal, 16, 295-306.

Braskamp, L. A., & Ory, J. C. (1994). Assessing faculty work: Enhancing individual and institutional performance. San Francisco: Jossey-Bass.

Brewer, R. E., & Brewer, M. B. (1970). Relative importance of ten qualities for college teaching determined by pair comparisons. Journal of Educational Research, 63, 243-246.

Bridges, C. M., Ware, W. B., Brown, B. B., & Greenwood, G. (1971). Characteristics of best and worst college teachers. Science Education, 55, 545-553.

Brown, D. L. (1976). Faculty ratings and student grades: A university-wide multiple regression analysis. Journal of Educational Psychology, 68, 573-578.

Brown, W., & Tomlin, J. (1996). Best and worst university teachers: The opinions of undergraduate students. College Student Journal, 30, 431-434.

Bryson, R. (1974). Teacher evaluations and student learning: A reexamination. Journal of Educational Research, 68, 11-14.

Carrier, N. A., Howard, G. S., & Miller, W. G. (1974). Course evaluations: When? Journal of Educational Psychology, 66, 609-613.

Carson, B. H. (1996). Thirty years of stories: The professor’s place in student memories.  Change, 28, 10-17.

Cashin, W. E. (1988). IDEA technical report no. 20: Student ratings of teaching: A summary of the research. Manhattan: Kansas State University Center for Faculty Evaluation and Development.

Cashin, W. W. (1990). Student do rate different academic fields differently.  In M. Theall, & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice. New Directions for Teaching and      Learning, No. 43 (pp. 113-121). San Francisco: Jossey-Bass.

Cashin, W. E. (1995). IDEA paper no. 32: Student ratings of teaching: The research revisited. Manhattan: Kansas State University Center for Faculty Evaluation and Developmen.

Cashin, W. E., & Clegg, V. L. (1987, April). Are student ratings of different academic fields different? Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL.

Cashin, W. E., & Slawson, H. M. (1977). IDEA technical report no. 2: Description of database 1976-1977.  Manhattan: Kansas State University Center for Faculty Evaluation and Development.

Centra, J. A. (1976). The influence of different directions on student ratings of instruction.  Journal of Educational Measurement, 13, 277-282.

Centra, J. A. (1977). Student ratings of instruction and their relationship to student learning.  American Educational Research Journal, 14, 17-24.

Centra, J. A. (1993). Reflective faculty evaluation: Enhancing teaching and determining faculty effectiveness. San Francisco: Jossey-Bass.

Centra, J. A. (2003). Will teaches receive higher student evaluations by giving higher grades and less course work? Research in Higher Education, 44, 495-518.

Centra, J. A. (2009). Differences in responses to the Student Instructional Report: Is it bias?  Princeton: Educational Teaching Service.

Centra, J. A., & Creech, F. R. (1976). The relationship between student, teacher, and course characteristics and student ratings of teacher effectiveness. Princeton: Educational Testing Service.

Centra, J. A., & Gaubatz, N. B. (1998, April). Is there gender bias in student ratings of instruction? Paper presented at the Seventy-Ninth Annual Meeting of the American Educational Research Association, San Diego.

Centra, J. A., & Gaubatz, N. B. (2000). Is there a gender bias in student evaluations of teaching?  Journal of Higher Education, 70. 17-33.

Clark, K. E., and Keller, R. J. Student ratings of college teaching. (1954). In R. E. Eckert, & R. J. Keller (Eds.), A university looks at its program: The report of the University of Minnesota Bureau of Institutional Research, 1942-1952. Minneapolis: University of Minnesota Press.

Clayson, D. E., & Haley, D. A. (1990). Student evaluations in marketing: What is actually being measured? Journal of Marketing Education, 12, 9-17.

Cohen, P. A. (1981). Student ratings of instruction and student achievement: A meta-analysis of multi-section validity studies.  Review of Educational Research, 51, 281-309.

Cohen, P. A. (1986, April). An updated and expanded metaanalysis of multisection student rating validity studies. Paper presented at the annual meeting of the American Educational Research Association; San Francisco, CA.

Cohen, P. A. (1987, April). A critical analysis and reanalysis of the multi-section validity meta-analysis. Paper presented at the annual meeting of the American Educational Research Association, Washington, D.C.

Cohen, P. A. (1982). Validity of student ratings in psychology courses: A research synthesis. Teaching of Psychology, 9, 78-82.

Cohen, S. H., & Berger, W. G. (1970). Dimensions of students’ ratings of college instructors underlying subsequent achievement on course examinations. Proceedings of the 78th Annual Convention of the American Psychological Association, (pp. 605-696). American Psychological Association.

Cornwall, C. D. (1974). Statistical treatment of data from student teaching evaluation questionnaires.  Journal of Chemical Education, 51, 155-160.

Conran, P. B., et al. (1991). High school student evaluation of student teachers: How do they compare with professionals? Illinois School Research and Development, 27, 145-150.

Costin, F. (1968). A graduate course in the teaching of psychology: Description and evaluation.  Journal of Teacher Education, 19, 425-432.

Costin, F. (1978). Do student ratings of college teachers predict student achievement? Teaching of Psychology, 5, 86-88.

Costin, F., Greenough, W. T., & Menges, R. J. (1971). Student ratings of college teaching: Reliability, validity, and usefulness. Review of Educational Research, 41, 511-536.

Crittenden, K. S., Norr, J. L., & LeBailly, R. K. (1975). Size of university classes and student evaluations of teaching. Journal of Higher Education, 46, 461-470.

d’Apollonia, S., & Abrami, P. C. (1997). Navigating student ratings of instruction. American Psychologist, 52, 1198-1208.

Davidovitch, N., & Soen, D. (2009). Myths and facts about student surveys of teaching the links between students’ evaluations of faculty and course grades. Journal of College Teaching & Learning, 6, 41-49.

Davis, B. G. (2009).  Tools for teaching, (2nd ed.). San Francisco: Jossey-Bass.

Delaney, E. L. (1976). The relationships of student ratings of instruction to student, instructor and course characteristics. Paper read at the annual meeting of the American Educational Research Association.

Delaney, E. L., & Coon, E. E., Jr. (1976). Differing views on the criteria and purposes of student ratings of instruction. Paper read at the annual meeting of the Association for Instructional Research.

Divoky, J. J., & Rothermel, M. A. (1988). Student perceptions of the relative importance of dimensions of teaching performance across type of class. Educational Research Quarterly, 12, 40-45.

Donaldson, J. F., et al. (1993). A triangulated study comparing adult college students’ perceptions of effective teaching with those of traditional students. Continuing Higher Education Review, 57, 146-165.

Downie, N. M. (1952). Student evaluation of faculty. Journal of Higher Education, 23, 495-496, 503.

Doyle, K. O., Jr., & Crichton, L. I. (1978). Student, peer, and self evaluation of college instruction. Journal of Educational Psychology, 70, 815-826.

Doyle, K. O., Jr., & Whitely, S. E. (1974). Student ratings as criteria for effective teaching.  American Educational Research Journal, 11, 259-274.

Dukes, R. L. & Victoria, B. (1989). The effects of gender, status, and effective teaching on the evaluation of college instruction. Teaching Sociology, 17, 447-457.

Elliott, D. N. (1950). Characteristics and relationships of various criteria of college and university teaching. Purdue University Studies in Higher Education, 70, 5-61.

Ellis, N. R., & Rickard, H. C. (1977). Evaluating the teaching of introductory psychology. Teaching of Psychology, 4, 128-132.

Endo, G. T., & Della-Piana, G. (1976). A validation study of course evaluation ratings. Improving College and University Teaching, 24, 84-86.

Evans, E. D. (1969). Student activism and teaching effectiveness: Survival of the fittest? Journal of College Student Personnel, 10, 102-108.

Feldman, K. A. (1976ab). Grades and college students’ evaluations of their courses and teachers. Research in Higher Education, 5, 243-288.

Feldman, K. A. (1977). Consistency and variability among college students in rating their teachers and courses: A review and analysis. Research in Higher Education, 6, 233-274.

Feldman, K. A. (1978). Course characteristics and college students’ ratings of their teachers: What we know and what we don’t. Research in Higher Education, 9, 199-242.

Feldman, K. A. (1983). Seniority and experience of college teachers as related to evaluations they receive from students. Research in Higher Education, 18, 3-124.

Feldman, K. A. (1984). Class size and college students’ evaluations of teachers and courses: A closer look. Research in Higher Education, 21, 45-116.

Feldman, K. A. (1986). The perceived instructional effectiveness of college teachers as related to their personality and attitudinal characteristics. Research in Higher Education, 24, 129-213.

Feldman, K. A. (1988). Effective college teaching from the students’ and faculty’s view: Matched or mismatched priorities? Research in Higher Education, 28, 291-329.

Feldman, K. A. (1989). The association between student ratings of specific instructional dimensions and student achievement: Refining and extending the synthesis of data from multisection validity studies. Research in Higher Education, 30, 583-645.

Feldman, K. A. (1992). College students’ views of male and female college teachers: Part I—Evidence from the social laboratory and experiments. Research in Higher Education, 33, 317-375.

Feldman, K. A. (1993). College students’ views of male and female college teachers: Part II—Evidence from students’ evaluations of their classroom teachers. Research in Higher Education, 34, 151-211.

Feldman, K. A. (1997). Identifying exemplary teachers and teaching: Evidence from student ratings. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice. (pp. 368-395). New York: Agathon.

Feldman, K. A. (2007). Identifying exemplary teachers and teaching: Evidence from student ratings. In R. P. Perry & J. C. Smart (Eds). The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 75-96). San Francisco: Jossey-Bass.

Fernandez, J. & Mateo, M. (1997). Student and faculty gender in ratings of university teaching quality. Sex Roles, 37, 997-1003.

Fortson, S. B., & W. E. Brown. (1998). Best and worst university instructors: The opinions of graduate students. College Student Journal, 32, 572-576.

Francis, J. B. (1976). Faculty ratings of course evaluation items. Research in Higher Education, 4, 23-40.

Franklin, J., & Theall, M. (1991, April). Grade inflation and student ratings: A closer look. Paper presented at the 75th annual meeting of the American Educational Research Association, Chicago.

Franklin, J., & Theall, M. (1993). Student ratings of instruction and gender differences revisited. Paper presented at the annual meeting of the American Educational Research Association.

Franklin, J. & Theall, M. (1994, April). Student ratings of instruction and sex differences revisited. Paper presented at the Seventh-fifth Annual Meeting of the American Educational Research Association, New Orleans.

Franklin, J., & Theall, M. (1995). The relationship of disciplinary differences and the value of class preparation time to student ratings of instruction. In N. Hativa & M Marinocovich (Eds.), Disciplinary differences in teaching and learning: Implication for practice. New Directions for Teaching and Learning No. 64. (pp. 41-48). San Francisco: Jossey-Bass.

Freeman, H. (1994). Student evaluation of college instructors: Effects of type of course taught, instructor gender and gender role, and student gender. Journal of Educational Psychology, 86, 627-630.

Freilich, M. B. (1983). A student evaluation of teaching techniques. Journal of Chemical Education, 60, 218-221.

Frey, P. W. (1973). Student ratings of teaching: Validity of several rating factors. Science, 182, 83-85.

Frey, P. W. (1976). Validity of student instructional ratings as a function of their timing. Journal of Higher Education, 47, 327-336.

Frey, P. W., Leonard, D. W., & Beatty, W. W. (1975). Student ratings of instruction: Validity research. American Educational Research Journal, 12, 435-444.

Gage, N. L. (1961). The appraisal of college teaching: An analysis of ends and means. Journal of Higher Education, 32, 17-22.

Gessner, P. K. (1973). Evaluation of instruction. Science, 180, 566-569.

Gigliotti, R. J., & Buchtel, F. S. (1990). Attributional bias and course evaluation. Journal of Educational Psychology, 82, 342-351.

Goldberg, G., & Callahan, J. (1991). Objectivity of student evaluations of instructors. Journal of Education for Business, 66, 377-378.

Goldman, L. (1993). On the erosion of education and the eroding foundation of teacher education (or why we should not take student evaluation of faculty seriously). Teacher Education Quarterly, 20, 57-64.

Goodwin, L. D., & Stevens, E. A. (1993). The influence of gender on university faculty members’ perceptions of “good” teaching. Journal of Higher Education, 64, 166-185.

Grant, C. W. (1971). Faculty allocation of effort and student course evaluations. Journal of Educational Research, 64, 405-410.

Gravestock, P., & Gregor-Greenleaf, E. (2008). Student course evaluations: research models and trends. Toronto: Higher Education Quality Council of Ontario.

Greenwald, A. G., & Gillmore, G. M. (1997). Grading leniency is a removable contaminant of student ratings. American Psychologist, 52, 1209-1217.

Greenwood, G. E., Hazelton, A., Smith, A. B., & Ware, W. B. (1976). A study of the validity of four types of student ratings of college teaching assessed on a criterion of student achievement gains. Research in Higher Education, 5, 171-178.

Grush, J. E., & Costin, F. (1975). The student as consumer of the teaching process. American Educational Research Journal, 12, 55-66.

Guskey, T. R., & Easton, J. Q. (1983). The characteristics of very effective teacher in urban community colleges. Community College Journal of Research and Practice, 7, 265-274.

Hamermesh, D. & Parker, A. (2005). Beauty in the classroom: Instructors’ pulchritude and putative pedagogy productivity. Economics of Education Review, 24, 369-376.

Hancock, P. D., et al. (1992). Student and teacher gender ratings of university faculty: Results from five colleges of study. Journal of Personnel Evaluation in Education, 6, 359-366.

Hativa, N. (1996). University instructors’ rating profiles: Stability over time, and disciplinary differences. Research in Higher Education, 37, 341-365.

Haslett, B. J. (1976). Student knowledgeability, student sex, class size, and class level: Their interactions and influences on student ratings of instruction. Research in Higher Education, 5, 39-65.

Heckert, T. M., Latier, A., Ringwald-Burton, A., & Drazen, C. (2006). Relations among student effort, perceived class difficulty appropriateness, and student evaluations of teaching: Is it possible to “buy” better evaluations through lenient grading? College Student Journal, 40, 588-596.

Heilman, J. D., & Armentrout, W. D. (1936). The rating of college teachers on ten traits by their students. Journal of Educational Psychology, 27, 197-216.

Hoffman, R. G. (1978). Variables affecting university student ratings of instructor behavior.  American Educational Research Journal, 15, 287-299.

Howard, G. S., & Maxwell, S. E. (1980). The correlation between student satisfaction and grades: A case of mistaken causation? Journal of Educational Psychology, 72, 810-820.

Howard, G. S., & Maxwell, S. E. (1982). Do grades contaminate student evaluations of instruction? Research in Higher Education, 16, 175-188.

Hoyt, D. P., & Cashin, W. E. (1977). IDEA technical report no. 1: Development of the IDEA system. Manhattan: Kansas State University Center for Faculty Evaluation and Development.

Hoyt, D. P. & Lee, E. (2002). IDEA technical report no. 12: Basic data for the revised IDEA system. Manhattan: Kansas State University Center for Faculty Evaluation and Development.

Huston, T. A. (2005). Research report: Race and gender bias in student evaluations of teaching. Seattle: Seattle University Center for Excellence in Teaching and Learning.

Hutchinson, L. M., & Beadle, M. E. (1992). Professors’ communication styles: How they influence male and female seminar participants. Teaching and Teacher Education, 8, 405-18.

Isaacson, R.L., McKeachie, W. J., Milholland, J. E., Lin, Y-G., Hotelier, M., Baerwaldt, J. W., & Zinn, K. L. (1964). Dimensions of student evaluations of teaching. Journal of Educational Psychology, 55, 344-351.

Johnson, D. W., Johnson, R. T., & Smith, K. A. Cooperative Learning: Increasing College Faculty Productivity. ASHE-ERIC Higher Education Report, no 4. Washington, D.C.: George Washington University.

Johnson, R. T. & Johnson, D. W. Cooperation and Competition Theory and Research. Edina, MN: Interaction, 1989.

Johnson, T. D. (2003). Online student ratings: Will student respond? In D. L. Sorenson & T. D. Johnson (Eds.), Online student ratings of instruction. New Directions for Teaching and Learning, 96, 49-60.

Johnson, V. (2003). Grade Inflation: A Crisis in Higher Education. New York, Springer.

Jones, S., & Dindia, K. (2004). A meta-analytic perspective on sex equity in the classroom. Review of Educational Research, 74, 443-471.

Kierstead, D., et al. (1988). Sex role stereotyping of college professors: Bias in students’ ratings of instructors. Journal of Educational Psychology, 80, 342-344.

Kogan, L. R., Schoenfeld-Tacher, R., & Hellyer, P. W.  Student evaluations of teaching: perceptions of faculty based on gender, position, and rank.  Teaching in Higher Education, 15, 623-636. 

Kohlan, R. G. (1973). A comparison of faculty evaluations early and late in the course. Journal of Higher Education, 44, 587-595.

Kozub, Robert M. (2010). Relationship of course, instructor, and student characteristics to dimensions of student ratings of teaching effectiveness in business school. American Journal of Business Education, 3, 22-40.

Krautmann, A. C., & Sander, W. (1999). Grades and student evaluation of teachers. Economics of Education Review, 18, 59-63.

Kulik, J. A. (2001). Student ratings: Validity, utility, and controversy. In M. Theall, P. C. Abrami, and L.A. Mets (Eds.). The student ratings debate: Are they valid? How can we best use them?  New Directions for Institutional Research. No. 109. (pp. 9-26). San Francisco: Jossey-Bass.

Kulik, J. A., & McDeachie, W. J. (1975). The evaluation of teachers in higher education. In F. N. Kerlilnger (Ed.), Review of Research in Education, Vol 3. Itasca, IL: Peacock.

Landis, L. M., & Pirro, E. B. (1977). Required/elective student differences in course evaluations. Teaching Political Science, 4, 405-422.

Lin, Y-G., McKeachie, W. J., & Tucker, D. G. (1984). The use of student ratings in promotion decisions. Journal of Higher Education, 55, 583-589.

Linsky, A. S., & Straus, M. (1975). Student evaluations, research productivity and eminence of college faculty. Journal of Higher Education, 46, 89-102.

Lovell, G. D., and Haner, C. F. (1975). Forced-choice applied to college faculty rating. Educational and Psychological Measurement, 15, 291-304.

Ludwig, J. M., & Meacham, J. A. (1997). Teaching controversial courses: Student evaluations of instructor and content. Educational Research Quarterly, 21, 27-38.

Luek, T. L., et al. (1993). The interaction effects of gender on teaching evaluations. Journalism Educator, 48, 235-248.

MacNell, L., Driscoll, A., & Hunt, A. N. (2015). What’s in a name? Exposing gender bias in student ratings of teaching.  Innovative Higher Education, 40, 291-303.

Marks, R. B. (2000). Determinants of student evaluations of global measures of instructor and course value. Journal of Marketing Education, 22, 108-119.

Marques, T. E., Lane, D. M., & Dorfman, P. W. (1979). Toward the development of a system for instructional evaluation: Is thee consensus regarding what constitutes effective teaching? Journal of Educational Psychology, 71, 840-849.

Marsh, H. W. (1978). Students’ evaluations of instructional effectiveness: Relationship to student, course, and instructor characteristics. Paper read at the annual meeting of the American Educational Research Association.

Marsh, H. W. (1982). The use of path analysis to estimate teacher and course effects in student ratings of instructional effectiveness. Applied Psychological Measurement, 6, 47-59.

Marsh, H. W. (1984). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases, and unity.  Journal of Educational Psychology, 76, 707-754.

Marsh, H. W. (1987). Students’ evaluations of university teaching: Research findings, methodological issues, and directions for future research. International Journal of Educational Research, 11, 252-388.

Marsh, H. W. (1992). A longitudinal perspective of student evaluations of university teaching: Ratings of the same teachers over a thirteen-year period.” Paper presented at the 73rd Annual Meeting of the American Educational Research Association, San Francisco.

Marsh, H. W. (2007a). Do university teachers become more effective with experience? A multilevel growth model of students’ evaluations of teaching over 13 years. Journal of Educational Psychology, 99, 775-790.

Marsh, H. W. (2007b). Students’ evaluations of university teaching: Dimensionality, reliability, validity, potential biases and usefulness. In R. P. Perry & J. C. Smart (Eds.), The scholarship of teaching and learning in higher education: An evidence-based perspective (pp. 319-383). Dordecht, The Netherlands: Springer.

Marsh, H. W., & Dunkin, M. J. (1992). Students’ evaluations of university teaching: A multidimensional perspective. In J. C. Smart (Ed.) Higher education: Handbook of theory and research. Vol. 8 (pp. 143-233). New York: Agathon.

Marsh, H. W.,& Dunkin, M. J. (1997). Students evaluations of university teaching: A multidimensional perspective. In R. P. Perry & J. C. Smart (Eds.), Effective teaching in higher education: Research and practice (pp. 241-320). New York: Agathon.

Marsh, H. W., Fleiner, H., & Thomas, C. S. (1975). Validity and usefulness of instructional quality. Journal of Educational Psychology, 67, 833-839.

Marsh, H. W., & Overall, J. U. (1980). Validity of students’ evaluations of teaching effectiveness: Cognitive and affective criteria. Journal of Educational Psychology, 67, 833-839.

Marsh, H. W., Overall, J. U., & Thomas, C. S. (1976). The relationship between student evaluations of instruction and expected grade. Paper read at the annual meeting of the American Educational Research Association.

Marsh, H. W., & Roche, L. A. (1987). Making students’ evaluations of teaching effectiveness effective: The critical issues of validity, bias, and utility. American Psychologist, 52, 1187-1197.

Marsh, H. W., & Roche, L. A. (1993). The use of students’ evaluations of teaching: Popular myth, bias, validity, and innocent bystanders. Journal of Educational Psychology, 92, 217-251.

Marsh, H. W., & Roche, L. A. (2000). Effects of grading leniency and low workload on students’ evaluations of teaching: Popular myth, bias, validity, or innocent bystanders? Journal of Educational Psychology, 92, 202-228.

Marsh, H. W., & Ware, J. E. (1982). Effects of expressiveness, content coverage, and incentive on multidimensional student rating scales: New interpretations of the Dr. Fox effect. Journal of Educational Psychology, 74, 126-134.

Mateo, M. A., & Fernandez, J. (1996). Incidence of class size on the evaluation of university teaching quality. Educational and Psychological Measurement, 56, 771-778.

McCallum, L. W. (1984). A meta-analysis of course evaluation data and its use in the tenure decision. Research in Higher Education, 21, 150-158.

McDaniel, E. D., & Feldhusen, J. F. (1970). Relationships between faculty ratings and indexes of service and scholarship. Proceedings of the 78th Annual convention of the American Psychological Association. (pp. 619-620). American Psychological Association.

McPherson, M., & Jewell, R. (2007). Leveling the playing field: Should student evaluation scores be adjusted? Social Science Quarterly, 88, 868-881.

McKeachie, W. J. (1979). Student ratings of faculty: A reprise. Academe, 65, 384-397.

McKeachie, W. J. (1997). Student ratings: The validity of use. American Psychologist, 52, 1218-1225.

McKeachie, W. J. (2007). Good teaching makes a difference—and we know what it is. In R. P. Perry & J. C. Smart (Eds.), The Scholarship of Teaching and Learning in Higher Education: An Evidence-based Perspective (pp. 457-474). Dordecht, The Netherlands:  Springer.

McKeachie, W. J., Lin, Y-G., & Mann, W. (1971). Student ratings of teacher effectiveness: Validity studies. American Educational Research Association, 8, 435-445.

McKinnon, M. M. (1999). Core Elements of student motivation in problem-based learning. In M. Theall (Ed.), Motivation from Within: Encouraging Faculty and Students to Excel. New Directions for Teaching and Learning, no. 78. (pp. 49-58). San Francisco: Jossey-Bass.

Menges, R. J. (1973). The new reporters: Students rate instruction. In C. R. Pace (Ed.) Evaluating learning and teaching. New directions for higher education, no. 4 (pp. 59-75). San Francisco: Jossey-Bass.

Minor, L. C., Onwuegbuzie, A. J., Witcher, A. E., & James, T. L. (2002). Preservice teachers’ educational beliefs and their perceptions of characteristics of effective teachers. Journal of Educational Research, 96, 116-127.

Mintzes, J. J. (1976). Field test and validation of a teaching evaluation instrument: The Student Opinion Survey of Teaching. A report submitted to the Senate Committee for Teaching and Learning, Faculty Senate, University of Windsor. Windsor, Ontario: University of Windsor.

Miron, M., & Segal, E. (1978). The good university teacher.  Higher Education, 7, 27-34.

Mirus, R. (1973). Some implications of student evaluation of teachers. Journal of Economic Education, 5, 35-37.

Moritsch, B. G., & Suter, W. N. (1988). Correlates of halo error in teaching evaluation. Educational Research Quarterly, 12, 29-34.

Morsh, J. E., Burgess, G. G., & Smith, P. N. (1956). Student achievement as a measure of instructor effectiveness. Journal of Educational Psychology, 47, 79-88.

Murray, H. G. (1983). Low-inference classroom teaching behaviors in relation to six measures of college teaching effectiveness. Proceedings of the Conference on the Evaluation and Improvement of University Teaching: The Canadian Experience (pp. 43-73). Montreal: McGill University, Centre for Teaching and Learning Services.

Murray, H. G., Rushton, J. P., & Paunonen, S. V. (1990). Teacher personality traits and student instructional ratings in six types of university courses. Journal of Educational Psychology, 82, 250-61.

Naftulin, D. H., Ware, J. E., & Donnelly, F. A. (1973). The Doctor Fox lecture: A paradigm of educational seduction. Journal of Medical Education, 48, 630-635.

Nelson, J. P., & Lynch, K. A. (1984). Grade inflation, real income, simultaneity, and teaching evaluations. Journal of Economic Education, 15, 21-35.

Neumann, R. (2001). Disciplinary differences and university teaching. Studies in Higher Education, 26, 135-146.

Nimmer, J. G., & Stone, E. R. (1991). Effects of grading practices and time of rating on student ratings of faculty performance and student learning. Research in Higher Education, 32, 195-215.

Onwuegbuzie, A. J., Witcher, A. E., Collins, K. M. T., Filer, J. D., Wiedmaier, C. D., & Moore, C. W. (2007). Students’ perceptions of characteristics of effective college teachers: A validity study of a teaching evaluation form using mixed-methods analysis. American Educational Research Journal, 44, 113-169.

O’Reilly, M. T. (1987). Relationship of physical attractiveness to student ratings of teaching effectiveness. Journal of Dental Education, 51, 600-602.

Orpen, C. (1980). Student evaluations of lecturers as an indicator of instructional quality: A validity study.  Journal of Educational Research, 74, 5-7.

Ory, J. C. (2001). Faculty thoughts and concerns about student ratings. In K. G. Lewis (Ed.), Techniques and strategies for interpreting student evaluations. New Directions for Teaching and Learning, no. 87 (pp. 3-15). San Francisco: Jossey-Bass.

Overall, J. U., & Marsh, H. W. (1980). Students’ evaluations of instruction: A longitudinal study of their stability. Journal of Educational Psychology, 72, 321-325.

Panitz, T.  (1999). Benefits of cooperation in relation to student motivation. In M. Theall (Ed.), Motivation from Within: Encouraging Faculty and Students to Excel. New Directions for Teaching and Learning, no. 78. (59-67). San Francisco: Jossey-Bass.

Perlman, D. (1973). Class size and students’ ratings of university courses. Paper read at the annual meeting of the Canadian Psychological Association.

Perry, R. R. (1969). Evaluation of teaching behavior seeks to measure effectiveness. College and University Business, 47, 18, 22.

Petchers, M. K., & Chow, J. C. (1988). Sources of variation in students’ evaluations of instruction in a graduate social work program. Journal of Social Work Education, 24, 35-42.

Pohlmann, J. T. (1975). A multivariate analysis of selected class characteristics and student ratings of instruction. Multivariate Behavioral Research, 10, 81-92.

Pozo-Munoz, C., Rebolloso-Pacheco, E., & Fernandez-Ramirez, B. (2000). The “ideal teacher”: Implications for student evaluation of teacher effectiveness. Assessment & Evaluation in Higher Education, 25, 254-263.

Punyanunt-Carter, N., & Carter, S. L. (2015). Students’ gender bias in teaching evaluations. Higher Learning Research Communications, 5, 28-37.

Renaud, R. D., & Murray, H. G. (1996). Aging, personality, and teaching effectiveness in academic psychologists. Research in Higher Education, 37, 323-340.

Rodabaugh, R. C., & Kravitz, D. A. (1994). Effects of procedural fairness on student judgments of professors. Journal on Excellence in College Teaching, 5, 67-83.

Rodin, M., & Rodin, B. (1972). Student evaluations of teachers. Science, 177, 1164-1166.

Romine, S. (1974). Student and faculty perception of an effective university instructional climate. Journal of Educational Research, 68, 139-143.

Rosenfield, S., Dedic, H., Dickie, L., Rosenfield, E., Allus, M. W., Koestner, R., Kishtalka, A.,   Milkman, K., & Abrami, P. (2005). Étude des facteurs aptes à influencer la réussite et la retention dans les programmes de la science aux cégeps anglophones. Final report submitted to Fonds de recherché sure la société et al culture. Quebec: Montreal.

Rowden, G. V., & Carlson, R. E. (1996). Gender issues and students’ perceptions of instructors’ immediacy and evaluation of teaching and course. Psychological Reports, 78, 835-839.

Rubinstein, J., & Mitchell, H. (1970). Feeling free, student involvement, and appreciation. Proceedings of the 78th Annual Convention of the American Psychological Association, 5, 623-624. American Psychological Association.

Sailor, P., Worthen, B. R., & Shin, E. H. (1997). Class level as a possible mediator of the relationship between grades and student ratings of teaching. Assessment and Evaluation in Higher Education, 22, 261-269.

Salthouse, T. A., McKeachie, W. J., & Lin, Y-G. (1978). An experimental investigation of factors affecting university promotion decisions. Journal of Higher Education, 49, 177-183.

Scherr, F. C., & Scherr, S. S. (1990). Bias in student evaluation of teacher effectiveness. Journal of Education for Business, 65, 356-358.

Schulte, D. P., Slate, J. R., & Onwuegbzie, A. J. (2009). Effective high school teachers: A mixed investigation. International Journal of Educational Research, 47, 351-361.

Scott, C. S. (1975). Correlates of student ratings of professorial performance: Instructor defined extenuating circumstances, class size, and faculty member’s professional experience and willingness to publish results. Paper read at the annual meeting of the American Educational Research Association.

Scott, C. S. (1977). Student ratings and instructor-defined extenuating circumstances. Journal of Educational Psychology, 69, 744-747.

Scriven, M. (1981). Summative teacher evaluation. In J. Millman (Ed.), Handbook of teacher evaluation (pp. 244-271). Beverly Hills: Sage.

Shapiro, G. E. (1990). Effect of instructor and class characteristics on student class evaluations. Research in Higher Education, 31, 135-148.

Shingles, R. D. (1977). Faculty ratings: Procedures for interpreting student evaluations.  American Educational Research Journal, 14, 459-470.

Sixbury, G. R., & Cashin, W. E. (1995). Comparative data by academic field. IDEA technical report, no. 10. Manhattan: Kansas State University, Center for Faculty Evaluation and Development.

Slate, J. R., LaPrairie, K. N., Schulte, D. P., & Onwuegbuzie, A. J. (2011). Views of effective college faculty: A mixed analysis. Assessment & Evaluation in Higher Education, 36, 331-346.

Solomon, D., Rosenberg, L., & Bezkek, W. E. (1964). Teacher behavior and student learning.  Journal of Educational Psychology, 55, 23-30.

Sorge, D. H., & Klinie, C. E. (1973). Verbal behavior of college instructors and attendant effect upon student attitudes and achievement. College Student Journal, 7, 24-29

Spooren, P., Brockx, B., & Mortelmans, D. (2013). On the validity of student evaluation of teaching: The state of the art. Review of Educational Research, 83, 598-642.

Sprague, J., & Massoni, D. (2005). Student evaluations and gendered expectations: What we can’t count can hurt us. Sex Roles, 53, 779-793.

Stuit, D. B., & Ebel, R. L. (1952). Instructor rating at a large state university. College and University, 27, 247-254.

Stark, P. B., & Freishtat, R.  (2014). An evaluation of course evaluations. ScienceOpen.com.

Stumpf, S. A., & Freedman, R. D. (1979). Expected grade covariation with student ratings of instruction: Individual versus class effects. Journal of Educational Psychology, 71, 293-302.

Sullivan, A. M., & Skanes, G. R. (1974). Validity of student evaluation of teaching and the characteristics of successful instructors. Journal of Educational Psychology, 66, 584-590.

Summers, M. A., et al. (1996). The camera adds more than pounds: Gender differences in course satisfaction for campus and distance learning students. Journal of Research and Development in Education, 29, 212-219.

Tang, T. L-P. (1997). Teaching evaluation at a public institution of higher education: Factors related to the overall teaching effectiveness. Public Personnel Management, 26, 379-389.

Tatro, C. N. (1995). Gender effects on student evaluations of faculty. Journal of Research and Development in Education, 28, 169-173.

Theall, M. (1999). New directions for theory and research on teaching: A review of the past twenty years.  In M. D. Svinicki (Ed.), New Directions for Teaching and Learning, no. 30. (pp. 29-52). San Francisco: Jossey-Bass.

Theall, M., & Feldman, K. A. (2007). Commentary and update on Feldman’s (1997) “Identifying exemplary teachers and teaching: Evidence from student ratings.” In R. P. Perry & J. C. Smart (Eds.), The Scholarship of Teaching and Learning in Higher Education: An Evidence-based Perspective (pp. 130-1440.  Dordecht, The Netherlands: Springer.

Theall, M., & Franklin, J. (1991). Using student ratings for teaching improvement. New Directions for Teaching and Learning, no. 48. (pp. 83-95). San Francisco: Jossey-Bass.

Theall, M., & Franklin, J. (2001). Looking for bias in all the wrong places: A search for truth or a witch hunt in student ratings of instruction? In M Theall, P. C. Abrami, & L. A. Mets, Eds.), The student ratings debate: Are they valid? How can we best use them? New Directions for Institutional Research, no. 109. (pp. 45-56). San Francisco: Jossey-Bass.

Theall, M., Franklin, J., & Ludlow, L. (1990a). Attributions and retributions: Student ratings and the perceived causes of performance. Instructional Evaluation, 11, 12-17.

Theall, M., Franklin, J., & Ludlow, L. (1990b). Attributions and retributions: Student ratings and the perceived causes of performance. Paper presented at the annual meeting of the American Educational Research Association.

Trout, P. (2000). Flunking the test: The dismal record of student evaluations. Academe July-August: 58-61.

Turner, R. L., & Thompson, R. P. (1974). Relationships between college student ratings of instructors and residual learning. Paper read at the annual meeting of the American Educational Research Association.

Wachtel, H. K. (1998). Student evaluation of college teaching effectiveness: A brief review. Assessment & Evaluation in Higher Education, 29, 191-221.

Waters, M., Kemp, E., & Pucci, A. (1988). High and low faculty evaluations: Descriptions by students. Teaching of Psychology, 15, 203-204.

Wheeless, V. E., & Potorti, P. F. (1989). Student assessment of teacher masculinity and femininity: A test of the sex role congruency hypothesis on student attitudes toward learning. Journal of Educational Psychology, 81, 259-262.

Whitworth, J. E., Price, B. A., & Randall, C. H.  (2002). Factors that affect College of Business student opinion of teaching and learning. Journal of Education for Business, 77(5), 282-289.

Widmeyer, W. N., & Loy, J. M. (1988). When you’re hot, you’re hot! Warm-cold effects in first impressions of persons and teaching effectiveness. Journal of Educational Psychology, 80, 118-121.

Wigington, H., Tollefson, N., & Rodriguez, E. (1989). Students’ ratings of instructors revisited: Interactions among class and instructor variables. Research in Higher Education, 30(3), 331-344.

Wilson, R. (1998). New research casts doubt on value of comparing adult college students’ perceptions of effective teaching with those of traditional students. Chronicle of Higher Education, 44, A1 2-A1 4.

Winocur, S., et al. (1989). Perceptions of male and female academics within a teaching context. Research in Higher Education, 30, 317-329.

Witcher, A. E., Onwuegbuzie, A. J., & Minor, L. (2001). Characteristics of effective teachers: Perceptions of preservice teachers. Research in the Schools, 8(2), 45-57.

Wittmaier, B. C. (1975). Teaching styles: A comparison of faculty and student preferences. Improving College and University Teaching Yearbook 1975 (pp. 249-251). Corvallis: Oregon State University Press.

Wright, S. L., & Jenkins-Guarnieri, M. A. (2012). Student evaluations of teaching: Combining the meta-analyses and demonstrating further evidence for effective use. Assessment & Evaluation in Higher Education, 37(6), 683-699.

Young, S., & Shaw, D. G. (1999). Profiles of effective college and university teachers. The Journal of Higher Education, 70, 670-686.

Zahn, D. K., & Schramm, R. M. (1992). Student perception of teacher effectiveness based on teacher employment and course skill level. Business Education Forum, 46, 16-18.