首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 812 毫秒
1.
《College Teaching》2013,61(2):44-50
Abstract

The article examines students' perceptions of student evaluations of teaching and compares them to faculty perceptions on several dimensions. Findings indicate students were much less likely to agree that SET encourage faculty to grade more leniently, that they have an influence on a faculty member's career, or that they lead to changes in courses and/or teaching styles. Faculty members believed that students rate easy, entertaining instructors more highly. Includes suggestions for change and future research directions.  相似文献   

2.
Student evaluation of teaching (SET) ratings are used to evaluate faculty's teaching effectiveness based on a widespread belief that students learn more from highly rated professors. The key evidence cited in support of this belief are meta-analyses of multisection studies showing small-to-moderate correlations between SET ratings and student achievement (e.g., Cohen, 1980, Cohen, 1981; Feldman, 1989). We re-analyzed previously published meta-analyses of the multisection studies and found that their findings were an artifact of small sample sized studies and publication bias. Whereas the small sample sized studies showed large and moderate correlation, the large sample sized studies showed no or only minimal correlation between SET ratings and learning. Our up-to-date meta-analysis of all multisection studies revealed no significant correlations between the SET ratings and learning. These findings suggest that institutions focused on student learning and career success may want to abandon SET ratings as a measure of faculty's teaching effectiveness.  相似文献   

3.
Abstract

Evaluation of college instructors often centers on course ratings; however, there is little evidence that these ratings only reflect teaching. The purpose of this study was to assess the relative importance of three facets of course ratings: instructor, course and occasion. We sampled 2,459 fully-crossed dyads from a large university where two instructors taught the same two courses at least twice in a 3-year period. Generalizability theory was used to estimate unconfounded variance components for instructor, course and occasion, as well as their interactions. Meta-analysis was used to summarize those estimates. Results indicated that a three-way interaction between instructor, course and occasion that includes measurement error accounted for the most variance in student ratings (24%), with instructor accounting for the second largest amount (22%). While instructor - and presumably teaching - accounted for substantial variance in student course ratings, factors other than instructor quality had a larger influence on student ratings.  相似文献   

4.
To investigate the effect of a number of situational factors on student evaluations of lecturers, a teacher rating form (TRF) was administered to 326 first-year students taking an Introduction to Economics course. A comparison was made of the responses of the following subgroups among the students: full-time versus part-time students; students asked to sign their responses versus those whose responses were left anonymous; students told that their responses were just for feedback to the lecturer versus those told that their responses would be used for promotion, tenure, and salary decisions about the lecturer; students told that the TRFs were sponsored by the student association versus those told that the TRFs were sponsored by the staff association. The factors of sponsor identity (staff or student association) and use of ratings (feedback or decisional) had significant effects on TRF scores; but the effects of the factors of anonymity and full- versus part-time were insignificant. These results are taken as suggestive evidence that conditions of administration can modify student responses to TRFs and that they should be as uniform as possible if TRFs are to be used to compare lecturers.  相似文献   

5.
Abstract

The authors compared the average grades given in 165 behavioral and social science courses with the average ratings given by students to the instructors who taught the courses. Significant positive correlations were found between the average ratings for instructional quality and the average grades received by students. The courses in which the average grades were the highest were also those in which students gave teachers the highest ratings. Among possible reasons for the correlations are that better teachers attracted better students or that quality teachers provided more effective instruction, resulting in more student learning and, thus, higher average grades. Another explanation is that most college students tend to bias their ratings of instructional quality in favor of teachers who grade leniently (I. Neath, 1996). If correct, the latter reasoning begins to explain why the widespread use of student evaluations in the United States in recent decades has been accompanied by increases in the average grades that university students received. To prevent grade inflation, and particularly to avoid rewarding and promoting instructors who use increasingly lax grading standards, administrators should adjust student ratings of instructional quality for the average grades given for a course. In general, only courses near the extremely high and low ends in terms of students' average grades were significantly affected by the statistical adjustment.  相似文献   

6.
Instructors whose teaching was evaluated by students were given the opportunity to rate how applicable the evaluation items were to their classes. This study examined the kinds of items which instructors felt to be applicable or inapplicable, the relationships between the student ratings and the instructor applicability ratings, and the effect on an overall evaluation score of using the instructor applicability judgments as weights.Results generally support the consensus procedure of establishing rating forms; they suggest that the common criticism that faculty judgments of item applicability are influenced by anticipation of student ratings may be true for specific items and that while weighting composite evaluation scores by means of faculty applicability judgments does not affect those overall scores, the distributions of certain items may be altered.  相似文献   

7.
Faculty Views of Student Evaluation of College Teaching   总被引:1,自引:1,他引:1  
The literature abounds with psychometric studies of course evaluation measures and articles debating the merits of student ratings of instruction, but little research has focused on faculty perceptions of this procedure. In the present study faculty perceptions are explored at a teachers' college where evaluation is carried out annually on a sample of courses. The sample includes 101 instructors who completed the research questionnaire. Faculty attitudes reflected a broad range of responses towards validity of student ratings, and their usefulness for improving instruction. Although overall attitudes were mildly positive, few instructors reported changing instruction as a result of student ratings. Moreover, few supported sending evaluation results directly to college administrators or publishing them for student consumption.  相似文献   

8.
The purpose of this study was to examine how grading leniency and grade discrepancy (the difference between expected grades and deserved grades) were associated with various dimensions of student ratings of instruction. A sample of 754 undergraduate college students completed a student ratings of instruction instrument and provided responses to a number of other questions on topics such as course difficulty and workload. A series of multilevel regression analyses were conducted and results showed that an instructor's grading leniency, as perceived by students, was positively associated with student ratings on 11 of 12 dimensions of instruction examined. This finding suggests that more lenient instructors tend to receive higher student ratings. The second finding shows that grade discrepancy was negatively associated with most dimensions of instruction. This supports the self-serving bias hypothesis under attribution theory (Gigliotti & Buchtel, 1990) in that students tended to punish instructors with lower ratings when expected grades were lower than students believed they deserved, yet little evidence of a pattern of rewards existed in student ratings when students expected grades higher than they deserved.  相似文献   

9.
Abstract

To assess the extent to which student evaluations of lecturers are accurate indicators of quality of instruction (teacher effectiveness), 128 students taking a multisection course in University Mathematics evaluated the performance of their particular lecturers. Initial tests indicated no difference among the students taking the different sections (same content) in either aptitude or expected performance. Hence, the fact that significantly positive correlations were obtained between average student performance and average student evaluations of their lecturers across the various sections suggests that student evaluations of this kind can be used as a measure of instructional effectiveness under appropriate conditions.  相似文献   

10.
A manipulation of the instructions students received prior to completing the 7-item Endeavor Instructional Rating card differentially affected their ratings on two types of items. Specifically, when students were led to believe their ratings would have a strong impact on the instructor's career, they tended to be more lenient on items measuring rapport (i.e., the affective domain); this same effect was not observed for items measuring pedagogical skill (i.e., the cognitive domain). The different items on our instructional rating instrument appear to be measuring different things. One implication of this observation is that the inconsistent findings reported in past research on student ratings of instruction may be due to the differential mix of items from one instrument to another. When instructors are compared on ratings given them by students, unbiased interpretation requires that the multidimensional nature of teaching (and of the rating instrument) be considered.  相似文献   

11.
At the University of Queensland a questionnaire to students for their evaluation of tutors’ and lecturers’ classroom presentation and classroom management has been in (increasing) use since 1982. The introduction of the evaluation questionnaire and the management of the evaluation system are based on the research lieterature on change. Student evaluations per se do not induce change. However, self evaluations focus staffs attention on their own perception as teachers, and possible discrepancies between self and student evaluation may then motivate staff to change.

Therefore self evaluation has been used in the evaluation of teaching schemesto facilitate change. Overall, there was no relationship between student ratings and staff self ratings on the question, “All things considered, how would you rate this staff member's overall effectiveness as a university teacher?” Both highly and poorly rated lecturers showed large discrepancies between their self perception and student perception. This emphasises the importance of using more than one source of evaluative information for decision makeing. An interview study found that nearly all those evaluated had implemented changes and felt positive about evaluation.  相似文献   


12.
A number of recent studies have used generalizability theory to examine the dependability of student ratings of instruction. This study extends this line of research by examining the consistency of ratings between different sections of a course taught in a given semester by the same instructor, and by comparing the performance of global- and attribute-type instructor rating items. Five samples of physics instructors, varying in size from 5 to 12 instructors, were rated by their students on a form containing two global and eight attribute items. Each instructor taught two sections of a course. The study found that the section effect was small (ratings of instructors were consistent across different sections of the same course), and that the generalizability of ratings was substantially influenced by item specificity. For summary purposes, one global item seemed sufficient.  相似文献   

13.
Based on previous work by H. G. Murray, this study examined relationships between overt classroom teaching behaviors of college instructors and student ratings of teaching effectiveness. One hundred seven students who enrolled in six sections of an introductory psychology course rated instructors on 12 measures of teaching effectiveness and reported the frequencies of 20 observable, inclass teaching behaviors. Relationships between teaching behaviors and student ratings were examined by simple, multiple, and canonical correlation procedures. A complex pattern of relationships emerged. The canonical analysis revealed a two-dimensional structure consisting of instructor clarity and student-teacher rapport dimensions. Teaching behaviors loading on these dimensions were examined.  相似文献   

14.
This study, using student ratings of lecturers, examines the perceived effect of the lecturer’s ability to communicate effectively. The relationship between the standard question—’The lecturer was able to communicate ideas and information clearly’—and the global rating question—’Overall, the lecturer is an effective teacher’—was investigated in 7072 undergraduate standard teaching surveys from one university, using the lecturer’s language background as a factor. The results show that overall student ratings of English as a second language (ESL) lecturers are, on average, 0.4 points lower on a five‐point scale than student ratings of native English speaking lecturers. There is a strong interaction between this average difference and the lecturer’s faculty, with little difference in arts (humanities and social sciences) through to 0.6 points difference in science. The study also found that, of the four categorical questions used in the university’s standard teaching survey, the ‘communication’ question had the highest correlation with the ‘overall’ question. The correlation (R?=?0.96) suggests that the standard teaching survey is overly influenced by the students’ perception of this one aspect of teaching—reflecting a transmission model. The rating difference between ESL and native English speaking lecturers is briefly explored. In addition, the paper briefly considers the implications of the above findings for teacher development and for student expectations against a background of a growing ESL student population.  相似文献   

15.
This paper examines the effects of two background variables in students' ratings of teaching effectiveness (SETs): class size and students' motivation (as surrogated by students' likelihood to respond randomly). Resampling simulation methodology has been employed to test the sensitivity of the SET scale for three hypothetical instructors (excellent, average, and poor). In an ideal scenario without confounding factors, SET statistics unmistakably distinguish the instructors. However, at different class sizes and levels of random responses, SET class averages are significantly biased. Results suggest that evaluations based on SET statistics should look at more than class averages. Resampling methodology (bootstrap simulation) is useful for SET research for scale sensitivity study, research results validation, and actual SET score analyses. Examples will be given on how bootstrap simulation can be applied to real-life SET data comparison.  相似文献   

16.
Graduate teaching assistants (GTAs) constitute a valuable and economical teaching force in many higher education undergraduate programmes. However, student satisfaction with their teaching has attracted little attention in the research literature. This study aimed at examining students’ evaluation of teaching of GTAs in discussion groups, as well as exploring the effects of group and GTA variables on these ratings. Data were collected using a questionnaire administered online and completed by 7078 undergraduate students. Participants were enrolled in classes taught by 278 GTAs from four faculties in a major Israeli university. Results indicated that ratings assigned to clarity of instruction were the most salient predictor of students’ overall evaluation. Generally, findings were consistent with those reported in the literature for other categories of instructors. Groups taught by GTAs in exact sciences and engineering were rated higher than those in social sciences and business management. Group size and the percentage of men students were inversely correlated with student ratings, while student attendance rate was positively correlated. Women GTAs and GTAs who taught more than one group tended to receive higher ratings. Overall student attendance rate was the most prominent predictor of student ratings. The implications of the findings are discussed.  相似文献   

17.
In this paper, we discuss the anomaly between the increasing interest in feedback in current education research, the continued role of time-limited, unseen examinations as a form of assessment and the dearth of literature on feedback related to such exams. We argue that while exams have long been regarded as different from other forms of assessment, it is not justifiable to exempt them from the good practice that can, and does, inform these other types of assessment. We suggest a solution to providing timely, effective feedback for end of course examinations is to move the feedback emphasis to ‘feedforward’ by implementing exemplars (examples of real students’ work, generally of different qualities). This study adopts a mixed-methods approach to investigate whether there was a relationship between student use of exemplars before the exam and the final exam grade achieved (n?=?520), and to explore students’ and lecturers’ perspectives about the effectiveness of and engagement with exemplars. Quantitative findings suggested that those students who accessed exemplars did score better in their exams than those that did not. Qualitative data revealed that exemplars were received positively by students and lecturers, and we use this to provide practical suggestions on exemplar good practice.  相似文献   

18.
ABSTRACT

Educators in higher education commonly use peer and self evaluations to help assess student performance on group projects. Although these evaluations provide multiple benefits, many educators are wary of using them due to concerns about their quality. This study addresses three questions debated in the literature regarding the quality of these assessments. How much do students differentiate among peer contributions through their ratings? How reliable are peer ratings? How much agreement exists between peer and self ratings? Although these questions have been addressed to varying degrees in past work, their answers have been far from settled. While many studies focus on just one of the questions, this study’s data make it possible to address all three questions for the same group of students as well as examine each question by student performance level. The evaluations assessed in this study were completed by a large number of students under conditions associated with obtaining more valid and reliable ratings. Overall, the results provide support for using peer and self evaluations to help assess student contributions to group projects. Peer ratings were largely reliable as group members generally agreed on the scores given to their peers. In addition, most students differentiated among group member contributions through their ratings. Students also tended to rate themselves higher than their peers rated them. This study has implications for how peer and self evaluations can be most effectively used by educators to measure student performance in group work.  相似文献   

19.
20.
The literature contains indications of a bias in student evaluations of teaching (SET) against online instruction compared to face-to-face instruction. The present case study consists of content analysis of anonymous student responses to open-ended SET questions submitted by 534 students enrolled in 82 class sections taught by 41 instructors, one online and one face-to-face class section for each instructor. There was no significant difference in the proportion of appraisal text segments by delivery method, suggesting no delivery method bias existed. However, there were significant differences in the proportion of text segments for topical themes and topical categories by delivery method. Implications of the findings for research and practice are presented.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号