共查询到20条相似文献,搜索用时 46 毫秒
1.
网上阅卷是近年来兴起的利用现代技术控制主观题评分误差的方法,其在作文评分中误差控制的效果十分明显。网上阅卷主要通过评卷员之间一致性误差控制、评卷员本人一致性误差控制、两评的误差控制、评分点之间的误差控制、抽查监控等5种方法来实现评分误差控制,同时通过机控系统实施评分误差控制管理。随着研究的深入与技术的发展,将有可能实现基于互联网的高考作文评卷和计算机自动评卷,以进一步实现评卷误差控制。 相似文献
2.
章晓英 《中学数学教学参考》2010,(6):46-49
1阅卷概况笔者作为中学教师参加了2009年浙江省高考数学网上阅卷的评卷工作.高考评卷分为试评和正式评卷两个步骤.首先是试评,先由每道题的组长(浙江大学数学系教授)讲解该题可能出现的解法及每一种解法的评分细则,然后由每道题的阅卷者进行网上试评.每一份试卷的每一道题最少要经过两个人背对背批改.在此过程中阅卷者需达到两个统一: 相似文献
3.
4.
5.
6.
《山西教育(综合版)》2009,(2):23-23
上海市部分区县的期末考试出现新动向,纷纷试行“网上阅卷”,“拷贝”中高考的评卷模式。例如,语文作文采取“一卷两阅”,若两位阅卷老师给作文的评分相差较大,那么还需要进行“三判”。如果两位老师给的分数差是在4分以内,那系统就将自动算出平均分作为该学生的作文成绩。如果评分差超过4分,那就成为‘问题卷’,交由第三位老师仲裁。总负责人还可以随时对批改情况进行“进度监控和质量监控”。 相似文献
7.
广西高考从2005年起进行无纸化阅卷,将考生的答题卡进行扫描,客观题由计算机阅卷判定分数,主观题采用网上人工阅卷判定分数,无论采用人工阅卷还是计算机阅卷方式,考生主观题的得分都是评卷教师判定的,评卷教师评卷时基本上都采用“踩点评分”的评分方式,即考生主观题的解答过程中,采用了哪些合理的知识点,按照正确的逻辑推理,进行必要的运算,得到正确的结论,评卷教师对考生答卷上的每一个正确的过程进行评判给分.考生在答卷时若能按照解题过程规范化答题,才有利于阅卷教师公正判分,减少不必要的失分. 相似文献
8.
9.
随着6月28日上海高考发榜,2008年的高考阅卷工作也顺利落下了帷幕。据介绍,上海高考阅卷目前有三种模式并存:机器阅卷,由机器对涂点的答题卡进行识别;网上阅卷,即答卷被扫描到网上后进行阅卷,电脑实时监控,今年采用这一方式的科目增加到语文、政治、历史三门;传统的人工阅卷。 相似文献
10.
论河北省高考英语网上阅卷的优势 总被引:1,自引:0,他引:1
2005年6月河北省高考英语首次实行网上阅卷。网上阅卷有着传统阅卷方式无法比拟的优势。利用现代高新技术,网上阅卷能够严格控制主观题评分误差,保证阅卷质量和效率,最大程度地体现高考的公平、公正原则。 相似文献
11.
12.
What are the validity issues involved in automated scoring of tests? What is the nature of the interplay among construct definition, task design, examinee interface, tutorial, test development tools, and automated scoring and reporting? 相似文献
13.
Kadriye Ercikan 《Educational Measurement》2002,21(2):8-14
Multiple scoring is widely used in large-scale assessments. The use of a single response for making multiple inferences as is done in multiple scoring has implications on the validity of these inferences and interpretations based on assessment results. The purpose of this article is to review two types of multiple scoring practices and discuss how multiple scoring affects inferences. 相似文献
14.
张红霞 《安徽广播电视大学学报》2003,(3):47-51
TEM4听写采用的是较传统的数错扣分法。数错扣分法是负分法,其中存在一些问题。因此我们提出一种实验性的评分方法——部分得分制。实验数据有两组,分别采用TEM4听写评分制和新评分制。数据比较以及部分得分模型(Rasch模型之一)对实验量表效能的分析(如模型与数据拟合值、被试拟合值、信息函数等)说明,实验评分制能较好地测量大多数学生的听写水平。 相似文献
15.
《教育实用测度》2013,26(4):345-358
Performance assessments typically are scored by having experts rate individual performances. In contexts such as medical licensure, where the examinee population is large and the pool of expert raters is limited practically, this approach may be unworkable. This article describes an automated scoring algorithm for a computer simulation-based examination of physicians' patient-management skills. The algorithm is based on the policy used by clinicians in rating case performances. The results show that scores produced using this algorithm are highly correlated to actual clinician ratings. These scores also are shown to be effective in discriminating between case performance judged to be passing or failing by an independent group of clinicians. 相似文献
16.
17.
18.
19.
选取MHK口语试题实测数据,运用多面Rasch模型的理论和方法对人工评分与计算机自动评分的严厉程度、评分员内部以及各评分员之间的一致性程度进行研究,分析了两种评分方式在严厉度、一致性等方面存在的具体差异,以及不同试题之间是否存在难度差异,希望据此为提高MHK阅卷的科学化水平及命题质量提供依据和建议。 相似文献
20.
This study pioneers a Rasch scoring approach and compares it to a conventional summative approach for measuring longitudinal gains in student learning. In this methodological note, our proposed methodology is demonstrated using an example of rating scales in a student survey as part of a higher education outcome assessment. Such assessments have become increasingly important worldwide for purposes of institutional accreditation and accountability to stakeholders. Data were collected from a longitudinal study by tracking self-reported learning outcomes of individual students in the same cohort who completed the student learning experience questionnaire (SLEQ) in their first and final years. Rasch model was employed for item calibration and latent trait estimation, together with a scaling procedure of concurrent calibration incorporating a randomly equivalent group design and a single group design to measure the gains in self-reported learning outcomes as yielded by repeated measures. The extent to which Rasch scoring compared to the conventional summative scoring method in its sensitivity to change was quantified by a statistical index namely relative performance (RP). Findings indicated greater ability to capture learning outcomes gains from Rasch scoring over the conventional summative scoring method, with RP values ranging from 3 to 17% in the cognitive, social, and value domains of the SLEQ. The Rasch scoring approach and the scaling procedure presented in the study can be readily generalised to studies using rating scales to measure change in student learning in the higher education context. The methodological innovations and contributions of this study are discussed. 相似文献