11 results found (search time: 31 ms)
1.
Rater training is an important part of developing and conducting large‐scale constructed‐response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to pass a calibration test before every scoring shift. To support the high‐stakes decisions made on the basis of rater certification tests, a psychometric approach for their development, analysis, and use is proposed. The circumstances and uses of these tests suggest that they are expected to have relatively low reliability. This expectation is supported by empirical data. Implications for the development and use of these tests to ensure their quality are discussed.
2.
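Israel has become one of the countries with the most developed venture capital industries in the world and has been called the "second Silicon Valley," even though it lacks natural resources and has long been caught up in armed conflict. A US$100 million government fund attracted more than US$4 billion in international capital; the development of Israel's venture capital industry has benefited from government support in strategy and policy.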
3.
Books in review     
DYNAMICS OF IDEALISM: WHITE ACTIVISTS IN A BLACK MOVEMENT. By N.J. Demerath, III, Gerald Marwell, and Michael T. Aiken. San Francisco: Jossey‐Bass Inc., Publishers, 1971. $9.75.

TOMS, COONS, MULATTOES, MAMMIES, AND BUCKS: AN INTERPRETIVE HISTORY OF BLACKS IN AMERICAN FILMS. By Donald Bogle. New York: The Viking Press, 1973. $12.50.

RADICALS AND MILITANTS: AN ANNOTATED BIBLIOGRAPHY OF EMPIRICAL RESEARCH ON CAMPUS UNREST. By Kenneth Keniston in collaboration with Mary‐Kay Duffield and Sharon Martinek. Lexington, Mass: D.C. Heath and Company, 1973. $15.00
4.
We show that using the point-biserial as a discrimination index for distractors by differentiating between examinees who chose the distractor and examinees who did not choose the distractor is theoretically wrong and may lead to an incorrect rejection of items. We propose an alternative usage and present empirical evidence for its suitability.
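To make the criticized index concrete, here is a minimal Python sketch of the point-biserial computed the conventional way (distractor choosers versus everyone else). The abstract does not spell out the proposed alternative, so the second function, which contrasts distractor choosers only with examinees who chose the keyed answer, is an illustrative assumption and not necessarily the authors' method. All function names and the toy data are hypothetical.

```python
import math

def point_biserial(scores, indicator):
    """Point-biserial correlation between a 0/1 indicator and numeric scores."""
    n = len(scores)
    n1 = sum(indicator)
    n0 = n - n1
    if n1 == 0 or n0 == 0:
        raise ValueError("both groups must contain at least one examinee")
    mean1 = sum(s for s, g in zip(scores, indicator) if g) / n1
    mean0 = sum(s for s, g in zip(scores, indicator) if not g) / n0
    grand = sum(scores) / n
    # Population standard deviation of all scores in the comparison
    sd = math.sqrt(sum((s - grand) ** 2 for s in scores) / n)
    if sd == 0:
        raise ValueError("scores have zero variance")
    return (mean1 - mean0) / sd * math.sqrt(n1 * n0) / n

def distractor_vs_rest(scores, choices, distractor):
    """Conventional usage: distractor choosers vs. all other examinees."""
    indicator = [1 if c == distractor else 0 for c in choices]
    return point_biserial(scores, indicator)

def distractor_vs_key(scores, choices, distractor, key):
    """Assumed alternative: distractor choosers vs. keyed-answer choosers only."""
    kept = [(s, c) for s, c in zip(scores, choices) if c in (distractor, key)]
    sub_scores = [s for s, _ in kept]
    indicator = [1 if c == distractor else 0 for _, c in kept]
    return point_biserial(sub_scores, indicator)

# Toy data (hypothetical): total test scores and the option each examinee chose.
scores = [10, 9, 8, 7, 6, 5, 4, 3]
choices = ["A", "A", "A", "B", "B", "C", "C", "C"]  # "A" is the keyed answer

print(distractor_vs_rest(scores, choices, "B"))      # 0.0: B looks non-discriminating
print(distractor_vs_key(scores, choices, "B", "A"))  # negative: B attracts weaker examinees than A
```

In this toy data, distractor B draws mid-ability examinees, so comparing them with everyone else (including the even weaker choosers of C) yields a correlation of zero, while the comparison against keyed-answer choosers is clearly negative. This illustrates how the conventional usage could lead to rejecting an item whose distractor is working.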
5.
Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor decision making, or the degree to which humans/proctors make different decisions about the same test-taking behaviors, can be problematic for both test takers and test users (e.g., universities). In this paper, we measure variability in proctor decision making over time on a high-stakes English language proficiency test. Our results show that (1) proctors systematically differ in their decision making and (2) these differences are trait-like (i.e., ranging from lenient to strict), but (3) systematic variability in decisions can be reduced. Based on these findings, we recommend that test security providers conduct regular measurements of proctors' judgments and take actions to reduce variability in proctor decision making.
6.
Some cognitive characteristics of graph comprehension items were studied, and a model comprising several variables was developed. A total of 132 graph items of the Psychometric Entrance Test were included in the study. By analyzing the actual difficulty of the items, an evaluation of the impact of the cognitive variables on item difficulties could be made. Results indicate that item difficulty can be predicted successfully on the basis of a wide range of item characteristics and task demands. This suggests that items can be screened for processing difficulty prior to being administered to examinees. However, the results also have implications for test validity in that the various processing variables identified involve distinct ability dimensions.
7.
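Israel has become one of the countries with the most developed venture capital industries in the world and has been called the "second Silicon Valley," even though it lacks natural resources and has long been caught up in armed conflict. A US$100 million government fund attracted more than US$4 billion in international capital; the development of Israel's venture capital industry has benefited from government support in strategy and policy. …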
8.
In this article, the authors show that test makers and test takers have a strong and systematic tendency for hiding correct answers (or, respectively, for seeking them) in middle positions. In single, isolated questions, both prefer middle positions to extreme ones in a ratio of up to 3 or 4 to 1. Because test makers routinely, deliberately, and excessively balance the answer key of operational tests, middle bias almost, though not quite, disappears in those keys. Examinees taking real tests also produce answer sequences that are more balanced than their single question tendencies but less balanced than the correct key. In a typical four-choice test, about 55% of erroneous answers are in the two central positions. The authors show that this bias is large enough to have real psychometric consequences, as questions with middle correct answers are easier and less discriminating than questions with extreme correct answers; some implications of this fact are explored.
9.
Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K–12 large-scale assessment. In this study, human and automated scores on essays written by college students with and without learning disabilities and/or attention deficit hyperactivity disorder were compared, using a nationwide (U.S.) sample of prospective graduate students taking the revised Graduate Record Examination. The findings are that, on average, human raters and the automated scoring engine assigned similar essay scores for all groups, despite average differences among groups with respect to essay length and spelling errors.
10.
Generalizability theory and analysis of variance methods are employed, together with the concept of objective time pressure, to estimate response time distributions and the degree of time pressure in timed tests. By estimating response time variance components due to person, item, and their interaction, and fixed effects due to item types and examinee time pressure, one can predict the distribution (mean and variance) of total response time for a population of examinees and a particular time limit. Furthermore, these variance components and fixed effects can be used in a simulation approach to estimate the distributions of time pressure during the test to help test developers evaluate the appropriateness of specific time limits. I present theoretical considerations and empirical results from two tests.
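The variance-component step described in this abstract can be sketched with the textbook ANOVA estimators for a fully crossed person × item design with one response time per cell. This is only a rough illustration under stated assumptions: it omits the fixed effects for item type and time pressure that the abstract mentions, it treats the items as a fixed set when predicting total time, and the function names and toy data are hypothetical rather than taken from the article.

```python
def variance_components(times):
    """Estimate person, item, and residual (interaction + error) variance
    components from a persons-by-items matrix of response times."""
    n_p = len(times)
    n_i = len(times[0])
    if n_p < 2 or n_i < 2:
        raise ValueError("need at least two persons and two items")
    grand = sum(sum(row) for row in times) / (n_p * n_i)
    person_means = [sum(row) / n_i for row in times]
    item_means = [sum(row[j] for row in times) / n_p for j in range(n_i)]

    # Mean squares from the two-way ANOVA without replication
    ms_p = n_i * sum((m - grand) ** 2 for m in person_means) / (n_p - 1)
    ms_i = n_p * sum((m - grand) ** 2 for m in item_means) / (n_i - 1)
    ss_res = sum(
        (times[p][j] - person_means[p] - item_means[j] + grand) ** 2
        for p in range(n_p)
        for j in range(n_i)
    )
    ms_res = ss_res / ((n_p - 1) * (n_i - 1))

    return {
        "grand_mean": grand,
        # Negative estimates are truncated at zero, a common convention
        "person": max(0.0, (ms_p - ms_res) / n_i),
        "item": max(0.0, (ms_i - ms_res) / n_p),
        "residual": ms_res,
    }

def total_time_moments(components, n_items):
    """Mean and variance of total response time across examinees,
    assuming the same fixed set of n_items items for everyone."""
    mean_total = n_items * components["grand_mean"]
    var_total = n_items ** 2 * components["person"] + n_items * components["residual"]
    return mean_total, var_total

# Toy response times in seconds (hypothetical): 3 examinees x 2 items.
times = [[10, 20], [12, 22], [14, 24]]
comps = variance_components(times)
print(comps)
print(total_time_moments(comps, n_items=2))
```

Given a candidate time limit, the predicted mean and variance of total time could then be compared with that limit, for example under a normal approximation, to gauge how many examinees would be pressed for time. That last step is an assumption about usage, not something the abstract specifies.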