
1.

Rater Certification Tests: A Psychometric Approach

Yigal Attali 《Educational Measurement》2019,38(2):6-13

Rater training is an important part of developing and conducting large‐scale constructed‐response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to pass a calibration test before every scoring shift. To support the high‐stakes decisions made on the basis of rater certification tests, a psychometric approach for their development, analysis, and use is proposed. The circumstances and uses of these tests suggest that they are expected to have relatively low reliability. This expectation is supported by empirical data. Implications for the development and use of these tests to ensure their quality are discussed.

2.

A 100 Million Government Fund Attracts 4 Billion in Capital: Israel's Experience in Leveraging Venture Capital Development

Erlich Yigal 《深圳特区科技》2005,(4):82-84

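Israel has become one of the countries with the most developed venture capital industries in the world and has been called the "second Silicon Valley," even though it lacks natural resources and has long been in a state of conflict. A government fund of USD 100 million attracted more than USD 4 billion in international capital; the development of Israel's venture capital industry has benefited from government support in strategy and policy.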

3.

Books in review

Jo‐Anne Jamison Howard S. Erlich Jerry Douglas 《Journal of Applied Communication Research》2013,41(1):45-47

DYNAMICS OF IDEALISM: WHITE ACTIVISTS IN A BLACK MOVEMENT. By N.J. Demerath, III, Gerald Marwell, and Michael T. Aiken. San Francisco: Jossey‐Bass Inc., Publishers, 1971. $9.75.

TOMS, COONS, MULATTOES, MAMMIES, AND BUCKS: AN INTERPRETIVE HISTORY OF BLACKS IN AMERICAN FILMS. By Donald Bogle. New York: The Viking Press, 1973. $12.50.

RADICALS AND MILITANTS: AN ANNOTATED BIBLIOGRAPHY OF EMPIRICAL RESEARCH ON CAMPUS UNREST. By Kenneth Keniston in collaboration with Mary‐Kay Duffield and Sharon Martinek. Lexington, Mass: D.C. Heath and Company, 1973. $15.00.

4.

The Point-Biserial as a Discrimination Index for Distractors in Multiple-Choice Items: Deficiencies in Usage and an Alternative

Yigal Attali Tamar Fraenkel 《Journal of Educational Measurement》2000,37(1):77-86

We show that using the point-biserial as a discrimination index for distractors by differentiating between examinees who chose the distractor and examinees who did not choose the distractor is theoretically wrong and may lead to an incorrect rejection of items. We propose an alternative usage and present empirical evidence for its suitability.

5.

Measuring Variability in Proctor Decision Making on High-Stakes Assessments: Improving Test Security in the Digital Age

William Belzak J. R. Lockwood Yigal Attali 《Educational Measurement》2024,43(1):52-65

Remote proctoring, or monitoring test takers through internet-based, video-recording software, has become critical for maintaining test security on high-stakes assessments. The main role of remote proctors is to make judgments about test takers' behaviors and decide whether these behaviors constitute rule violations. Variability in proctor decision making, or the degree to which humans/proctors make different decisions about the same test-taking behaviors, can be problematic for both test takers and test users (e.g., universities). In this paper, we measure variability in proctor decision making over time on a high-stakes English language proficiency test. Our results show that (1) proctors systematically differ in their decision making and (2) these differences are trait-like (i.e., ranging from lenient to strict), but (3) systematic variability in decisions can be reduced. Based on these findings, we recommend that test security providers conduct regular measurements of proctors’ judgments and take actions to reduce variability in proctor decision making.

6.

The Effects of Component Variables on Performance in Graph Comprehension Tests

Yigal Attali Chanan Goldschmidt 《Journal of Educational Measurement》1996,33(1):93-105

Some cognitive characteristics of graph comprehension items were studied, and a model comprised of several variables was developed. 132 graph items of the Psychometric Entrance Test were included in the study. By analyzing the actual difficulty of the items, an evaluation of the impact of the cognitive variables on item difficulties could be made. Results indicate that successful prediction of item difficulty can be calculated on the basis of a wide range of item characteristics and task demands. This suggests that items can be screened for processing difficulty prior to being administered to examinees. However, the results also have implications for test validity in that the various processing variables identified involve distinct ability dimensions.

7.

A 100 Million Government Fund Attracts 4 Billion in Capital: Israel's Experience in Leveraging Venture Capital Development

Erlich Yigal 《深圳特区科技》2005,(5)

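Israel has become one of the countries with the most developed venture capital industries in the world and has been called the "second Silicon Valley," even though it lacks natural resources and has long been in a state of conflict. A government fund of USD 100 million attracted more than USD 4 billion in international capital; the development of Israel's venture capital industry has benefited from government support in strategy and policy. …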

8.

Guess Where: The Position of Correct Answers in Multiple-Choice Test Items as a Psychometric Variable

Yigal Attali Maya Bar-Hillel 《Journal of Educational Measurement》2003,40(2):109-128

In this article, the authors show that test makers and test takers have a strong and systematic tendency to hide correct answers (or, respectively, to seek them) in middle positions. In single, isolated questions, both prefer middle positions to extreme ones in a ratio of up to 3 or 4 to 1. Because test makers routinely, deliberately, and excessively balance the answer key of operational tests, middle bias almost, though not quite, disappears in those keys. Examinees taking real tests also produce answer sequences that are more balanced than their single question tendencies but less balanced than the correct key. In a typical four-choice test, about 55% of erroneous answers are in the two central positions. The authors show that this bias is large enough to have real psychometric consequences, as questions with middle correct answers are easier and less discriminating than questions with extreme correct answers, a fact of which some implications are explored.

9.

Comparing Human and Automated Essay Scoring for Prospective Graduate Students With Learning Disabilities and/or ADHD

Heather Buzick Maria Elena Oliveri Yigal Attali Michael Flor 《Applied Measurement in Education》2013,26(3):161-172

ABSTRACT

Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K–12 large-scale assessment. In this study, human and automated scores on essays written by college students with and without learning disabilities and/or attention deficit hyperactivity disorder were compared, using a nationwide (U.S.) sample of prospective graduate students taking the revised Graduate Record Examination. The findings are that, on average, human raters and the automated scoring engine assigned similar essay scores for all groups, despite average differences among groups with respect to essay length and spelling errors.

10.

An Analysis of Variance Approach for the Estimation of Response Time Distributions in Tests

Yigal Attali 《Journal of Educational Measurement》2010,47(4):458-470

Generalizability theory and analysis of variance methods are employed, together with the concept of objective time pressure, to estimate response time distributions and the degree of time pressure in timed tests. By estimating response time variance components due to person, item, and their interaction, and fixed effects due to item types and examinee time pressure, one can predict the distribution (mean and variance) of total response time for a population of examinees and a particular time limit. Furthermore, these variance components and fixed effects can be used in a simulation approach to estimate the distributions of time pressure during the test to help test developers evaluate the appropriateness of specific time limits. I present theoretical considerations and empirical results from two tests.