
1.

Assessing statistical aspects of test fairness with structural equation modelling

Rex B. Kline 《Educational Research and Evaluation》2013,19(2-3):204-222

Test fairness and test bias are not synonymous concepts. Test bias refers to statistical evidence that the psychometrics or interpretation of test scores depend on group membership, such as gender or race, when such differences are not expected. A test that is grossly biased may be judged to be unfair, but test fairness concerns the broader, more subjective evaluation of assessment outcomes from perspectives of social justice. Thus, the determination of test fairness is not solely a matter of statistics, but statistical evidence is important when evaluating test fairness. This work introduces the use of the structural equation modelling technique of multiple-group confirmatory factor analysis (MGCFA) to evaluate hypotheses of measurement invariance, or whether a set of observed variables measures the same factors with the same precision over different populations. An example of testing for measurement invariance with MGCFA in an actual, downloadable data set is also demonstrated.

2.

Elementary Teachers' Knowledge and Self-Efficacy for Measurement Concepts

Chad M. Gotch, Brian F. French 《The Teacher Educator》2013,48(1):46-57

Educational standardized testing impacts millions of children and educational professionals each year. In the current accountability climate, an effective educational system depends on professionals who are literate in assessment and can take the appropriate actions in response to test results. Measurement researchers should begin to focus more attention on how teachers use assessment results, what skills teachers possess, and what teachers believe they can do in working with test results. This study examined elementary teacher knowledge and self-efficacy in measurement concepts through a random sample of teachers in the state of Washington. Teachers had greater success with skills related to basic measurement concepts compared to using test scores for informed decisions. No relationship was found between years of teaching and measurement knowledge or self-efficacy. However, teachers showing interest in resources for communicating test results to parents had lower self-efficacy compared to teachers not interested in resources.

3.

Teaching for the Test: Validity, Fairness, and Moral Action (Cited by: 1; self-citations: 0; citations by others: 1)

Linda Crocker 《Educational Measurement》2003,22(3):5-11

In response to heightened levels of assessment activity at the K-12 level to meet requirements of the No Child Left Behind Act of 2001, measurement professionals are called to focus greater attention on four fundamental areas of measurement research and practice: (a) improving the research infrastructure for validation methods involving judgments of test content; (b) expanding the psychometric definition of fairness in achievement testing; (c) developing guidelines for validation studies of test use consequences; and (d) preparing teachers for new roles in instruction and assessment practice. Illustrative strategies for accomplishing these goals are outlined.

4.

Does gender-specific differential item functioning affect the structure in vocational interest inventories?

Andrea Beinicke, Katja Pässler, Benedikt Hell 《International Journal for Educational and Vocational Guidance》2014,14(2):181-198

The study investigates consequences of eliminating items showing gender-specific differential item functioning (DIF) on the psychometric structure of a standard RIASEC interest inventory. Holland’s hexagonal model was tested for structural invariance using a confirmatory methodological approach (confirmatory factor analysis and randomization tests of hypothesized order relations). Results suggest that eliminating items showing gender-specific DIF had no considerable influence on the instrument’s psychometric structure. Considering DIF as one possibility to improve test fairness when developing interest inventories is discussed.

5.

School principals’ leadership skills: measurement equivalence across cultures

Rima’a Da’as 《Compare》2017,47(2):207-222

Despite substantial interest and research in measuring leaders’ skills, little is known about the measurement equivalence and mean differences in the scores measuring principals’ skills (cognitive, interpersonal, strategic) across cultures (collectivism versus individualism). The aim of the present study was to assess measurement equivalence – configural, metric and scalar – on leaders’ skills across Arab and Jewish teachers in the Israeli educational system. A total of 1388 teachers from 210 elementary schools responded to a skills questionnaire. Results indicated that the configural model is equivalent across samples. The test for metric equivalence showed that the construct holds the same psychological meaning across the two samples, with the exception of two items. The test of scalar equivalence (i.e., of item intercepts) showed unequal intercepts between the Arab and Jewish samples in the strategic and cognitive skills scales. The results have implications for cross-ethnic research and, more broadly, for the assessment of principals’ skills.

6.

Inconsistencies in DIF Detection for Sub-Groups in Heterogeneous Language Groups

Kadriye Ercikan, Wolff-Michael Roth, Marielle Simon, Debra Sandilands, Juliette Lyons-Thomas 《Applied Measurement in Education》2014,27(4):273-285

Diversity and heterogeneity among language groups have been well documented. Yet most fairness research that focuses on measurement comparability considers linguistic minority students such as English language learners (ELLs) or Francophone students living in minority contexts in Canada as a single group. Our focus in this research is to examine the degree to which measurement comparability, as indicated by differential item functioning (DIF), is consistent for sub-groups among linguistic minority Francophone students in Canada. The findings suggest that the linguistic minority Francophone students who speak French at home and those who do not speak French at home should not be grouped together for investigating measurement comparability or for examining performance gaps. We found large differences in DIF identification, with a consistency of 7–10% between the separate analyses for the two groups. The findings highlight methodological problems with investigating fairness for diverse linguistic groups that are treated as a single group.

7.

Some Methodological Issues with “Draw a Scientist Tests” among Young Children

Susan C. Losh, Ryan Wilke, Margareta Pop 《International Journal of Science Education》2013,35(6):773-792

Children’s stereotypes about scientists have been postulated to affect student science identity and interest in science. Findings from prior studies using “Draw a Scientist Test” methods suggest that students see scientists as largely white, often unattractive, men; one consequence may be that girls and minority students feel a science career is “not like me”. However, a major shortcoming in prior research is that scholars have asked children to draw only scientists, thus making interpretations of earlier research findings ambiguous. We added other professionals to compare how 616 drawings of teachers, scientists, and veterinarians by 206 elementary school children varied by student gender, ethnicity, and grade. Students made clear distinctions: drawing teachers as most attractive and largely female, and scientists as most often male and least attractive. Aspects of the drawings suggest that scientists do have an “image problem” among children. However, large sex differences in the drawings and often‐unrecognizable gender figures in boys’ pictures lead us to question use of the “Draw a Scientist Test” as a projective test among young children.

8.

Effort Analysis: Individual Score Validation of Achievement Test Data

Steven L. Wise 《Applied Measurement in Education》2015,28(3):237-252

Whenever the purpose of measurement is to inform an inference about a student’s achievement level, it is important that we be able to trust that the student’s test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student’s test event is not unduly influenced by construct-irrelevant factors that could distort his score. This article examines one such factor—test-taking motivation—that tends to induce a person-specific, systematic negative bias on test scores. Because current measurement models underlying achievement testing assume students respond effortfully to test items, it is important to identify test scores that have been materially distorted by non-effortful test taking. A method for conducting effort-related individual score validation is presented, and it is recommended that measurement professionals have a responsibility to identify invalid scores to individuals who make inferences about student achievement on the basis of those scores.

9.

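The Evolving Values of Fairness in China's College Entrance Examination (Gaokao) Policy since the Founding of the People's Republic

王新凤 《教育学报》2021,(1)

Since the founding of the People's Republic of China, the fairness of China's college entrance examination (gaokao) policy has passed through distinct stages. In the early years of the People's Republic, university admissions gave priority to safeguarding the right to education of the children of workers and peasants and emphasized assessment of candidates' political quality. After the gaokao was restored in 1977, admissions stressed efficiency first and merit-based selection, and fairness in gaokao policy took the form mainly of procedural fairness, expressed as "everyone is equal before the scores". Since the late 1990s, fairness and justice have become the primary value orientation of university admissions in China, with greater emphasis on promoting substantive fairness through multiple channels on the basis of coordinated development, and on compensating disadvantaged groups. These shifts in the values underlying gaokao fairness are closely tied to changes in social structure and in the political strategies and ideas of the Party and the state in different historical periods. Gaokao reform in the new era should continue to implement a people-centred view of educational equity, promote students' all-round development, advance urban-rural and regional fairness in access to higher education guided by shared development, build an educational governance structure in which diverse interest groups participate, and safeguard procedural fairness.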

10.

A Historical Perspective and Current Views on the Standards for Educational and Psychological Testing (Cited by: 1; self-citations: 0; citations by others: 1)

Wayne J. Camara, Suzanne Lane 《Educational Measurement》2006,25(3):35-41

The Standards for Educational and Psychological Testing have evolved in the breadth and depth of coverage of issues in educational testing and measurement since their first publication in 1954. There were a number of substantive changes in the 1999 revision that addressed validity, fairness, accommodations, and compliance with the Standards. In addition, there was nearly a 50% increase in the number of standards contained in the last revision. The next revision of the Standards may be initiated in 2007 and there are remaining concerns about access and awareness by non-measurement professionals, compliance by test publishers and users, relevance in addressing mandates for accountability, and substantive areas of educational assessment. This review of major changes to the Standards and discussion of future topics is designed to inform the next revision.

11.

The quest for fairness in language testing

Hossein Karami 《Educational Research and Evaluation》2013,19(2-3):158-169

The search for fairness in language testing is distinct from other areas of educational measurement as the object of measurement, that is, language, is part of the identity of the test takers. So, a host of issues enter the scene when one starts to reflect on how to assess people's language abilities. As the quest for fairness in language testing is still in its infancy, even the need for such research has been controversial, with some (e.g., Davies, 2010) arguing that such research is entirely in vain. This paper will provide an overview of some of the issues involved. Special attention will be given to critical language testing (CLT) as it has had a large impact on language testing research. It will be argued that although CLT has been very effective in revealing the ideological and value implications of the constructs of focus in language testing, extremism in this direction is not justified.

12.

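Characteristics and Trends of International Standards for Quality and Fairness in Educational Testing: A Coding Analysis Based on NVivo 11.0

王后雄, 杨季冬 《现代教育管理》2020,(3):45-51

National educational examinations are the main means of selecting talent. Europe and the United States attach great importance to research on the quality and fairness of educational testing and have produced many influential testing standards, which are an important safeguard of fairness and quality in educational testing. This study selected six international educational testing standards: the standards jointly developed by the American Psychological Association, the American Educational Research Association, and the National Council on Measurement in Education; the standards developed by the Educational Testing Service; two standards developed by a European testing association; and two standards developed by a European international language testing association. Coding analysis identified four shared international characteristics: (1) an emphasis on evidence-based decisions and operations in testing and measurement; (2) an emphasis on valid interpretation of test scores; (3) fairness and quality as the value orientations with the highest consensus; and (4) no obvious localized features in the standards. The study also identified three trends: (1) standards for test quality and fairness are mostly developed under the leadership of testing associations; (2) they cover both principles of test fairness and codes of testing conduct; and (3) all testing stakeholders share collaborative responsibility for upholding them. Educational testing plays a pivotal role in China, and these characteristics and trends provide a theoretical basis and technical support for developing China's own standards for test quality and fairness, thereby safeguarding the fairness and quality of examinations.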

13.

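On the Quality of Written Test Questions in Current University Independent Admissions

林丽燕 《教育与考试》2013,(4):24-29

The written examination stage of universities' independent admissions attempts to replace the established model of uniform test questions and standardized answers with individualized questions and flexible scoring standards. It plays a vital role in improving the talent selection mechanism, and so safeguarding the quality of written test questions has been put on the agenda. In the face of public doubts about the content, function, measurement dimensions, and fairness of the questions, question design should take scientific soundness and reliability as its guiding principles, pursue depth and breadth in content, reflect multiple measurement dimensions, and treat the principle of fairness as a basic condition. Only then can a path be found to improving the quality of independent admissions test questions.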

14.

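A Look at Fairness Review in Test Development: The Case of ETS in the United States

陈吉 《现代教育论丛》2011,(3)

Test fairness has many different definitions; defining it in terms of validity is the most useful for test developers. To develop a fair test, a fairness review of the test items is an indispensable step. To make the review process less subjective, reviewers should follow a set of review principles. In addition, to better resolve problems that arise during fairness review, a standardized review procedure should be established. ETS has accumulated rich experience in the fairness review of tests, much of which is worth drawing on.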

15.

Fairness in Measurement and Selection: Statistical, Philosophical, and Public Perspectives

Rebecca Zwick 《Educational Measurement》2019,38(4):34-41

Selection decisions have a major impact on our education, occupation, and quality of life, and the role of standardized tests in selection has always been a source of controversy. Here, I consider various definitions of fairness in measurement and selection—those emerging from within educational measurement and statistics, those from philosophy, and finally, those from the public. I use examples of public challenges to selection practices to illustrate the fact that technical and philosophical definitions of fairness do not align well with public concerns. I emphasize the importance of promoting awareness of existing standards, advocating for the fair use of testing and selection practices, and communicating in a candid and straightforward way when engaging with test takers and test users.

16.

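On Cultivating the Learning Interest of Geology Undergraduates under the New Circumstances (Cited by: 2; self-citations: 1; citations by others: 1)

陈勇, 周瑶琪, 陈世悦 《中国地质教育》2008,17(3):23-25

The current market economy has influenced the career choices of geology graduates, leading to a shortage of geology professionals in some fields, while the new situation of tight energy and resource supplies offers them broad employment opportunities and room for development. After discussing the importance of learning interest, this paper argues that the learning interest of geology undergraduates should be cultivated in five ways: deepening students' understanding of the discipline, strengthening their confidence about employment, using scientific questions to arouse their curiosity, strengthening their sense of social responsibility, and sustaining their interest in learning.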

17.

Young children's motivational beliefs about learning science

Panayota Mantzicopoulos, Helen Patrick, Ala Samarapungavan 《Early childhood research quarterly》2008

For learning science, motivational beliefs such as confidence in one's science abilities and liking of science are associated with current and future science achievement, as well as continued interest in science classes and careers. However, there are currently no measures to test young children's motivational beliefs related to science learning. To meet this need, we developed the Puppet Interview Scales of Competence in and Enjoyment of Science (PISCES). We piloted PISCES with 113 kindergarten children in public schools participating in the Scientific Literacy Project (SLP). Factor analysis supported the multidimensional structure of young children's self-related beliefs about learning science. PISCES scales measured Science Liking, Science Competence, and Ease of Science Learning. Correlations among PISCES scales and achievement subtests provided evidence of PISCES's validity. Children's motivational beliefs varied as a function of length of time spent learning science, with competence beliefs associated positively with science experience. There were no gender differences.

18.

Structural validity and invariance of the Feedback Perceptions Questionnaire

《Studies in Educational Evaluation》2021

Despite a growing interest in instructional feedback, students’ feedback perceptions have received limited attention. We examined the structural validity and measurement invariance of the Feedback Perceptions Questionnaire (FPQ). The FPQ measures feedback perceptions in terms of perceived fairness, usefulness, acceptance, willingness to improve, and affect. Secondary school students (N = 1486) received a fictional scenario containing Concise General Feedback or Elaborated Specific Feedback by a fictional peer. Students rated their perceptions as if they had received the feedback themselves. Confirmatory Factor Analysis (CFA) supports the structural validity of the FPQ and its invariance for the two types of peer feedback, gender, four grade-levels and two tracks. Perceived fairness of peer feedback was a strong positive predictor of willingness to improve and affect, whereas perceived usefulness and acceptance of peer feedback showed a more complex pattern in predicting willingness to improve and affect.

19.

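On the Educational Aid Effects of Subsidies for National Student Loans

季俊杰, 沈红 《教育学术月刊》2008,(8):53-56

National student loans currently carry six forms of subsidy: interest subsidies, repayment on the borrower's behalf, interest-rate subsidies, risk compensation funds, pre-tax deductions, and exemption from business tax. By mode of subsidy, they fall into four types: explicit direct, implicit direct, explicit indirect, and implicit indirect subsidies. Explicit direct subsidies allow more students to receive aid and lighten their repayment burden, giving the best aid effect. Implicit direct subsidies and indirect subsidies perform poorly: the former reduce the number of aided students and raise students' borrowing costs, while the latter lead lending institutions to stop lending and to behave opportunistically. The government should therefore allow student loan interest rates to float and convert indirect subsidies into interest subsidies, so as to improve the aid effect and promote educational equity.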

20.

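Reconciling Efficiency and Fairness: From Idea to Practice

戴木才 《江西师范大学学报(哲学社会科学版)》2010,43(5):37-45,88

Correctly handling the dialectical relationship between efficiency and fairness is not only a theoretical question involving economic policy and the income distribution system, but also a theoretical question involving the development strategy, political order, social stability, and value pursuits of China's economy and society, and a major practical question concerning the direction and nature of China's social reform. Balancing attention to efficiency with the protection of fairness, so that the two are organically unified, is an important goal and an important pillar of building a harmonious socialist society. In practice, China has gradually worked out a path to building a harmonious socialist society that both values efficiency and upholds fairness.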