首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The purpose of this study is to apply the attribute hierarchy method (AHM) to a subset of SAT critical reading items and illustrate how the method can be used to promote cognitive diagnostic inferences. The AHM is a psychometric procedure for classifying examinees’ test item responses into a set of attribute mastery patterns associated with different components from a cognitive model. The study was conducted in two steps. In step 1, three cognitive models were developed by reviewing selected literature in reading comprehension as well as research related to SAT Critical Reading. Then, the cognitive models were validated by having a sample of students think aloud as they solved each item. In step 2, psychometric analyses were conducted on the SAT critical reading cognitive models by evaluating the model‐data fit between the expected and observed response patterns produced from two random samples of 2,000 examinees who wrote the items. The model that provided best data‐model fit was then used to calculate attribute probabilities for 15 examinees to illustrate our diagnostic testing procedure.  相似文献   

2.
The development of cognitive diagnostic‐computerized adaptive testing (CD‐CAT) has provided a new perspective for gaining information about examinees' mastery on a set of cognitive attributes. This study proposes a new item selection method within the framework of dual‐objective CD‐CAT that simultaneously addresses examinees' attribute mastery status and overall test performance. The new procedure is based on the Jensen‐Shannon (JS) divergence, a symmetrized version of the Kullback‐Leibler divergence. We show that the JS divergence resolves the noncomparability problem of the dual information index and has close relationships with Shannon entropy, mutual information, and Fisher information. The performance of the JS divergence is evaluated in simulation studies in comparison with the methods available in the literature. Results suggest that the JS divergence achieves parallel or more precise recovery of latent trait variables compared to the existing methods and maintains practical advantages in computation and item pool usage.  相似文献   

3.
Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input “and” gate (DINA) model. The new indices include an item‐level expected classification accuracy (ECA) for attributes and a test‐level ECA for attributes and attribute patterns, and both of them are calculated based solely on the known item parameters and Q ‐matrix. Theoretical analysis showed that the item‐level ECA could be regarded as a measure of correct classification rates of attributes contributed by an item. This article also illustrates how to apply the item‐level ECA for attributes to estimate the correct classification rate of attributes patterns at the test level. Simulation results showed that two test‐level ECA indices, ECA_I_W (an index based on the independence assumption and the weighted sum of the item‐level ECAs) and ECA_C_M (an index based on Gaussian Copula function that incorporates the dependence structure of the events of attribute classification and the simple average of the item‐level ECAs), could make an accurate prediction for correct classification rates of attribute patterns.  相似文献   

4.
This paper examines the reasons why long‐serving teachers remain in the teaching profession. Interest in teacher retention has grown in recent years, both in the UK and internationally, due to concerns over teacher shortage. However, most research on retention has focused on why teachers leave; this paper aims to fill the gap in our understanding of the positive reasons why long‐serving teachers stay in the profession, and how these reasons change over time. We define ‘long‐serving teachers’ as teachers who have taught for 10 years and more. We draw on a subset of data from an existing, broader study (Menzies et al., 2015 ) on why teachers enter and stay in the profession. In this paper, we draw on questionnaire findings from over 900 teachers with 0 to over 30 years’ teaching experience, and interviews with 14 long‐serving teachers, to understand why long‐serving teachers enter and, more importantly for our purposes, stay in teaching. We find that teachers’ motivational patterns are highly complex and influenced by school‐level and policy contexts. Nonetheless, two prominent retention factors are identified: teachers’ perceived professional mastery and altruistic reasons. Perceived professional mastery is particularly important due to its mutually reinforcing analytic relationships with other reasons. We find that teachers’ identification with intrinsic, altruistic and perceived professional mastery reasons become stronger with years of experience, but in some cases, paradoxically, so does their identification with extrinsic reasons. From our evidence, we suggest policy implications for enhancing the retention of long‐serving teachers.  相似文献   

5.
Some argue that the goal of education is to influence out‐of‐school learning activity, yet little research exists on how teachers can help students develop an interest in a topic and continue to pursue that interest outside of school. The current study tested classroom context variables from self‐determination theory (teachers’ autonomy support) and from achievement goal theory (teachers’ mastery goals) that may predict students’ school‐prompted interest above and beyond students’ mastery goals and self‐efficacy. Survey data were collected from 178 high school students in 15 mathematics classes. Hierarchical multiple regression revealed that students’ perceptions of a teacher who focuses on mastery goals account for little variance above and beyond students’ own motivation, whereas perceived teachers’ autonomy support emerged as a significant predictor. While the results are correlational and do not support causal claims, they suggest that teachers who support student autonomy might foster school‐prompted interest.  相似文献   

6.
Achievement goal theory is an important theoretical framework for understanding achievement motivation. In previous studies, a mastery orientation has been shown to be related to students’ interest, while a performance orientation has been found to be predictive of academic performance outcomes such as course grades. In this study, the two mastery sub‐scores from the Multiple Goals Theory Measure (MGTM) of academic motivation, which was developed specifically from achievement goal theory, was found to be predictive of college grades for a sample of 257 undergraduates at a public university in the north‐eastern United States. Additionally, the results support a trichotomous model of achievement orientations comprising mastery approach, performance approach, and avoidance. The MGTM appears to hold promise as a diagnostic tool, but additional research is required on its resistance to faking and other threats to validity.  相似文献   

7.
Recent research anchored in achievement goal theory suggests mastery goals are more adaptive when endorsed for autonomous rather than controlled reasons. We report on two studies (N = 622) in which we explored whether the combined effects of goals and reasons on academic outcomes were different for a sample of low-SES youth than for other older higher-SES samples in the literature. Participants were low-SES high-school students in Lima, Peru. The results show that autonomous reasons for endorsing mastery goals positively predicted students' collective engagement and mathematics grades above the effect of mastery goals as such. Second, controlled reasons negatively predicted end-of-the year math grades. Finally, mastery goals’ relations with mathematics grades and behavioral engagement were attenuated when endorsed for low autonomous reasons. The findings extend the knowledge on mastery goal-complexes and show they apply to low-SES students.  相似文献   

8.
The purpose of the study was to test the effectiveness of an intervention to reduce aggressive behavior among children and adolescents. The study encompassed 70 special education students with behavioral problems in 10 schools (grades 5‐9) in Israel. These students were identified by teachers as aggressive children, based on a 10‐item questionnaire that referred to verbal and physical aggression. The children were randomly divided into experimental and control conditions. A short‐term multidimensional program utilizing bibliotherapy and clarifying processes was introduced to the experimental students, while the control students remained in their homeroom groups. The Achenbach Self‐Report and Teacher Report scales were administered to the students before and after treatment. Results indicated significant gains on both measures: The intervention was effective in promoting adjusting behavior and particularly in reducing aggression. © 2000 John Wiley & Sons, Inc.  相似文献   

9.
Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern‐level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet been constructed. This study puts forward a simple approach to estimating the indices at both the attribute and the pattern level through one single test administration. Detailed elaboration is made on how the upper and lower bounds for the attribute‐level accuracy can be derived from the variance of error of the attribute mastery probability estimate. In addition, based on Cui's pattern‐level indices, an alternative approach to estimating the attribute‐level indices is also proposed. Comparative analysis of simulation results indicate that the new indices are very desirable for evaluating test‐retest consistency and correct classification rate.  相似文献   

10.
认知诊断通过分析被试的项目作答反应,推断被试的认知属性掌握状态,为学习困难学生设计补救教学提供了非常有价值的信息。本文作者在探讨了小学生多位数乘法计算能力的认知属性、编制了2份相同考核模式的认知诊断测验后,选择江西某小学310名高年级学生为被试,先施测第1份认知诊断测验,采用DINA模型,自编参数估计程序进行诊断,得到了每一个被试的属性掌握模式分类及全体被试在各个属性上的掌握情况。然后设计和实施补救教学,在实施补救教学后再施测第2份认知诊断测验以检验补救效果。研究发现:(1)该小学高年级学生对0XN运算法则、多位数乘以两位数的运算程序、乘法进位认知属性的掌握不太理想,特别是乘法进位。(2)属性掌握模式中属全部掌握模式的被试人数占86.47%,其余被试均分类于存在各种认知不足的掌握模式。(3)比较两份认知诊断测验报告,结果表明在认知诊断指导下的补救教学有针对性,补救后被试正确作答项目增多,属性掌握个数也有所增加,补救效果良好。  相似文献   

11.
This study aimed to develop and validate an instrument to assess the multidimensional nature of teacher self‐regulation. A nine‐factor structure was proposed: goal setting, intrinsic interest, performance goal orientation, mastery goal orientation, self‐instruction, emotional control, self‐evaluation, self‐reaction, and help‐seeking. Through a series of confirmatory factor analyses with different samples, this nine‐factor structure was supported. Reliability estimates were also satisfactory. Further validation evidence was provided through canonical correlation analysis between teacher self‐regulation and teachers’ sense of efficacy. Results indicated that the Teacher Self‐Regulation Scale can be utilised as a valid and reliable instrument to assess teachers’ self‐regulatory strategies.  相似文献   

12.
Researchers have developed indices to identify persons whose test results ‘misfit’ and are considered statistically ‘aberrant’ or ‘unexpected’ and whose measures are consequently potentially invalid, drawing the test’s validity into question. This study draws on interviews of pupils and their teachers, using a sample of 31 10‐year‐olds who were flagged as most ‘aberrant’ in a standardised mathematics test. The children’s and their teachers’ explanations were analysed and attributed: (i) to item‐, person‐ (self/other) and classroom‐levels; and ii) according to causal dimensions. Children’s and teachers’ explanations were mostly in agreement in relation to unexpected negative results and they included references to previously well‐cited sources of construct‐irrelevant variance (e.g. ineffective test‐taking strategies, careless mistakes) as well as construct‐relevant variance (e.g. misconceptions, weaknesses in particular topics). Findings of this exploratory study are discussed from a test validity and attribution theory perspective: we conclude that this approach offers grounds for multi‐level explanations of person misfit and that this qualitative research approach to unexpected responses is worthy of more attention.  相似文献   

13.
A diagnostic model for large‐scale assessment was applied to TIMSS data to compare mathematics performance of eighth graders from three countries—the US, Singapore, and Israel. Compared were attribute mastery probabilities for content, skills and cognitive processes underlying students’ performance on the 1999 TIMSS‐R mathematics test. Also compared were the proportions of students from the three samples in each of eight hierarchically ordered clusters of knowledge states. The results indicated significant differences in favour of the Singaporean sample on most attributes underlying the test. The results were discussed in light of the cultural context of education in the respective countries.  相似文献   

14.
Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM‐based data with a linear attribute structure. The study utilizes a procedure to make the IRM and CDM frameworks comparable and investigates how estimation accuracy is affected by test diagnosticity and the match between the true and fitted models. The study shows that comparable results can be obtained when highly diagnostic IRM data are retrofitted with CDM, and vice versa, retrofitting CDMs to IRM‐based data in some conditions can result in considerable examinee misclassification, and model fit indices provide limited indication of the accuracy of item parameter estimation and attribute classification.  相似文献   

15.
We used the Rule Space Model, a cognitive diagnostic model, to measure the learning progression for thermochemistry for senior high school students. We extracted five attributes and proposed their hierarchical relationships to model the construct of thermochemistry at four levels using a hypothesized learning progression. For this study, we developed 24 test items addressing the attributes of exothermic and endothermic reactions, chemical bonds and heat quantity change, reaction heat and enthalpy, thermochemical equations, and Hess’s law. The test was administered to a sample base of 694 senior high school students taught in 3 schools across 2 cities. Results based on the Rule Space Model analysis indicated that (1) the test items developed by the Rule Space Model were of high psychometric quality for good analysis of difficulties, discriminations, reliabilities, and validities; (2) the Rule Space Model analysis classified the students into seven different attribute mastery patterns; and (3) the initial hypothesized learning progression was modified by the attribute mastery patterns and the learning paths to be more precise and detailed.  相似文献   

16.
In this study, we present an approach to questionnaire design within educational research based on Guttman's mapping sentences and Many-Facet Rasch Measurement Theory. We designed a 54-item questionnaire using Guttman's mapping sentences to examine the grading practices of teachers. Each item in the questionnaire represented a unique student scenario that was graded by teachers. Three focus groups of elementary (N = 5), middle (N = 4), and high school (N = 2) teachers examined the scenarios for clarity, comprehensiveness, and ease of understanding. Based on the suggestions of the focus groups, the revised questionnaires were completed by 516 public school teachers located in a major metropolitan county in the Southeast. The grades assigned by the teachers to the scenarios were analyzed using the FACETS computer program. The results of the analyses suggest that teachers primarily assign grades on the basis of student achievement as expected, although for some teachers other facets (ability, behavior, and effort) may play a role in final grade assignment.  相似文献   

17.
In recent years, students’ test scores have been used to evaluate teachers’ performance. The assumption underlying this practice is that students’ test performance reflects teachers’ instruction. However, this assumption is generally not empirically tested. In this study, we examine the effect of teachers’ instruction on test performance at the item level using a hierarchical differential item functioning approach. The items are from the U.S. TIMSS 2011 4th-grade math test. Specifically, we tested whether students who had received instruction on a given item performed significantly better on that item compared with students who had not received such instruction when their overall math ability was controlled for, whether with or without controlling for student-level and class-level covariates. This study provides preliminary findings regarding why some items show instructional sensitivity and sheds light on how to develop instructionally sensitive items. Implications and directions for further research are also discussed.  相似文献   

18.
This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item‐level multidimensionality, and (2) whether a Projection IRT model can provide a useful remedy. A real‐data example is used to illustrate the problem and also is used as a base model for a simulation study. The results suggest that ignoring item‐level multidimensionality might lead to inflated item discrimination parameter estimates when the proportion of multidimensional test items to unidimensional test items is as low as 1:5. The Projection IRT model appears to be a useful tool for updating unidimensional item parameter estimates of multidimensional test items for a purified unidimensional interpretation.  相似文献   

19.
Numerous diagnostic studies have been conducted on large-scale assessments to illustrate the students’ mastery profile in the areas of math and reading; however, for science a limited number of investigations are reported. This study investigated Iranian eighth graders’ competency mastery of science and examined the utility of the General Diagnostic Model (GDM) to produce diagnostic information using TIMSS 2011 data. Eight diagnostic attributes were extracted, using documentary analysis of the major large-scale assessment frameworks, including basic science knowledge, using models, reasoning, using science, representing data, explaining of phenomena, predicting, and scientific inquiry. These attributes were then assigned to each item in order to construct the Q matrix, through focus group discussions, survey of head science teachers, think-aloud verbal protocol, and analysis of written answers. Results show the utility of GDM to generate rich diagnostic information for a national large-scale assessment. Moreover, the findings indicated that students performed relatively well in using science, but performed weakly in reasoning, explaining of phenomena, and scientific inquiry, which all required complex skills and higher-order thinking abilities.  相似文献   

20.
Consider test data, a specified set of dichotomous skills measured by the test, and an IRT cognitive diagnosis model (ICDM). Statistical estimation of the data set using the ICDM can provide examinee estimates of mastery for these skills, referred to generally as attributes. With such detailed information about each examinee, future instruction can be tailored specifically for each student, often referred to as formative assessment. However, use of such cognitive diagnosis models to estimate skills in classrooms can require computationally intensive and complicated statistical estimation algorithms, which can diminish the breadth of applications of attribute level diagnosis. We explore the use of sum-scores (each attribute measured by a sum-score) combined with estimated model-based sum-score mastery/nonmastery cutoffs as an easy-to-use and intuitive method to estimate attribute mastery in classrooms and other settings where simple skills diagnostic approaches are desirable. Using a simulation study of skills diagnosis test settings and assuming a test consisting of a model-based calibrated set of items, correct classification rates (CCRs) are compared among four model-based approaches for estimating attribute mastery, namely using full model-based estimation and three different methods of computing sum-scores (simple sum-scores, complex sum-scores, and weighted complex sum-scores) combined with model-based mastery sum-score cutoffs. In summary, the results suggest that model-based sum-scores and mastery cutoffs can be used to estimate examinee attribute mastery with only moderate reductions in CCRs in comparison with the full model-based estimation approach. Certain topics are mentioned that are currently being investigated, especially applications in classroom and textbook settings.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号