首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 312 毫秒
1.
This article studies the difference between the criterion validity coefficient of the widely used overall scale score for a unidimensional multicomponent measuring instrument and the maximal criterion validity coefficient that is achievable with a linear combination of its components. A necessary and sufficient condition of their identity is presented in the case of measurement errors being uncorrelated among themselves and with a used criterion. An upper bound of the difference in these validity coefficients is provided, indicating that it cannot exceed the discrepancy between the maximal reliability and composite reliability indexes. A readily applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the difference between the maximal and scale criterion validity coefficients. The outlined method is illustrated with a numerical example.  相似文献   

2.
A latent variable modeling method for testing criterion correlations with measurement error terms in multicomponent measuring instruments is outlined. The approach is based on an application of the Benjamini–Hochberg multiple testing procedure and can be used when assumptions of validity estimation related procedures need to be examined. The method also allows studying the extent to which criterion validity coefficients might be due to the relationship between a presumed underlying latent construct evaluated by a psychometric scale and a criterion variable, or could be a consequence of the relation between measurement error in the overall scale score and the criterion. The discussed procedure is widely applicable with popular latent variable modeling software, and is illustrated using a numerical example.  相似文献   

3.
ABSTRACT

We investigate whether Anchoring Vignettes (AV) improve intercultural comparability of non-cognitive student-directed factors (e.g., procrastination). So far, correlation analyses for anchored and non-anchored scores with a criterion have been used to demonstrate the effectiveness of AV in improving data quality. However, correlation analyses are often used to investigate external validity of a scale. Nonetheless, before testing for validity, the reliability of the measurement of a construct should be examined. In the present study, we tested for measurement invariance across countries and languages and compared anchored and non-anchored student-directed self-reports that are highly relevant for the students’ self and their behaviour and performance. In addition, we apply further criteria for testing reliability. The results indicate that the data quality for some of the constructs can – in fact – be improved slightly by anchoring; whereas, for other self-reports, anchoring is less successful than was hoped. We discuss with regard to possible consequences for research methodology.  相似文献   

4.
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or simple hypotheses about these coefficients. The proposed method is illustrated with a numerical example.  相似文献   

5.
This simulation study examines the efficacy of multilevel factor mixture modeling (ML FMM) for measurement invariance testing across unobserved groups when the groups are at the between level of multilevel data. To this end, latent classes are generated with class-specific item parameters (i.e., factor loading and intercept) across the between-level classes. The efficacy of ML FMM is evaluated in terms of class enumeration, class assignment, and the detection of noninvariance. Various classification criteria such as Akaike’s information criterion, Bayesian information criterion, and bootstrap likelihood ratio tests are examined for the correct enumeration of between-level latent classes. For the detection of measurement noninvariance, free and constrained baseline approaches are compared with respect to true positive and false positive rates. This study evidences the adequacy of ML FMM. However, its performance heavily depends on the simulation factors such as the classification criteria, sample size, and the magnitude of noninvariance. Practical guidelines for applied researchers are provided.  相似文献   

6.
This study analyzed the relationship between benchmark scores from two curriculum‐based measurement probes in mathematics (M‐CBM) and student performance on a state‐mandated high‐stakes test. Participants were 298 students enrolled in grades 7 and 8 in a rural southeastern school. Specifically, we calculated the criterion‐related and predictive validity of benchmark scores from CBM probes measuring math computation and math reasoning skills. Results of this study suggest that math reasoning probes have strong concurrent and predictive validity. The study also provides evidence that calculation skills, while important, do not have strong predictive strength at the secondary level when a state math assessment is the criterion. When reading comprehension skill is taken into account, math reasoning scores explained the greatest amount of variance in the criterion measure. Computation scores explained less than 5% of the variance in the high‐stakes test, suggesting that it may have limitations as a universal screening measure for secondary students.  相似文献   

7.
基于跨时测量恒等视角与知识图谱分析,文章对我国教育技术学较常探讨的变量"自我效能"量表进行了工具检测,并以四川省某小学三年级的197名学生为被试,前后测时间间隔为6个月。文章采用结构方程模型的跨时测量恒等检验程序,依序针对不同恒等程度的模型进行比较,结果发现:数学自我效能量表不符合完全的度量恒等,放宽两道题项的参数限制后可达到部分的纯量恒等,但仍不及严格恒等的要求;跨时测量恒等性的结果会影响配对样本t检验的结论。基于此,文章提出建议:为了提升实验的内在效度,较长时间的实验研究应纳入工具的跨时测量恒等性检验。  相似文献   

8.
High self-efficacy is a marker of successful teaching and is, therefore, a subject of great interest to research on inclusive education. One of the most frequently used instruments to assess such beliefs is the Teacher Efficacy for Inclusive Practice (TEIP) scale. Although used widely, some studies did not precisely replicate the original factor structure, and no short form of the TEIP scale currently exists, although this could enhance measurement efficiency. This study (1) systematically assessed the TEIP scale's factor structure and psychometric properties, (2) identified potentially problematic items and developed a more concise short form of the scale, and (3) evaluated its dimensionality and criterion and convergent validities using three validation samples of teachers in three different countries (486 in Switzerland, 189 in Australia and 276 in Canada). Compared to the full-length TEIP scale, the TEIP-SF uses half the items, demonstrates better model fit and reveals a clearer distinction of domain-specific factors. In conclusion, the TEIP-SF represents a concise, efficient means of assessing teachers' self-efficacy about teaching in inclusive classrooms.  相似文献   

9.
The Student–Teacher Relationship Scale (STRS) is widely used for research in kindergarten and school. The increasing number of applications inside and outside of the U.S. stresses the need to investigate STRS properties, accordingly. The present study used the STRS in German-speaking countries, examining whether (a) the original factor structure is appropriate for a German version, (b) whether applications of a German STRS are invariant across contexts (kindergarten, first and second grade) as well as gender, and (c) whether construct and criterion validity are met. The original STRS was translated into German and filled out by 368 kindergarten and 503 elementary school teachers in Germany and Austria. Observations in kindergartens, student reports in schools, and teacher reports of students’ characteristics served as validity criteria. Results of confirmatory factor analyses (CFAs) did not confirm the original STRS factor structure. Subsequent exploratory factor analyses on training samples resulted in significant item reductions, followed by further CFAs on validation samples. The bootstrapped results yielded an adjusted three-factor model with subscales indicating satisfying alphas and invariance across context and gender. Construct and criterion validity were met for all subscales of the German STRS based on various criteria from both, observations and reports.  相似文献   

10.
我国现行法律规范没有对知识产权法定赔偿的计量标准予以明确规定,以致在审判实务中产生同案不同判的尴尬局面,对法制统一工作的开展十分不利。在实务中,无论是权利主体标准、侵权主体标准,还是行为标准、产品标准,都有缺陷存在。而法律最终要保护的就是权利人在法律上受保护的利益,应还原法律追求价值的本来面目,以权利作为知识产权法定赔偿适用的计量标准。  相似文献   

11.
The Moral Competence Test (MCT) was designed over 30 years ago to provide a resource for educators interested in conducting cross-cultural studies of moral development and education. Since its origin, it has been translated into at least 30 languages and used in hundreds of studies. However, few studies provide evidence to support the use of the test in the US. The test’s designer identified three criteria for evaluating the construct validity of the test and its primary scores: do correlations of stage scores reflect a simplex structure, do ratings follow the theoretical order of stages, does the test differentiate preferences and structures of reasoning. We use these criteria and evidence of criterion and content validity to assess the validity of the MCT. We present results from two US samples (n = 772). Results analyzing the test author’s criteria support the semantic validity of the test, however, evidence of criterion validity raise questions about the C-score as a measure of moral competence. After controlling for stage preferences, the C-score was negatively related to democratic attitudes and positively related to dogmatism.  相似文献   

12.
Behavior rating scales are indirect measures of emotional and social functioning used for assessment purposes. Rater bias is systematic error that may compromise the validity of behavior rating scale scores. Teacher bias in ratings of behavior has been investigated in multiple studies, but not yet assessed in a research synthesis that focuses on the role of ethnicity and culture. Teacher bias in ratings of student behavior was investigated through a comprehensive literature review that only included studies with a defensible criterion of true behavior against which to compare rating scores. A final total of 13 studies of teacher bias suggested mixed evidence for bias due to student ethnicity and strong evidence of bias due to teacher culture, particularly when positive stereotypes were violated. Limitations and future directions of research are discussed.  相似文献   

13.
School climate surveys are central to school improvement and principal evaluation policies. The quality of school climate has been linked both to student achievement and to teacher retention. Oftentimes, policymakers and practitioners are concerned with monitoring change in school climate quality in each academic year. Such applications assume longitudinal factorial invariance—it is presupposed that the surveys are measuring the same things in the same metric at each time point. While there is considerable research examining the validity of inferences based on survey‐derived climate indicators, this research is almost exclusively based on cross‐sectional data. There is little literature describing procedures for gathering evidence of factorial invariance of school climate indicators. This study proposes to adapt existing methods for evaluating factorial invariance in longitudinal designs into multilevel frameworks, and in doing so, articulates a novel method for evaluating longitudinal measurement invariance in school climate research. This technique is illustrated on a widely used school climate survey.  相似文献   

14.
以美国著名后现代作家唐·德里罗的《白噪音》为例,在批评传统翻译观念中关于"忠实"性翻译标准的基础上,提出了目前在翻译活动中应遵守的原则,即后现代文化语境中的翻译标准。它们是:符合知识的客观性;理解的合理性与解释的普遍有效性;符合原文的定向性。用暴力颠覆原文的界限,从而寻求译文的真正越界。  相似文献   

15.
作为犯罪既遂标准的构成要件说除存在逻辑缺陷外,在司法实践中的应用也日益暴露出其存在的弊端.犯罪客体侵害说比构成要件说具有明显的理论优越性,符合刑之减抑的国际潮流,能更好地保障刑罚功能和刑法目的的实现.  相似文献   

16.
Increasingly, the literature suggests that the sense of coherence (SOC) positively influences well-being in later life. This study reports the assessment of the following psychometric properties: distributional properties, construct, criterion and external-related validities, and reliability of the Orientation to Life Questionnaire (OtLQ) in an cross-national population of older adults. We recruited 1291 community-dwelling older adults aged between 75–102 years (M = 83.9; SD = 6.68). Convenience sampling was used to gather questionnaire data. The construct validity was asserted by confirmatory factor analysis and convergent and discriminant validity. Moreover, criterion and external-related validities, as well as distributional properties and reliability, were also tested. Data gathered with the 29-items OtLQ scale showed overall good psychometric properties in terms of distributional properties, construct, criterion, and external-related validities, as well as reliability. Three factors were validated for the OtLQ scale: (a) comprehensibility; (b) manageability; and (c) meaningfulness. We validated the three-factor OtLQ scale, which produced valid and reliable data for a cross-national sample with older adults. Hence, it is an adequate instrument for assessing sense of coherence among older people in health care practice and program development contexts.  相似文献   

17.
18.
A correlation structure modeling method for comparison of mediated effects is outlined. The procedure permits point and interval estimation of differences in mediator effects, and is useful with models postulating 1 or more predictor, intervening, or response variables that may also be latent constructs. The approach allows scale-free evaluation of differences in effects of any explanatory upon criterion variables transmitted via studied mediators, and is applied on data from a study of older adults with recent vision loss.  相似文献   

19.
ABSTRACT

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay’s true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By eliminating one, two, or three raters at a time, and by calculating an estimate of the true scores using the remaining raters, an independent criterion against which to judge the validity of the human raters and that of the AES system, as well as the interrater reliability was produced. The results of the study indicated that the automated scores correlate with human scores to the same degree as human raters correlate with each other. However, the findings regarding the validity of the ratings support a claim that the reliability and validity of AES diverge: although the AES scoring is, naturally, more consistent than the human ratings, it is less valid.  相似文献   

20.
Assessing the correctness of a structural equation model is essential to avoid drawing incorrect conclusions from empirical research. In the past, the chi-square test was recommended for assessing the correctness of the model but this test has been criticized because of its sensitivity to sample size. As a reaction, an abundance of fit indexes have been developed. The result of these developments is that structural equation modeling packages are now producing a large list of fit measures. One would think that this progression has led to a clear understanding of evaluating models with respect to model misspecifications. In this article we question the validity of approaches for model evaluation based on overall goodness-of-fit indexes. The argument against such usage is that they do not provide an adequate indication of the “size” of the model's misspecification. That is, they vary dramatically with the values of incidental parameters that are unrelated with the misspecification in the model. This is illustrated using simple but fundamental models. As an alternative method of model evaluation, we suggest using the expected parameter change in combination with the modification index (MI) and the power of the MI test.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号