期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Students’ test motivation in PISA: the case of Norway

Therese N. Hopfenbeck Marit Kjærnsli 《Curriculum Journal》2016,27(3):406-422

Do students make their best effort in large-scale assessment studies such as the Programme for International Student Assessment (PISA)? Despite six cycles of PISA surveys from 2000 to 2015, empirical studies regarding students’ test motivation and experience of the tests are sparse. The present study examines students’ test motivation in PISA, and how closely students’ achievement correlates with students’ reported test motivation. A total of 40 students from eight schools that participated in the PISA study in 2006, 2009 and 2012 were interviewed. In addition, questionnaire data from a total of 9400 students who participated in PISA 2009 and 2012 were collected. The findings of this study indicate that students overall were motivated to do their best in the PISA study, despite the fact that these low-stakes tests have no impact on students’ grades or future school entrance. In contrast to claims in the media, where it has been suggested that Norwegian students are more relaxed towards the PISA test than students in other countries, students report that they are motivated to do their best, but girls report significantly higher test motivation than boys. Despite the policy influence of PISA and the large number of countries participating, very few studies have reported findings from students’ perspectives regarding their test motivation and experience of the test. The present study aims to fill that gap. 相似文献

2.

A critical analysis of problems with the LBOTE category on the NAPLaN test

Sue Creagh 《The Australian Educational Researcher》2014,41(1):1-23

The National Assessment Program: Literacy and Numeracy (NAPLaN) is an annual literacy and numeracy test for all Australian students, and results from the test are disaggregated into a number of categories including language background other than English (LBOTE). For this and other categories, results on each section of the test are aggregated into state, territory and national means and standard deviations enabling comparison of performance. The NAPLaN data indicate that since the test began, in 2008, at a national level there is little difference between the results of LBOTE and non-LBOTE students on all domains of the test. This is a national result, and there is greater variation at state and territory level. However, these results defy a logic which might suggest that the LBOTE category will reflect the influence of English as a second language on test performance, rather suggesting that a second language background is not associated with test performance. In this paper, I will interrogate the variation in the LBOTE category, using data provided by the Queensland state education department, focusing on year 9 students who participated in the 2010 test. Using multiple regression and focusing on variables which are specifically related to language background, I will show that within the LBOTE category there is a wide variation of performance, and the LBOTE data are in fact hiding some of our most disadvantaged students. I will suggest alternative ways in which language learners could be identified to better empower policy and pedagogical responses to student needs. 相似文献

3.

Does the repetitive use of the same test in consecutive examination sessions facilitate cheating?

András István Kun 《Assessment & Evaluation in Higher Education》2015,40(8):1159-1181

This paper introduces an empirical study testing three kinds of bias in higher education student assessment. All of them are connected to the repetitive use of the same test questions which may facilitate academic cheating. The ‘same tests effect’ may appear if two or more groups of students are writing the same test one after the other and, as a result, a statistically significant improvement is detectable in the test scores of the second student group. The ‘revealed sameness effect’ is the impact of informing the students in some way that the test questions will be repeated. The ‘self selection effect’ arises when the students choose their examination turn themselves and this boosts their measured performance. The present study examines the three effects with independent t-tests and linear regression models on samples of 1221, 235, and 201 students (in this order), from four business courses in six academic semesters. The results do not support the ‘same test effect’, but support the ‘revealed sameness effect’ and the ‘self selection effect’. 相似文献

4.

What predicts injury from physical punishment? A test of the typologies of violence hypothesis

Gonzalez M Durrant JE Chabot M Trocmé N Brown J 《Child abuse & neglect》2008,32(8):752-765

OBJECTIVE: This study examined the power of child, perpetrator, and socio-economic characteristics to predict injury in cases of reported child physical abuse. The study was designed to assess the validity of the assumption that physically injurious incidents of child physical abuse are qualitatively different from those that do not result in injury, that their generative factors are distinctive, and that the quality of caregiving in these two types of incidents is different. METHOD: A weighted, nationally representative sample of 8,164 substantiated punishment abuse cases in Canada was used. Various models were constructed and evaluated through logistic regression. RESULTS: Of six potential predictors - child age, perpetrator sex, child functioning, parent functioning, economic stress, and social stress - none predicted injury to the child. CONCLUSIONS: The findings suggest that injurious and non-injurious physical abuse cannot be distinguished on the basis of the personal characteristics or circumstances of the child or perpetrator. PRACTICE IMPLICATIONS: A common criterion for child welfare intervention into cases of suspected physical abuse is injury or risk of injury. This criterion assumes that injurious and non-injurious assaults are qualitatively different phenomena, predicted by different risk factors. In the present study an attempt was made to differentiate between injurious and non-injurious cases of punitive physical abuse on the basis of characteristics of the child, perpetrator, family, and social context. None of these factors explained the likelihood of injury, suggesting that the prediction of injury as an intervention criterion may be questionable. 相似文献

5.

Controlling response shift bias: the use of the retrospective pre‐test design in the evaluation of a master's programme

Jonathan Drennan Abbey Hyde 《Assessment & Evaluation in Higher Education》2008,33(6):699-709

Traditionally the measures used to evaluate the impact of an educational programme on student outcomes and the extent to which students change is a comparison of the student's pre‐test scores with his/her post‐test scores. However, this method of evaluating change may be problematic due to the confounding factor of response shift bias when student self‐reports of change are used. Response shift bias occurs when the student's internal frame of reference of the construct being measured, for example research ability or critical thinking, changes between the pre‐test and the post‐test due to the influence of the educational programme. To control for response shift bias the retrospective pre‐test method was used to evaluate the outcomes achieved from students completing a research module at master's level. The retrospective pre‐test method differs from the traditional pre‐test–post‐test design in that both post‐test and pre‐test perceptions of respondents are collected at the same time. The findings indicated that response shift bias was evident in that the programme had significantly greater impact on outcomes than identified using the traditional pre‐test–post‐test design leading to the conclusion that students may overestimate their ability at the commencement of an educational programme. The retrospective pre‐test design is not a replacement for the traditional pre‐test–post‐test measures but may be a useful adjunct in the evaluation of the impact of educational programmes on student outcomes. 相似文献

6.

Optimized two-level placement of test points for multi-objective air monitoring of the Three Gorges Reservoir area

XIAO Dong-hai TAN Chun-lu WANG Jun-qiang ZHONG Yuan-chang 《重庆大学学报(英文版)》2007,6(4):231-237

To fit the complicated geographic conditions of the Three Gorges Reservoir area, a two-level multi-objective monitoring system was developed to monitor the atmosphere of the area. Statistical analysis of environmental monitoring data and the macro control principle were employed to configure the upper layer. The lower layer was designed by the application of the thumb rule to a local terrain and specific point sources of pollution therein. The optimized two-level system comprises an upper layer of 16 monitoring stations distributed at places of diverse geographical, ecological, economical and social characteristics, and a lower layer of 16 sub-machines at each monitoring station of the upper layer. This optimal outcome fits the complicated conditions of the Three Gorges Reservoir area, substantially cuts down the installation cost and the operation cost, and provides accurate monitoring data of atmosphere over the entire area with a high resolution. 相似文献

7.

Relationships between the use of test results and US students’ academic performance

Hongli Li C. Kevin Fortner Xiaoxuan Lei 《School Effectiveness & School Improvement》2013,24(2):258-278

In this study, we examined relationships between the use of test results and US students’ math, reading, and science performance in the Programme for International Student Assessment (PISA) 2009. Based on a literature review, we hypothesized that the 16 items in the PISA school questionnaire, which are related to the use of test results, can be categorized according to 4 factors. We validated this hypothesized factor structure using a confirmatory factor analysis and then obtained composite scores for each factor. As revealed by a multilevel analysis, when student and school demographic variables were controlled for, using test results to hold schools accountable to authority and the public was significantly positively related to students’ performance across all 3 subjects. No statistically significant relationship, however, was detected between students’ performance and the following uses of test scores: informing parents of their children’s performance, providing information for instructional purposes, and evaluating teachers and principals. 相似文献

8.

Investigating the use of the National Assessment Program – Literacy and Numeracy (NAPLAN) test results

《Studies in Educational Evaluation》2023

The National Assessment Program – Literacy and Numeracy (NAPLAN) is a nationwide testing program for literacy and numeracy in Australia. Several studies explored and used NAPLAN numeracy test results as a source of valuable data and a potential means to improve education. This paper presents a systematic literature review to investigate the use of NAPLAN numeracy test results in those peer-reviewed articles in relation to the purposes of NAPLAN results mentioned by the Australian Curriculum, Assessment and Reporting Authority (ACARA). Findings showed a variety of uses of the NAPLAN numeracy test results in these studies. Most of the studies used the test results to map student progress and identify strengths and weaknesses in teaching. A significant number of studies used the NAPLAN numeracy test results that differ from the purposes mentioned by ACARA. The review concluded that there is currently insufficient use which reflects the purpose of NAPLAN test results. 相似文献

9.

Research on the overload protection reliability of moulded case circuit-breakers and its test device 总被引：1，自引：0，他引：1

LI Kui LU Jian-guo WU Yi QIN Zhi-jun YAO Dong-mei 《浙江大学学报(A卷英文版)》2007,8(3):453-458

This paper analyzed the reliability and put forward the reliability index of overload protection for moulded case circuit breaker. The success rate was adopted as its reliability index of overload protection. Based on the reliability index and the reli- ability level, the reliability examination plan was analyzed and a test device for the overload protection of moulded case cir- cuit-breaker was developed. In the reliability test of overload protection, two power sources were used, which reduced the time of conversion and regulation between two different test currents in the overload protection test, which made the characteristic test more accurate. The test device was designed on the base of a Windows system, which made its operation simple and friendly. 相似文献

10.

Theory of mind in dogs: is the perspective-taking task a good test?

Roberts WA Macpherson K 《Learning & behavior》2011,39(4):303-305

Udell, Dorey, and Wynne (in press) have reported an experiment in which wolves, shelter dogs, and pet dogs all showed a significant preference for begging from a person who faced them (seer) over a person whose back was turned to them (blind experimenter). On tests with the blind person’s eyes covered with a bucket, a book, or a camera, pet dogs showed more preference for the seer than did wolves and shelter dogs. We agree with the authors’ position that most of these findings are best explained by preexperimental learning experienced by the subjects. We argue, however, that the perspective-taking task is not a good test of the domestication theory or of the theory of mind in dogs. The problem we see is that use of the perspective-taking task, combined with preexperimental learning in all the subjects, strongly biases the outcome in favor of a behavioral learning interpretation. Tasks less influenced by preexperimental training would provide less confounded tests of domestication and theory of mind. 相似文献

11.

A preliminary investigation of role‐taking in descriptions of self and others in the twenty statements test

Robert T. Craig 《Communication quarterly》2013,61(3):187-195

相似文献

12.

Using the Illinois test of psycholinguistic ability with bilingual and monolingual language‐impaired children

Ernst Ottem Unni Jakobsen 《Scandinavian Journal of Educational Research》2013,57(2):159-167

The current popular case against the use of standardised ability tests in bilingual assessment is not as unequivocal as may be commonly assumed. Evidence currently available indicates that such tests generally measure the same constructs, with equal accuracy, regardless of language backgrounds. The purpose of this paper is to contribute to this debate by presenting a detailed comparison of the Illinois Test of Psycholinguistic Ability (ITPA) profiles of monolingual language‐impaired children who have received the diagnosis ‘specifically language‐impaired’ (SLI) (N = 95) and ITPA profiles of bilingual children who are referred for assessment of their language problems (N = 43). It is concluded that most bilingual children suffer from a ‘knowledge‐based’ problem, while most monolingual children suffer from a more fundamental ‘processing‐based’ problem. By presenting this analysis, we hope to contribute to the general debate about whether this commonly used test can be used in the assessment of bilingual children. 相似文献

13.

Does teaching to the test improve student learning?

《International Journal of Educational Development》2021

A central concern surrounding test-based accountability is that teachers may narrow teaching practices to improve test performance on a curriculum-based specific knowledge test rather than student learning more broadly. Two of the most common teaching practices that “teach to the test” are providing test-specific classwork and increasing the frequency with which students take practice tests. Whether such teaching practices improve student learning—both in terms of learning the content associated with a specific knowledge test as well as more general learning—is a largely unanswered question. To approach this question, this paper uses a student fixed effects approach to analyze the impact of these kinds of narrow teaching practices on student performance on a specific test as well as a general knowledge test. We find that test-specific classwork and practice tests with specific test items tend to have little or negative impacts on curriculum specific or general knowledge test performance, except for male students, and that subject practice tests (without emphasizing test-specific items) have positive effects on student outcomes on both kinds of tests, but larger on the curriculum-specific than on the general test, and much larger on the curriculum-specific test for male students. We discuss the logic for these results and what they tell us about the effectiveness of test-focused teaching practices more generally. 相似文献

14.

Teaching referents and the warrants used to test the viability of students' mental models: Is there a link?

Stephen M. Ritchie Kenneth Tobin Karl S. Hook 《科学教学研究杂志》1997,34(3):223-238

From a constructivist perspective, learners construct viable knowledge rather than acquire representations of truth. The warrants of authority, coherence, and empirical evidence are identified as means by which the viability of knowledge claims can be established by learners. In this interpretive study, we examined which warrants were invoked by Grade 8 science classroom participants in their daily interactions, and whether these were linked to particular teaching referents. We found that when the teacher embraced a content exposure referent at the expense of constructivism, the warrant of authority dominated interactions. The students' alternative conceptions and mental models were not explored appropriately. Instead, students most frequently were expected to accept the voice of authority uncritically. By such actions students were restrained from meaningful learning. © 1997 John Wiley & Sons, Inc. J Res Sci Teach 34: 223–238, 1997. 相似文献

15.

A new test for China? Stages in the development of an assessment for professional purposes

Yan Jin Liz Hamp-Lyons 《Assessment in Education: Principles, Policy & Practice》2015,22(4):397-426

It is increasingly recognised that attention should be paid to investigating the needs of a new test, especially in contexts where specific purpose language needs might be identified. This article describes the stages involved in establishing the need for a new assessment of English for professional purposes in China. We first investigated stakeholders’ perceptions of the target language use activities and the necessity of the proposed assessment. We then analysed five existing tests and six language frameworks to evaluate their suitability for the need of the proposed assessment. The resulting proposal is for an advanced-level English assessment capable of providing a diagnostic evaluation of the proficiency of potential employees in areas of relevance to multinationals operating in China. The study has demonstrated the value of following a principled procedure to investigate the necessity for and the needs of a new test at the very beginning of the test development. 相似文献

16.

Teacher education graduates’ entrance into the teaching profession: Development and test of a model 总被引：1，自引：0，他引：1

Isabel Rots Antonia Aelterman 《European Journal of Psychology of Education - EJPE》2009,24(4):453-471

This study aims to advance insight into the relationship between teacher education and graduates’ intended and actual entrance into the teaching profession. Moreover, it indicates how this relationship varies between teacher training for primary education (i.e., programs for class teachers-to-be) and teacher training for secondary education (i.e., programs for subject teachers-to-be). A hypothetical model of graduates’ entrance into the teaching profession comprising empirically grounded variables was developed. Besides teacher education related variables, other factors (e.g., motivation and labour market) were included. Data for this study were collected based on a sample of 301 teacher education graduates. Logistic regression analysis shows that intention to enter teaching is a valid predictor of actual entrance. Path analysis was applied to test the hypothetical pattern of (inter)relationships between the variables. For graduates from teacher training for pre-school and primary education, the path model accounts for 33% of the variance in intention to enter teaching. For teacher training for secondary education the explained amount of variance is 44%. Results are largely consistent across the two types of teacher training showing a small, although statistically significant, contribution of teacher education above other antecedents of graduates’ intention to enter teaching. 相似文献

17.

Measuring scientific reasoning – a review of test instruments

Ansgar Opitz Moritz Heene Frank Fischer 《Educational Research and Evaluation》2017,23(3-4):78-101

Education systems increasingly emphasize the importance of scientific reasoning skills such as generating hypotheses and evaluating evidence. Despite this importance, we do not know which tests of scientific reasoning exist, which skills they emphasize, how they conceptualize scientific reasoning, and how well they are evaluated. Therefore, this article reviews 38 scientific reasoning tests. They used to primarily consist of multiple-choice questions, but since then have become more diverse, even including tests that automatically analyse virtual experiments. Furthermore, this review revealed that the tests focus on the skills of generating hypotheses, generating evidence, evaluating evidence, and drawing conclusions. Additionally, conceptualizations of scientific reasoning have moved towards a domain-specific set of different but coordinated skills over the years. Finally, concluding from gaps in test evaluation, a future focus should be on testing theoretical assumptions, comparing different scientific reasoning tests, and how relevant test results are in predicting criterion variables like academic performance. 相似文献

18.

Cyclic behavior test of a new double-arch steel gate

LUO Yao-zhi ZHU Shi-zhe CHEN Xi 《浙江大学学报(A卷英文版)》2007,8(11):1731-1739

A new double-arch structure for the gate used as tidal barrage and sluice was adopted in Caoe River Dam in China. It was a spatial structure made up of the right arch, the invert arch, the chord, etc., and was designed to bear bilateral loads. To research the cyclic behavior of the new double-arch structure, a scale-model cyclic test was conducted. First, the test setup and test method were presented in detail, and according to the test results, the cyclic behavior and failure characteristics of this structure were discussed. Then by analyzing the test cyclic envelope curve, it was found the curve was divided into three stages: the elastic stage, the local plastic stage and the failure stage at the local yield point and structural yield point. The gate model has local yield strength and structural yield strength, with both their values being bigger than that of the designing load. Therefore, the gate is safe enough for the projects. At last, dynamic property of the gate was analyzed considering additional mass of the water. It was found that the tidal bore shock would not cause resonance vibration of the gate. 相似文献

19.

What makes a matrix so effective? An empirical test of the relative benefits of signaling, extraction, and localization

Douglas F. Kauffman Kenneth A. Kiewra 《Instructional Science》2010,38(6):679-705

What type of display helps students learn the most and why? This study investigated how displays differing in terms of signaling, extraction, and localization impact learning. In Experiment 1, 72 students were assigned randomly to one cell of a 4 × 2 design. Students studied a standard text, a text with key ideas extracted, an outline that localized ideas topically, and a matrix that localized ideas topically and categorically. One version of the displays signaled the displays’ organization and one version did not. The matrix display proved best for facilitating fact and relationship learning because of its ability to localize related information within topics and categories. Simply signaling or extracting text ideas was not helpful. Experiment 2 demonstrated that not all matrices are created equal because they can vary in terms of how information is localized. About 54 students were assigned randomly to one cell of a 2 × 2 design that varied localization of matrix topics and categories. Students studied matrices high or low in topical organization and high or low in categorical organization. Results confirmed that a high, natural ordering of matrix topics is necessary to highlight relationships and bolster relationship and fact learning. 相似文献

20.

Taking and teaching the test are not the same: a case study of first-year teachers’ experiences in high-stakes contexts

Christopher P. Brown 《Teachers and Teaching》2013,19(8):1026-1044

Background: Policymakers’ use of high-stakes exams to improve students’ academic achievement affects teachers and their tenure in the field at all levels of schooling. Novice teachers now being inducted into the field have been educated almost exclusively in these high-stakes learning environments. Yet, how their familiarity with these contexts combined with their experiences in their own classrooms affect novices’ induction into the field of teaching has not been fully examined. Aim: This article presents findings from an investigation into the experiences of two first-year teachers who were educated and trained to be teachers in the same high-stakes education system in which they taught. It examines how these first-year teachers viewed policymakers’ reforms affecting their teaching and tenure in the field. Methods: This qualitative case study centers on the experiences of two first-year teachers working in the same high-stakes standards-based accountability teaching context in which they were educated – the case. This study provides insight into the issue of how novices’ familiarity with high-stakes reform combined with their experiences in their own classrooms impacts their conceptions of their teaching and their tenure in the field. Outcomes: The findings from this case study reveal how policymakers’ high-stakes reforms impacted the development of these novice teachers in significant ways. Not only did they have to learn how to teach as they taught, but they also had to ensure they were teaching all of their students to pass the high-stakes exams. Their varied experiences also demonstrate how these high-stakes exams can ‘test’ beginning teachers out of the classroom. Conclusions: These findings demonstrate that first-year teachers’ familiarity with policymakers’ high-stakes reforms is not enough to prepare them for the expectation that they immediately improve students’ academic achievement on high-stakes exams. Such findings not only challenge what it means to be a educator in these contexts, but they also shed light on how larger political and economic forces impact the teaching and tenure of novices. To support new teachers, teacher educators and mentors should rethink the education and induction processes while helping novices understand as well as prepare for the role context plays in their teaching and development as professionals. 相似文献