首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In this study, a multiple-choice test entitled the Science Process Assessment was developed to measure the science process skills of students in grade four. Based on the Recommended Science Competency Continuum for Grades K to 6 for Pennsylvania Schools, this instrument measured the skills of (1) observing, (2) classifying, (3) inferring, (4) predicting, (5) measuring, (6) communicating, (7) using space/time relations, (8) defining operationally, (9) formulating hypotheses, (10) experimenting, (11) recognizing variables, (12) interpreting data, and (13) formulating models. To prepare the instrument, classroom teachers and science educators were invited to participate in two science education workshops designed to develop an item bank of test questions applicable to measuring process skill learning. Participants formed “writing teams” and generated 65 test items representing the 13 process skills. After a comprehensive group critique of each item, 61 items were identified for inclusion into the Science Process Assessment item bank. To establish content validity, the item bank was submitted to a select panel of science educators for the purpose of judging item acceptability. This analysis yielded 55 acceptable test items and produced the Science Process Assessment, Pilot 1. Pilot 1 was administered to 184 fourth-grade students. Students were given a copy of the test booklet; teachers read each test aloud to the students. Upon completion of this first administration, data from the item analysis yielded a reliability coefficient of 0.73. Subsequently, 40 test items were identified for the Science Process Assessment, Pilot 2. Using the test-retest method, the Science Process Assessment, Pilot 2 (Test 1 and Test 2) was administered to 113 fourth-grade students. Reliability coefficients of 0.80 and 0.82, respectively, were ascertained. The correlation between Test 1 and Test 2 was 0.77. The results of this study indicate that (1) the Science Process Assessment, Pilot 2, is a valid and reliable instrument applicable to measuring the science process skills of students in grade four, (2) using educational workshops as a means of developing item banks of test questions is viable and productive in the test development process, and (3) involving classroom teachers and science educators in the test development process is educationally efficient and effective.  相似文献   

2.
Students’ perceptions of their classroom climate have been found to relate significantly to students’ learning outcomes. The purpose of the present study was to construct an instrument for assessing elementary-school students’ perceptions of classroom climate, based on a previous instrument that was being used in Chile by a public national school mental health program as a tool for aiding teachers in improving classroom management, but which showed poor psychometric properties. We used a six-staged mixed-methods approach to construct relevant items and dimensions based on this measure and by adapting previously-existing scales. Item development included participatory construction of items involving program officials, focus groups with students, and a pilot study. The final version was administered to a sample of 6813 elementary-school students. Results showed adequate reliability and construct validity, convergent validity with school climate, and divergent validity with peer victimisation. When consequential validity was explored through semi-structured interviews with program officials and school administrators, we found that the instrument was being used as a tool for helping teachers to improve their school climate and management skills. We discuss the importance of constructing instruments using a mixed-methods approach.  相似文献   

3.
4.
A test for measuring science attitudes, named Test of Science Related Attitudes (TOSRA), was initially developed in Australia by Fraser (1977, 1978). This study investigated the crosscultural validity of this instrument when used with American high school students. Three hundred and thirty-six students (12th and 11th graders) in three high schools in suburban Chicago took the test. The results of the study, confirming previous validation of the test, revealed that the seven subscales of TOSRA were, in general, highly reliable. The discriminant validity of each of these scales, however, was found to be generally low. The item/scale correlation for all but four items of the test met Shrigley's (Journal of Research in Science Teaching, 20 (1), 87–89, 1983) criterion of being more than 0.30. The results of the principal components with varimax rotation did not support the distinctiveness of the subscale structure of the test.  相似文献   

5.

In this research, we developed and validate an instrument for assessing high-school students’ perceptions of socio-scientific issues (SSI)-based learning in biology. The development of the instrument was carried out using the following four steps: item formulation, content validation, construct validation and reliability calculation. According to an extensive literature review, 28 initial items were formulated for four SSI-based instruction aspects: (1) contextualisation of SSI, (2) student involvement, (3) student attitudes towards SSI-learning and (4) SSI-based learning objectives. Based on the pilot study conducted for construct validation, 24 items were successfully constructed for the four scales and subsequently administered in biology classrooms to 151 students. Factor analysis showed that each of the 24 items had a factor loading at least 0.40 on its own scale and no other scale. Moreover, the four scales accounted for 58.32% of the total variance and the alpha reliability coefficients for the scales ranged from 0.75 to 0.85. According to these findings, the SSI-based Learning Environment Inventory in Biology is valid, reliable and useful.

  相似文献   

6.
Early identification of children experiencing difficulties in learning is essential for timely and effective intervention. The aim of this study was to develop a screening instrument for identifying students with learning difficulties at the end of the first term of Primary 1 in Hong Kong. This paper describes the development and validation of a 27‐item checklist on 594 students. Validity testing performed included content and expert, contrasted group, concurrent and predictive validity; all were found to be satisfactory. In addition, test–re‐test reliability was assessed to be good. The checklist can be administered in 10 minutes for each student based on everyday observations of the student; no specific testing of the students would be required. A longer version with 97 items was also made available for teachers to conduct a more comprehensive evaluation of a child’s performance.  相似文献   

7.
This article describes the development, validation and application of a Rasch-based instrument, the Elementary School Science Classroom Environment Scale (ESSCES), for measuring students’ perceptions of constructivist practices within the elementary science classroom. The instrument, designed to complement the Reformed Teaching Observation Protocol (RTOP), is conceptualised using the RTOP’s three construct domains: Lesson Design and Implementation; Content; and Classroom Culture. Data from 895 elementary students was used to develop the Rasch scale, which was assessed for item fit, invariance and dimensionality. Overall, the data conformed to the assumptions of the Rasch model. In addition, the structural relationships among the retained items of the Rasch model supported and validated the instrument for measuring the reformed science classroom environment theoretical construct. The application of the ESSCES in a research study involving fourth grade students provides evidence that educators and researchers have a reliable instrument for understanding the elementary science classroom environment through the lens of the students.  相似文献   

8.
The paper describes the development and validation of a group test of integrated process skills. The test assesses student performance on a set of twelve objectives related to the generic objective: planning and conducting an investigation. Evidence of content validity, construct validity, and reliability are presented in the paper. A range of generalizability coefficients from 0.77 to 0.98 is reported for specific uses of the 24-item test. Since the items measure performance on objectives that can be readily translated into classroom activity, the test has direct applicability to classroom based research, and evaluation of instruction. In addition to sound psychometric properties, the Test of Integrated Science Processes is distincitve because it includes a set of interrelated, cumulative objectives which reflect autonomous problem solving.  相似文献   

9.
An instrument was developed to assess the perceived learner–teacher transactional distance in the context of the Hellenic Open University and to provide psychometric evidence of reliability and validity of the interpretation of the scores derived from the scale. First, an elicitation study gathered information on students’ beliefs and feelings about their teachers’ behavior. Two dimensions of learner–teacher transactional distance, namely, co-understanding and awareness, were revealed. Then, an item pool was developed and the preliminary questionnaires (one for each dimension) were field tested on 60 postgraduate students who commented on their clarity. Reliability and validity were then tested using exploratory and confirmatory factor analysis with two samples of 220 and 201 postgraduate students. The confirmatory factor analysis results indicated good model fit for the dimensions of co-understanding and awareness. Univariate F tests showed no significant difference between males and females for (a) co-understanding of cognitive skills and aptitudes, (b) co-understanding of emotional state, (c) awareness of cognitive skills and aptitudes, and (d) awareness of emotional state. A final questionnaire included ten items for the co-understanding dimension and fifteen items for the awareness dimension.  相似文献   

10.
The objective of this project was to develop a multiple choice test of graphing skills appropriate for science students from grades seven through twelve. Skills associated with the construction and interpretation of line graphs were delineated, and nine objectives encompassing these skills were developed. Twenty-six items were then constructed to measure these objectives. To establish content validity, items and objectives were submitted to a panel of reviewers. The experts agreed over 94% of the time on assignment of items to objectives and 98% on the scoring of items. TOGS was first administered to 119 7th, 9th, and 11th graders. The reliability (KR-20) was 0.81. Poorly functioning items were rewritten based on the item difficulty and discrimination data. The revised version of the test was given to 377 7th through 12th grade students. Total scores ranged from 2 to 26 correct (X = 13.3, S.D. = 5.3). The reliability (KR-20) was 0.83 for all subjects and ranged from 0.71 for eighth graders to 0.88 for ninth graders. Point biserial correlations showed 24 of the 26 items above 0.30 with an average value of 0.43. It was concluded from this and other data that TOGS was a valid and reliable instrument for measuring graphing abilities.  相似文献   

11.
This report sets out the procedures followed in developing a set of science items to test a variety of intellectual skills deemed important in secondary school science and then analyzing them in order to examine their construct validity in relation to a technique of evaluation which analyzes the way in which individuals structure their responses to previously learned material (Biggs & Collis, 1982). The items covered the four sciences commonly taught in Australian schools, Geology, Biology, Physics, and Chemistry. Each item followed the superitem format devised by Cureton (1965) and consisted of a stem followed by four questions. Each group of four questions was devised so that they formed a hierarchy of difficulty levels. Nineteen of the items finally accepted as meeting the initial criteria were arranged for group testing to enable a validation trial to be carried out. The analysis showed that the items had construct validity in terms of the theory and were viable for testing certain science skills at the High School level. Implications of the study point to a need for further investigations in both the curriculum and teaching areas of school science.  相似文献   

12.
The aim of this study is to validate an instrument measuring students’ academic behavioral skills and engagement—skills identified as vital for student achievement. We inspect the reliability and validity of the survey with respect to item fit, factorial structure, relations with academic performance, and the fairness of the items across student groups. The fairness analyses are critical to making valid comparisons between groups and across countries. Data comprising 8520 grade 10 students from four countries were analysed using item response theory. We found that both scales were multidimensional, acted fairly across students’ gender, country, immigrant-, and socio-economic background (after removing four items), and were positively and significantly correlated with self-reported and performance-based academic performance.  相似文献   

13.
This research centers on the psychometric examination of the structure of an instrument, known as the 5E Lesson Plan (5E ILPv2) rubric for inquiry-based teaching. The instrument is intended to measure an individual’s skill in developing written 5E lesson plans for inquiry teaching. In stage one of the instrument’s development, an exploratory factor analysis on a fifteen-item 5E ILP instrument revealed only three factor loadings instead of the expected five factors, which led to its subsequent revision. Modifications in the original instrument led to a revised 5E ILPv2 instrument comprised of twenty-one items. This instrument, like its precursor, has a scoring scale that ranges from zero to four points per item. Content validity of the 5E ILPv2 was determined through the expertise of a panel of science educators. Over the course of five semesters, three elementary science methods instructors in three different universities collected post lesson plan data from 224 pre-service teachers enrolled in their courses. Each instructor scored their students’ post 5E inquiry lesson plans using the 5E ILPv2 instrument recording a score for each item on the instrument. A factor analysis with maximum likelihood extraction and promax oblique rotation provided evidence of construct validity for five factors and explained 85.5 % of the variability in the total instrument. All items loaded with their theoretical factors exhibiting high ordinal alpha reliability estimates of .94, .99, .96, .97, and .95 for the engage, explore, explain, elaborate, and evaluate subscales respectively. The total instrument reliability estimate was 0.98 indicating strong evidence of total scale reliability.  相似文献   

14.
The study reported here concerns the development and predictive validation of an instrument to assess the achievement outcomes of DE/online learning success. A 38‐item questionnaire was developed and administered to 167 students who were about to embark on an online course. Factor analysis indicated a four‐factor solution, interpreted as “general beliefs about DE,” “confidence in prerequisite skills,” “self‐direction and initiative” and “desire for interaction.” Using multiple regression we found that two of these factors predicted achievement performance (i.e., Cumulative Course Grade). Comparisons of pretest and posttest administrations of the questionnaire revealed that some changes in opinion occurred between the beginning and the end of the course. Also, categories of demographic characteristics were compared on the four factors. The overall results suggest that this instrument has some predictive validity in terms of achievement, but that Cumulative Grade Point Average (i.e., the university's record of overall achievement) is a much better predictor.  相似文献   

15.
This article describes the development and validation of an instrument that can be used for content analysis of inquiry-based tasks. According to the theories of educational evaluation and qualities of inquiry, four essential functions that inquiry-based tasks should serve are defined: (1) assisting in the construction of understandings about scientific concepts, (2) providing students opportunities to use inquiry process skills, (3) being conducive to establishing understandings about scientific inquiry, and (4) giving students opportunities to develop higher order thinking skills. An instrument – the Inquiry-Based Tasks Analysis Inventory (ITAI) – was developed to judge whether inquiry-based tasks perform these functions well. To test the reliability and validity of the ITAI, 4 faculty members were invited to use the ITAI to collect data from 53 inquiry-based tasks in the 3 most widely adopted senior secondary biology textbooks in Mainland China. The results indicate that (1) the inter-rater reliability reached 87.7%, (2) the grading criteria have high discriminant validity, (3) the items possess high convergent validity, and (4) the Cronbach’s alpha reliability coefficient reached 0.792. The study concludes that the ITAI is valid and reliable. Because of its solid foundations in theoretical and empirical argumentation, the ITAI is trustworthy.  相似文献   

16.
Literature relating to the well‐being of older adults was reviewed to identify indicators relevant to the construct of self‐responsibility for wellness. The wellness model proposed by Travis (1981) has produced a variety of concepts which can be useful in improving the quality of life for older adults. The purpose of this study was to develop an instrument which would assess an individual's self‐responsibility for wellness. A 47‐item instrument developed for this purpose was evaluated by experts in gerontology and psychology. After revision and reevaluation it was field‐tested on a sample of 180 older adults (60 years of age and over). In order to take preliminary steps in establishing the validity and reliability of this instrument, the data were evaluated and an item analysis conducted to identify poor items. Cronbach's coefficient alpha was also computed (α = .90). A test‐retest correlation coefficient was computed, and an analysis of variance was performed to test for the relationship between self‐responsibility for wellness and demographic variables obtained during the field test.

The field testing of the instrument served as an educational needs assessment study. Evidence has been provided that there is a significant need for education programs which can provide training in the wellness skills as assessed by the instrument.  相似文献   

17.
The so-called control-of-variables strategy (CVS) incorporates the important scientific reasoning skills of designing controlled experiments and interpreting experimental outcomes. As CVS is a prominent component of science standards appropriate assessment instruments are required to measure these scientific reasoning skills and to evaluate the impact of instruction on CVS development. A detailed review of existing CVS instruments suggests that they utilize different, and only a few of the four, critical CVS sub-skills in the item development. This study presents a new CVS assessment instrument (CVS Inventory, CVSI) and investigates the validity of student measures derived from this instrument utilizing Rasch analyses. The results indicate that the CVSI produces reliable and valid student measures with regard to CVS. Furthermore, the results show that the item difficulty depends on the CVS sub-skills utilized in item development, but not on the item content. Accordingly, previous instruments that are restricted to a few CVS sub-skills tend to over- or underestimate students’ CVS skills. In addition, these results indicate that students are able to use CVS as a domain general strategy in multiple content areas. Consequences for science instruction and assessment are discussed.  相似文献   

18.
This article describes the development of an instrument – the Career and Talent Development Self‐Efficacy Scale (CTD‐SES) – for assessing students’ self‐efficacy in applying life skills essential for personal talent development, acquisition of positive work habits, and career exploration. In Study 1, data were obtained from a large sample of Chinese middle‐school students (N=15,113) in Grades 7–9 in Hong Kong. The CTD‐SES is an 18‐item questionnaire with subscales containing items that address students’ orientations toward developing their own talents, acquiring and applying positive work habits, and exploring their career possibilities. Evidence is provided for internal consistency, temporal stability, and factor structure of the CTD‐SES. Goodness of fit statistics provided support for a three‐primary‐factor‐plus‐higher‐factor model, and this solution was used in the statistical analyses. The data also indicated that students with plans for university study reported significantly higher scores than those without on all three domains of career and talent development. In Study 2 (N=308) Grade 10 high‐ability students’ scores in CTD‐SES were correlated with scores in career decision self‐efficacy and academic performance. The development and validation of CTD‐SES is the first step toward investigating career exploration, work habits, and talent development among Asian middle‐school adolescents.  相似文献   

19.
This study aimed to develop an instrument, named the inquiry-based laboratory classroom environment instrument (ILEI), for assessing senior high-school science students’ preferred and perceived laboratory environment. A total of 262 second-year students, from a senior-high school in Taiwan, were recruited for this study. Four stages were included in the development of the ILEI: (1) item formulation, (2) content validation, (3) construct validation and (4) reliability calculation. The study revealed that the ILEI was valid and reliable. Most students showed a preference for a student-directed and teacher-guided hybrid learning environment. The development of the ILEI is likely to help researchers and teachers to more effectively understand student views of the laboratory classroom environment. Our profound hope is to utilise ILEI to improve teachers’ assessment and students’ learning in inquiry-based laboratory classroom environments.  相似文献   

20.
This study established a Chinese scale for measuring high school students’ ocean literacy. This included testing its reliability, validity, and differential item functioning (DIF) with the aim of compensating for the lack of DIF tests focusing on current scales. The construct validity and reliability were verified and tested by analyzing the established scale’s items using the Rasch model, and a gender DIF test was conducted to ensure the test results’ fairness when distinct groups were compared simultaneously. The results indicated that the scale established in this study is unidimensional and possesses favorable internal consistency and construct validity. The gender DIF test results indicated that several items were difficult for either female or male students to correctly answer; however, the experts and scholars discussed these items individually and suggested retaining them. The final Chinese version of the ocean literacy scale developed here comprises 48 items that can reflect high school students’ understanding of ocean literacy—which helps students understand the topics of marine science encountered in real life.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号