Similar literature
 Found 20 similar documents (search time: 390 ms)
1.
Should computer-based study tasks use multiple-choice or constructed-response question format? It was hypothesized that a constructed-response study task (CR) with feedback would be superior to multiple-choice study tasks that allowed either single or multiple tries (STF and MTF). Two additional recognition study task treatments were included that required an overt constructed response after feedback (STF+OR and MTF+OR) in order to control for possible confounding caused by response form mismatch between the recognition study task and recall posttest. Graduate students (N=133) were randomly assigned to one of the five computer-delivered treatments. Relative to STF, posttest effect sizes were: STF

2.
ABSTRACT

This study investigated how studying a refutational map, a type of argument map, affected conceptual change. Refutational maps visually display both correct and alternative conceptions. Participants (N = 120) were randomly assigned to (1) a refutational map condition, (2) a refutational text condition, and (3) a non-refutational text condition. The post-test results showed that studying the refutational map led to better performance on free recall and learning transfer measures. Specifically, participants who studied the refutational map performed significantly better than others on a free recall test, and they significantly outperformed the non-refutational text group on a short-answer transfer test. The multiple-choice test, another transfer measure, failed to detect any differences among the three groups. The research also found that individual differences in need for cognition and logical thinking ability interacted with the type of study materials. Participants scoring lower on logical thinking ability gained more from studying the refutational map.

3.
Equivalent forms of a ten-item completion test were constructed. The same test items then were rewritten in matching format and in multiple-choice format, resulting in two forms (A and B) of each of three types of test. All tests were administered to 73 examinees, and parallel-forms reliability coefficients (correlation between scores on A and B) were calculated. These empirically obtained values were compared to the values of the reliability coefficient predicted from theoretically derived equations which indicate the influence of chance success due to guessing on test reliability. In accordance with theory it was found that the completion test was more reliable than the matching test and that the matching test was more reliable than the multiple-choice test. The empirically obtained reliability coefficients were very close to those predicted from the mathematically derived formulas.
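
The parallel-forms coefficient mentioned in this abstract is the correlation between examinees' scores on forms A and B. A minimal Python sketch of that calculation follows. The function name and the sample scores are hypothetical illustrations, not data from the study, and the sketch does not attempt the guessing-adjusted prediction equations the abstract refers to.

```python
from math import sqrt


def parallel_forms_reliability(scores_a, scores_b):
    """Pearson correlation between scores on form A and form B.

    Both arguments are sequences of numbers, one pair per examinee.
    (Hypothetical helper for illustration only.)
    """
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two equal-length score lists with at least 2 examinees")

    n = len(scores_a)
    mean_a = sum(scores_a) / n
    mean_b = sum(scores_b) / n

    # Sums of cross-products and squared deviations around each form's mean.
    cross = sum((a - mean_a) * (b - mean_b) for a, b in zip(scores_a, scores_b))
    ss_a = sum((a - mean_a) ** 2 for a in scores_a)
    ss_b = sum((b - mean_b) ** 2 for b in scores_b)

    if ss_a == 0 or ss_b == 0:
        raise ValueError("scores on a form have no variance; correlation is undefined")

    return cross / sqrt(ss_a * ss_b)


if __name__ == "__main__":
    # Made-up scores for five examinees on a ten-item test.
    form_a = [7, 5, 9, 4, 8]
    form_b = [6, 5, 9, 3, 7]
    print(round(parallel_forms_reliability(form_a, form_b), 3))
```

A value near 1 would indicate that the two forms rank examinees almost identically; lower values would be expected when chance success from guessing adds noise to the scores.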

4.
In contrast to multiple-choice test questions, figural response items call for constructed responses and rely upon figural material, such as illustrations and graphs, as the response medium. Figural response questions in various science domains were created and administered to a sample of 4th-, 8th-, and 12th-grade students. Item and test statistics from parallel sets of figural response and multiple-choice questions were compared. Figural response items were generally more difficult, especially for questions that were difficult (p < .5) in their constructed-response forms. Figural response questions were also slightly more discriminating and reliable than their multiple-choice counterparts, but they had higher omit rates. This article addresses the relevance of guessing to figural response items and the diagnostic value of the item type. Plans for future research on figural response items are discussed.

5.
Abstract

A test of achievement motivation was administered to 260 sixth graders. One month later, students participated in a science reading lesson followed by a multiple-choice test based upon that lesson. Feedback regarding performance was provided according to a standardized procedure either immediately after the test, or with one, two or three day delays. Then a retention test was administered to each group three days after feedback. Results demonstrated that students who received feedback with a delay of one day manifested greater retention than students who received immediate feedback. There were no significant differences among groups who were exposed to delays of one, two or three days. A positive relation between achievement motivation and retention was demonstrated. There was no interaction between achievement motivation and feedback schedules.

6.
Standardised and other multiple-choice examinations often require the use of an answer sheet with fill-in bubbles (i.e. ‘bubble’ or Scantron sheet). Students with disabilities causing impairments in attention, learning and/or visual-motor skill may have difficulties with multiple-choice examinations that employ such a response style. Such students may request and receive testing accommodations that intend to mitigate these impairments, such as circling responses in a test booklet, which contains both the questions and corresponding multiple-choice answers. The current study evaluated this test accommodation as compared to using a bubble sheet or Scantron on a multiple-choice vocabulary test. College students with (n = 25) and without (n = 76) disabilities completed a vocabulary test under both booklet (accommodated) and bubble sheet (standard) conditions. Results demonstrated that answering in a test booklet, a much preferred response mode, allowed students to attempt significantly more items than using a bubble sheet, improving their overall test scores. Booklet responding tends to improve overall performance, even for students without disabilities, calling into question the specificity and validity of this accommodation.

7.
Abstract

The purpose of this investigation was to develop and validate a simulation device to measure a teacher's ability to identify verbal and nonverbal emotions expressed by students (teacher affective sensitivity). The scale consists of videotaped excerpts of teacher-learner interactions and accompanying multiple-choice instrumentation. Respondents select the answer from each multiple-choice item that they believe most accurately describes the affective state of the pupil viewed on the monitor. Previously produced media focusing on classroom interactions were used to obtain the examples of learner affective expressions. Expert judges constructed two multiple-choice items for each simulation episode. Pilot test administrations allowed for numerous scale revisions. Finally, assessments of scale reliability, and scale construct, predictive, concurrent, and content validity were made.

8.
Study time and test performance change as a function of subjects expecting either open questions or a multiple-choice test on the contents of a history text. After studying a first history text, the subjects immediately received a test consisting of either open questions or multiple-choice items. They were led to expect the same type of test on a second text. Study time on Text 2 was self-paced, and the type of test was either the expected or unexpected one (either open questions or a multiple-choice test). The main hypothesis was confirmed that subjects consider open questions a more demanding test than a multiple-choice test. Accordingly, subjects expecting open questions on Text 2 used more study time and performed better on both types of test than did subjects expecting a multiple-choice test. Internal analyses revealed that the differences on study time and test achievement between the two expectation conditions occurred only when the subjects were thoroughly acquainted with the processing requirements of the learning material and its expected test.

9.
Abstract

College students (N=51) read a 600-word passage. Advance organizers (Ausubel) were either interspersed in the text contiguous with related paragraphs (part method) or were presented en masse before the entire passage (whole method). Additionally, a reading-only control group was employed. The whole and part method of presenting advance organizers yielded equivalent recall of passage information and neither experimental group exceeded the control group in this regard. However, both advance organizer groups produced significantly (p<.01) more recall than the control group of the organizers themselves. These results are similar to those commonly found with another type of reading aid, adjunct questions.

10.
11.
Objective: The present study extends field research on interviews with young children suspected of having been abused by examining multiple assessment interviews designed to be inquisitory and exploratory, rather than formal evidential or forensic interviews. Methods: Sixty-six interviews with 24 children between the ages of 3 and 6 years who were undergoing an assessment for suspected child abuse were examined. Each child was interviewed 2, 3, or 4 times. The interviewer's questions were categorized in terms of openness (open, closed or choice), in terms of the degree of interviewer input (free recall, direct, leading, suggestive), and for topic (whether the question was abuse-specific or nonabuse-related). Children's on-task responses were coded for amount of information (number of clauses) reported in relation to each question type and topic, and off-task responses were categorized as either ignoring the question or a diverted response. Results: Children provided a response to most questions, independent of question type or topic and typically responded with one or two simple clauses. Some children disclosed abuse in response to open-ended questions; generally, however, failure to respond to a question was more likely for abuse-specific than for nonabuse-related questions. Conclusion: The findings are discussed in terms of the growing literature on interviewing children about suspected abuse, particularly in interviews conducted over multiple sessions. Practice implications: Assessment of suspected child abuse may involve more than a single investigative interview. Research examining children's responses to questioning over multiple interviews (or single interviews conducted over multiple sessions) is necessary for the development of best practice guidelines for the assessment of abuse.

12.
Applied Measurement in Education, 2013, 26(1): 55-77
The effects of test consequences, response formats (multiple choice or constructed response), gender, and ethnicity were studied for the math and science sections of a high school diploma endorsement test. There was an interaction between response format and test consequences: Under both response formats, students performed better under high stakes (diploma endorsement) than under low stakes (pilot test), but the difference was larger for the constructed response items. Gender and ethnicity did not interact with test stakes; the means of all groups increased when the test had high stakes. Gender interacted with format; boys scored higher than girls on multiple-choice items, girls scored higher than boys on constructed-response items.

13.
Teachers often recommend that their students generate test questions and answers as a means of preparing for an exam. There is a paucity of research on the effects of this instructional strategy. Two recent studies showed positive effects of generating test questions relative to restudy, but these studies did not control for time on task. Moreover, the scarce research available has been limited to the effects of generating open-ended questions. Therefore, the aim of this study was to investigate whether generating multiple-choice test questions would foster retention (as measured by a multiple-choice test) relative to restudy when time would be kept constant across conditions. Using a 2 × 2 design, university students (N = 143) studied a text with the intention of either generating test items or performing well on a test, and then either generated multiple-choice items or restudied the text. Retention was measured by means of a multiple-choice test, both immediately after learning and after a one-week delay. Results showed no effects of study intention. Generating multiple-choice items resulted in lower test performance than restudying the text for the same amount of time.

14.
Abstract

This research involved the mastery of all steps in the multiplication of fractions by a superior group (IQ's, 125-165) and a strong normal group (IQ's, 100-109). Both groups were taught by the same person using the same instructional materials and methods. Initial arithmetic tests included a standardized test, a diagnostic test, and tests for each step. Step tests were administered thereafter to determine the elapsed times of mastery by each group and also recall scores, with the elapsed times kept equal. Following the last recall test, each group was given a second diagnostic test. Results: the superior group mastered each step faster than the normal group and retained a slightly higher degree of mastery.

15.
A new computerised testing system was used at home to promote learning and also to save classroom instruction time. The testing system combined the features of short-answer and multiple-choice formats. The questions of the multiple-choice problems were presented without the options so that students had to generate answers for themselves; they could click for the options when they were ready, and could choose one of the options within a brief, specified time period. One hundred thirty-eight Korean sixth-grade students (12-year-olds) were divided into two groups: the experimental group took the intervening test on social studies using the new computerised testing method, and the control group used a computerised version of the traditional multiple-choice method. A few days after the intervening computerised test, a recall posttest was given in paper-and-pencil format. The mean posttest score was greater for the experimental group than for the control group. Implications of the result are discussed.

16.
Abstract

The major aim of the present research was to compare students’ attitudes and dispositions toward teacher-made essay versus multiple-choice type exams. The primary study was conducted on a sample of 174 junior high school students, who were administered a test attitude inventory specifically designed to assess students’ attitudes towards essay versus multiple-choice type formats on a variety of critical dimensions. The study was partially replicated on a sample of 101 seventh- and eighth-grade students who were administered a modified version of the test attitude inventory that was used in the first study. Overall, the data from both studies were remarkably consistent, pointing to more favorable student attitudes towards multiple-choice compared to essay type formats on most dimensions assessed. The practical significance of the results for classroom test construction is discussed and some suggestions are made about potential future applications of test attitude inventories in the classroom setting.

17.
Abstract

The arrangement of response options in multiple-choice (MC) items, especially the location of the most attractive distractor, is considered critical in constructing high-quality MC items. In the current study, a sample of 496 undergraduate students taking an educational assessment course was given three test forms consisting of the same items but the positions of the most attractive distractor varied across the forms. Using a multiple-indicators–multiple-causes (MIMIC) approach, the effects of the most attractive distractor's positions on item difficulty were investigated. The results indicated that the relative placement of the most attractive distractor and the distance between the most attractive distractor and the keyed option affected students’ response behaviors. Moreover, low-achieving students were more susceptible to response-position changes than high-achieving students.

18.
Abstract

This study investigated the effects of retelling (free recall) upon the comprehension and recall of text information for 93 fourth-grade students. Subjects were assigned randomly to one of two generative learning strategy treatment conditions: retelling or illustrating. Subjects participated in four training sessions and one test session. For each of the four training sessions subjects silently read a passage and then, according to treatment condition, either retold the important parts of the passage or illustrated the important parts of the passage. For the test passage all subjects silently read the passage, and then rendered a free recall. Two days later all subjects rendered a delayed free recall and answered 10 literal and 10 inferential questions about the test passage. Statistically significant differences were found on all measures of reading comprehension and recall (immediate free recall, two-day delayed free recall, and responses to literal and inferential questions) in favor of the subjects who received practice in retelling. The results suggest that retelling is a highly potent generative learning strategy and that retelling has direct, beneficial consequences for children's processing of subsequent text.

19.
Item-response changing as a function of test anxiety was investigated. Seventy graduate students completed the Test Anxiety Scale and 73 multiple-choice items during the quarter. The data supported the hypothesis that high test-anxious students make more item-response changes than low test-anxious students. Results also suggested that both high- and low-anxious students profit to a similar extent proportionally from answer changing. It was further found that more responses were changed on difficult than on easy items for both high- and low-anxious students. Test anxiety is suggested as a factor forming test-taking style.

20.
Abstract

An attempt was made to extend and clarify prior research which had demonstrated consistently that changed answers to objective test items tend to be correct. Results extended the basic effect of profiting from changed answers to Air Force personnel responding to multiple-choice questions regarding technical skills; the profit from changes was very similar to that observed in a university group responding to relatively "academic" items. Second, most individuals in both groups profited from changes. Third, individuals with the highest test scores tended to profit more from changes than those with the lowest test scores. Fourth, neither Airman Qualifying Exam scores (for the military personnel) nor Scholastic Aptitude Test scores (for the university students) were related to profit. Finally, a systematic case against the popular belief that one should not change answers on objective tests was made, based on an integration of the research to date.
