De validiteit van luistertoetsen Moderne Vreemde Talen

1978 ◽  
Vol 5 ◽  
pp. 59-74
Author(s):  
H.W.M. van den Nieuwenhof

For ten years now multiple choice tests have been used in the Dutch school system to measure listening comprehension of English, French and German. The tests were developed in a research program, conducted at the Insitute of Applied Linguistics by Dr. ? Groot. Now that the tests have been in use for 10 years we are confronted with the following questions. Are the tests still reliable, as they were 10 years ago? In how far does the multiple choice technique give a true picture of the listening comprehension of students? Does the multiple choice technique help studens to cope with language material that they could not have coped with otherwise, in other words, to what extent does the language material used in tests suggest a higher level of listening comprehension than the students actually have? An experiment has been carried out at C.I.T.O. (Central Institute for Test Development). Students had to answer both multiple choice questions and open ended questions concerning the same language material. The results suggested that the language material used in tests was verydifficult for students to handle in an open ended question test form. The results also suggested that various levels of difficulty of the langua material used within a single test was reflected in the open ended test results, but not in the results of the multiple choice tests. The multiple choice technique seems to obscure the relative difficulty of the various test components. It has been found that an appropriate use of the multiple choice technique can cover only a restricted range of language material. The measuring technique must not restrict the choice of language material, and thereby influence content validity. A possible solution to the problem would be the development of a new kind of test. In this test a great variety of language material should be tested with a great variety of testing techniques: a great variety of language material in order to improve the content validity of the test, a great variety of testing techniques in order to reduce, as much as possi ble, the disadvantages of every single testing technique by itself.

1979 ◽  
Vol 1 (2) ◽  
pp. 24-33 ◽  
Author(s):  
James R. McMillan

Most educators agree that classroom evaluation practices need improvement. One way to improve testing is to use high-quality objective multiple-choice exams. Almost any understanding or ability which can be tested by another test form can also be tested by means of multiple-choice items. Based on a survey of 173 respondents, it appears that marketing teachers are disenchanted with multiple-choice questions and use them sparingly. Further, their limited use is largely in the introductory marketing course even though there are emerging pressures for universities to take a closer look at the quality of classroom evaluation at all levels.


2018 ◽  
Vol 22 (2) ◽  
pp. 219-230 ◽  
Author(s):  
Khoirul Bashooir ◽  
Supahar Supahar

Penelitian ini merupakan bagian dari penelitian pengembangan asesmen kinerja literasi sains berbasis STEM pada pembelajaran fisika. Tujuan dari penelitian ini adalah untuk mengungkapkan validitas isi, validitas empiris, dan reliabilitas instrumen asesmen kinerja literasi sains berbasis STEM yang sebelumnya telah disusun. Instrumen yang dikembangkan berupa lembar pengamatan dan tes pilihan ganda. Analisis validitas isi dari lembar pengamatan menggunakan Koefisien V oleh Aiken sedangkan validitas isi instrumen tes dianalisis dengan menggunakan CVI (Content Validity Index) oleh Lawshe. Validitas empiris reliabilitas instrumen tes diestimasi dengan IRT (Item Response Theory). Reliabilitas lembar pengamatan ditentukan dengan ICC (Item Correlation Coefficient). Hasil dari penelitian ini menunjukan bahwa (1) Lembar pengamatan berupa rubrik penskoran dan penilaian diri  terbuktivalid dengan koefisien  V Aiken 0,75 dan reliabel dengan koefisien Reliabilitas Alfa > 0,8 dan ICC yang Excellent. (2) Instrumen tes terbukti realiabel untuk digunakan pada peserta didik dengan kategori sedang sampai dengan tinggi (-0,7 sampai dengan 6,7 ) dengan CVI=1 dan INFIT MNSQ sesuai model Rasch. Berdasarkan hasil penelitian tersebut maka asesmen kinerja Literasi Sains berbasis STEMlayak digunakan.Kata kunci:validitas isi, validitas empiris, asesmen kinerja, literasi sains, STEM VALIDITY AND RELIABILITY INSTRUMENT OF SCIENTIFIC LITERACY PERFORMANCE ASSESSMENT IN PHYSICS TEACHING BASED ON STEMAbstractThis research is part of the development of scientific literacy performance assessment based on STEM in teaching physics. The aim of this research is to reveal the validity (content and also empiric) and reliability of scientific literacy performance assessment instrument based on STEM. The kind of instruments were developed are observational sheet and multiple choice test. The content validity of observational sheet was revealed by used the Aiken’s V Coefficient. The content validity of multiple choice tests was revealed by used Content Validity Index (CVI) which proposed by Lawshe. The empirical validity and reliability of multiple choice tests was revealed by used Item Response Theory Analysis. The reliability of observational sheet was revealed by used ICC (Item Correlation Coefficient) Analysis. The results of this study are the validity from the contents and empirical trials from the developed instruments. The observation sheet from scoring rubric and self-assessment has been valid with Aiken’s V value that exceeds the standard of 0,75. The reliability of the scoring rubric has Alfa Reliability> 0.8 and Excellent of ICC. Validity values from The written test is shown with CVI of 1 and the MNSQ INFIT value which match to the Rasch model. Based on the TIC and SEM graphs, the written test is stated to be reliable for use in students with moderate to high categories (-0.7 to 6.7). STEM-based Science Literacy performance assessment with caloric material is appropriate to use.Keywords: content validity, empirical validity, performance assessment, scientific literacy, STEM


2018 ◽  
Vol 22 (2) ◽  
pp. 168-181 ◽  
Author(s):  
Raden Roro Yayuk Srirahayu ◽  
Indyah Sulistyo Arty

Penelitian ini merupakan bagian dari penelitian pengembangan asesmen kinerja literasi sains berbasis STEM pada pembelajaran fisika. Tujuan dari penelitian ini adalah untuk mengungkapkan validitas isi, validitas empiris, dan reliabilitas instrumen asesmen kinerja literasi sains berbasis STEM yang sebelumnya telah disusun. Instrumen yang dikembangkan berupa lembar pengamatan dan tes pilihan ganda. Analisis validitas isi dari lembar pengamatan menggunakan Koefisien V oleh Aiken sedangkan validitas isi instrumen tes dianalisis dengan menggunakan CVI (Content Validity Index) oleh Lawshe. Validitas empiris reliabilitas instrumen tes diestimasi dengan IRT (Item Response Theory). Reliabilitas lembar pengamatan ditentukan dengan ICC (Item Correlation Coefficient). Hasil dari penelitian ini menunjukan bahwa (1) Lembar pengamatan berupa rubrik penskoran dan penilaian diri  terbuktivalid dengan koefisien  V Aiken 0,75 dan reliabel dengan koefisien Reliabilitas Alfa > 0,8 dan ICC yang Excellent. (2) Instrumen tes terbukti realiabel untuk digunakan pada peserta didik dengan kategori sedang sampai dengan tinggi (-0,7 sampai dengan 6,7 ) dengan CVI=1 dan INFIT MNSQ sesuai model Rasch. Berdasarkan hasil penelitian tersebut maka asesmen kinerja Literasi Sains berbasis STEMlayak digunakan.Kata kunci:validitas isi, validitas empiris, asesmen kinerja, literasi sains, STEM VALIDITY AND RELIABILITY INSTRUMENT OF SCIENTIFIC LITERACY PERFORMANCE ASSESSMENT IN PHYSICS TEACHING BASED ON STEMAbstractThis research is part of the development of scientific literacy performance assessment based on STEM in teaching physics. The aim of this research is to reveal the validity (content and also empiric) and reliability of scientific literacy performance assessment instrument based on STEM. The kind of instruments were developed are observational sheet and multiple choice test. The content validity of observational sheet was revealed by used the Aiken’s V Coefficient. The content validity of multiple choice tests was revealed by used Content Validity Index (CVI) which proposed by Lawshe. The empirical validity and reliability of multiple choice tests was revealed by used Item Response Theory Analysis. The reliability of observational sheet was revealed by used ICC (Item Correlation Coefficient) Analysis. The results of this study are the validity from the contents and empirical trials from the developed instruments. The observation sheet from scoring rubric and self-assessment has been valid with Aiken’s V value that exceeds the standard of 0,75. The reliability of the scoring rubric has Alfa Reliability> 0.8 and Excellent of ICC. Validity values from The written test is shown with CVI of 1 and the MNSQ INFIT value which match to the Rasch model. Based on the TIC and SEM graphs, the written test is stated to be reliable for use in students with moderate to high categories (-0.7 to 6.7). STEM-based Science Literacy performance assessment with caloric material is appropriate to use.Keywords: content validity, empirical validity, performance assessment, scientific literacy, STEM


2010 ◽  
Vol 1 (4) ◽  
pp. 32-41 ◽  
Author(s):  
E. Serradell-Lopez ◽  
P. Lara ◽  
D. Castillo ◽  
I. González

The purpose of this paper is to determine the effectiveness of using multiple choice tests in subjects related to the administration and business management. To this end the authors used a multiple-choice test with specific questions to verify the extent of knowledge gained and the confidence and trust in the answers. The analysis made, conducted by tests given out to a group of 200 students, has been implemented in one subject related with investment analysis and has measured the level of knowledge gained and the degree of trust and security in the responses at two different times of the business administration and management course. Measurements were taken into account at different levels of difficulty in the questions asked and the time spent by students to complete the test. Results confirm that students are generally able to obtain more knowledge along the way and get increases in the degree of trust and confidence. It is estimated that improvement in skills learned is viewed favourably by businesses and are important for job placement. Finally, the authors proceed to analyze a multi-choice test using a combination of knowledge and confidence levels.


1979 ◽  
Vol 49 (2) ◽  
pp. 445-446 ◽  
Author(s):  
B. A. Bracken ◽  
T. L. Ledford ◽  
R. S. McCallum

Ability to perform successfully on multiple-choice tests was assessed for students displaying various cognitive styles. Male and female undergraduate students were classified according to right, left, or integrated cerebral functioning as determined by Your Style of Learning and Thinking test. The students participated in introductory classes in educational psychology and completed multiple-choice questions designed to assess content. The effects of cerebral dominance on student's ability to complete multiple-choice questions successfully were determined. Students designated by SOLAT as left dominant correctly completed significantly more multiple-choice questions than did right-dominant students. Implications for education were discussed.


2021 ◽  
Vol 99 ◽  
pp. 01032
Author(s):  
Svetlana Vlazneva ◽  
Olga Androsova

The article is devoted to assessment tools in teaching economics. The authors distinguish and define four levels of understanding economics: elementary, intermediate, systemic and creative. They describe multiple choice questions and essay questions as two possible assessment tools in teaching economics. Multiple-choice questions are represented as the most popular testing format. The advantages of multiple-choice questions include low grading costs, perceived objectivity and availability of comparative analysis. The authors have developed multiple-choice tests, which measure students’ knowledge at three first levels of understanding economics. They enable instructors to see where exactly the students’ understanding has stopped and provide guidance. The authors conclude that multiple-choice questions can be used to measure the basic levels of students’ understanding economics. In measuring higher levels the essay as an assessment tool has a great potential. The authors highlight the advantages and pitfalls of essay testing in economics.


2019 ◽  
Vol 38 (1) ◽  
pp. 120-129 ◽  
Author(s):  
Destri Sambara Sitorus ◽  
Siswandari Siswandari ◽  
Kristiani Kristiani

This study was aimed to examine the effectiveness of e-accounting module integrated character values to improve students’ learning outcomes and honesty. This was motivated by lack of students' understanding of accounting materials so that students tended to take dishonest actions like cheating while doing assignments or examinations. Honesty is one of the characters developed in many curriculla, so that honest character needs to be integrated in learning activities. The data collected in this study were the data on students’ learning outcomes collected through multiple choice tests and the data on students’ honesty collected through questionnaires. Students’ learning outcomes data were analyzed through independent sample t test and the data on students' honesty were analyzed descriptively by narrative. The t test results obtained sig values 0.014 < 0.05 so that there were significant differences between the learning outcomes of the experimental class and the control class. The results of the questionnaire analysis showed that the students’ honesty level of the experimental class was in a very good category and the control class was in the good category.


2020 ◽  
Vol 2 (2) ◽  
pp. 115-128
Author(s):  
Dio Aditya

This research aimed to know the significant effect of Gamification based on Balinese Local Story toward students’ listening comprehension. The implementation of one group pretest posttest was a design for collecting the data. The pretest and posttest was in the form of multiple choice tests. The research population consisted of 164 students of SD Negeri 2 Anturan. Then, the research sample was 5th grade students who consisted of 21 students. The result of the data presented that, there was significance different score between pretest and posttest. The mean score of pretest was 43.37. In contrast, the mean score of posttest was 79.74. Based on the results, the mean score of posttest was higher than pretest. The result of effect size was 3.190 that means to large effect. The result of effect size showed Gamification based on Balinse local story could improve students’ listening comprehension.


2020 ◽  
Vol 8 (2) ◽  
pp. 126
Author(s):  
Nur Utami Amaliah ◽  
I Wayan Darmadi ◽  
Sahrul Saehana

This study aims to determine and analyze students' understanding of the motion concept taught with video-based learning assisted by Tracker Software. The subjects of this descriptive qualitative research were 30 students from 10th grade at SMA Negeri 5 Palu. The instruments in this study were multiple-choice tests and interviewed guide. Multiple-choice tests are used with reasons and are equipped with a Specific Response Index (CRI). It was given before and after video-based learning assisted by Tracker Software applied. To add the information about students' understanding, the interview with a few respondents was done. Respondents were selected based on subjects' written test results in the high, medium, and low categories. They also were interviewed related to their conception and certainty of their answers. In learning by using video-assisted by tracker software, the student also employed the worksheet in experimenting. The results showed that a conceptual understanding of student motion on the initial test could be categorized as low. After using video media that is assisted by Tracker Software, students' conceptual understanding increases. Thus, the teacher can use video media and software to teach physics concepts that can be directly observed and displayed using mathematical and graphical representations.


2020 ◽  
Vol 1 (1) ◽  
Author(s):  
Abdul Malik Ibrahim ◽  
Laelia Nurpratiwiningsih ◽  
Diah Sunarsih

Abstract. Education is one effort that is conscious of the goal with systematically directed at changing behavior towards maturity of students. The method used in this research is quantitative correlation. Data collection techniques using questionnaires, tests and documentation. This quantitative research instrument uses a questionnaire and multiple choice tests. The results obtained are (1) there is an influence of motivation with learning outcomes obtained a significant value of probability of 0.026, which means less than 0.05, (2) there is an influence of motivation towards responsibility from the t test results obtained a significant value of probability  of (0,000<0.05), (3) motivation towards learning outcomes and responsibility from the F test results obtained a significance score of 0,000, so it is concluded motivation is a change in energy in a person characterized by the emergence of “feeling” and preceded by a response to the existence of goals.


Sign in / Sign up

Export Citation Format

Share Document