Test Items Analysis of Mathematical Problem Solving Ability using a Classical Test Theory Approach

2021 ◽  
Vol 22 (1) ◽  
pp. 98-111
Author(s):  
Muhammad Rais Ridwan ◽  
Edi Istiyono ◽  
Widihastuti Widihastuti
2021 ◽  
Vol 5 (2) ◽  
pp. 210-221
Author(s):  
Anis Faridah

The problem motivating this research is that the final semester assessment items for the History subject were developed without an item analysis stage, so the quality of the items was unknown. This research is a quantitative descriptive study. Its purpose is to describe the characteristics of the final semester exam items for grade XI in the History subject at SMA Negeri 1 Pangkalpinang using the classical test theory approach. The research subjects were 138 grade XI students in the Social Sciences major. The results show that the grade XI History final exam items at SMA Negeri 1 Pangkalpinang are fit for use. This is supported by item validity: 39 items (97.5%) are empirically valid, with a reliability coefficient of 0.818. In addition, 27 items (67.5%) meet the criteria for difficulty level, discriminating power, and distractor function, so they can be used directly to measure student ability without revision. Meanwhile, 12 items (30%) need revision and 1 item (2.5%) is declared invalid, so it cannot be used to measure student ability in the History subject.
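The item statistics named in this abstract (difficulty level, discriminating power, and a reliability coefficient) can be illustrated with a minimal Python sketch. This is not the authors' procedure: the abstract does not say which discrimination index or reliability formula was used, so the corrected point-biserial correlation and KR-20 below are assumptions, and the function names and the 0/1 response matrix are hypothetical.

```python
import math

def item_difficulty(responses):
    """Proportion of examinees answering each item correctly (classical p-value)."""
    n = len(responses)
    k = len(responses[0])
    return [sum(row[j] for row in responses) / n for j in range(k)]

def point_biserial(responses, j):
    """Correlation between item j (0/1) and the total score on the other items.

    Assumption: this corrected item-total correlation stands in for the
    'discriminating power' index, which the abstract does not specify.
    """
    x = [row[j] for row in responses]
    y = [sum(row) - row[j] for row in responses]  # rest score, item j excluded
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return float("nan")  # undefined when the item or rest score is constant
    return cov / (sx * sy)

def kr20(responses):
    """Kuder-Richardson 20 reliability for dichotomously scored items.

    Assumption: KR-20 is used here for illustration; the abstract only
    reports 'a reliability coefficient'.
    """
    n = len(responses)
    k = len(responses[0])
    p = item_difficulty(responses)
    totals = [sum(row) for row in responses]
    mean = sum(totals) / n
    var = sum((t - mean) ** 2 for t in totals) / n  # population variance
    if var == 0:
        return float("nan")
    return (k / (k - 1)) * (1 - sum(pi * (1 - pi) for pi in p) / var)

# Hypothetical toy data: 5 examinees (rows) by 4 items (columns), 1 = correct.
toy = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(item_difficulty(toy))
print(point_biserial(toy, 0))
print(kr20(toy))
```

A distractor analysis would need the raw option choices rather than 0/1 scores, so it is left out of this sketch.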


2019 ◽  
Vol 23 (1) ◽  
pp. 124-153 ◽  
Author(s):  
Daniel R. Smith ◽  
Michael E. Hoffman ◽  
James M. LeBreton

This article provides a review of the approach that James used when conducting item analyses on his conditional reasoning test items. That approach was anchored in classical test theory. Our article extends this work in two important ways. First, we offer a set of test development protocols that are tailored to the unique nature of conditional reasoning tests. Second, we further extend James’s approach by integrating his early test validation protocols (based on classical test theory) with more recent protocols (based on item response theory). We then apply our integrated item analytic framework to data collected on James’s first test, the conditional reasoning test for relative motive strength. We illustrate how this integrated approach furnishes additional diagnostic information that may allow researchers to make more informed and targeted revisions to an initial set of items.


Author(s):  
Reyhaneh Aminalroaya ◽  
Fatemeh Sadat Mirzadeh ◽  
Kazem Heidari ◽  
Mahtab Alizadeh-Khoei ◽  
Farshad Sharifi ◽  
...  

This validation study evaluated the Iranian Modified Barthel Index (MBI) in hospitalized elderly patients with acute stroke using the classical test theory approach, applied Rasch analysis to the Iranian versions of both the MBI and the Barthel Index (BI), and compared their hierarchical item difficulty. In a cross-sectional study, face-to-face interviews were conducted with 100 geriatric stroke inpatients aged 60 or older, or with their caregivers. First, the construct validity of the MBI was analyzed using classical test theory; then Rasch analysis was performed for the BI and the MBI. The reliability of the Iranian MBI was 0.955. One factor was extracted, explaining 83.2% of the variance. In the Rasch analysis of the MBI, the most difficult item was stair climbing, whereas the easiest items were bowel and bladder control. In the BI, the most difficult items were toilet use and ambulation. The Iranian MBI is highly accurate and reliable; its use is therefore recommended over the BI for measuring outcomes in elderly stroke inpatients.


2019 ◽  
Vol 9 (2) ◽  
pp. 133-146
Author(s):  
Yance Manoppo ◽  
Djemari Mardapi

This study aimed to reveal: (1) the characteristics of the items of the Chemistry Test in the National Examination using classical test theory and item response theory; (2) the amount of cheating that occurred, as detected by Angoff's B-index Method, Pair 1 Method, Pair 2 Method, Modified Error Similarity Analysis (MESA) Method, and G2 Method; (3) which methods detect the most cheating in the implementation of the Chemistry Test in the National Examination for high schools in the year 2011/2012 in Maluku Province. The results of the analysis with the classical test theory approach show that 77.5% of items have well-functioning item difficulty, 55% of items have adequate discrimination, and 70% of items have distractors that work well, with a test reliability index of 0.772. The analysis using the item response theory approach shows that 14 (35%) items fit the model, the maximum of the information function is 11.4069 at θ = -1.6, and the magnitude of the error of measurement is 2.296. The number of pairs suspected of cheating is as follows: 13 pairs according to Angoff's B-index Method, 212 pairs according to Pair 1 Method, 444 pairs according to Pair 2 Method, 7 pairs according to MESA Method, and 102 pairs according to G2 Method. In descending order of the number of pairs detected, the methods are Pair 2, Pair 1, G2, Angoff's B-index, and MESA.


2011 ◽  
Vol 16 (2) ◽  
pp. 109
Author(s):  
Mr Nahadi ◽  
Mrs Wiwi Siswaningsih ◽  
Mr Ana Rofiati

This research is titled “Test Development and Analysis of First Grade Senior High School Final Examination in Chemistry Based on Classical Test Theory and Item Response Theory”. It was conducted to develop a standard test instrument for the first grade senior high school final examination, using analysis based on classical test theory and item response theory. The test is a multiple choice test consisting of 75 items, each with five options. The research method is a research and development method intended to produce test items that fulfill the item criteria: validity, reliability, item discrimination, item difficulty, and distractor quality under classical test theory; and validity, reliability, item discrimination, item difficulty, and pseudo-guessing under item response theory. The three-parameter item response theory model is used in this research. The research and development method was carried out up to the preliminary field test, with 102 first grade senior high school students. Based on the research results, the test fulfills the criteria for a good instrument under both classical test theory and item response theory. The final examination items vary in quality, so some of them need revision of either the stem or the options. Of the 75 test items, 21 were rejected and 54 were accepted.
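For reference, the three-parameter logistic model named in this abstract is usually written in the form below. The symbols are the conventional ones and are assumptions here, since the abstract does not give its notation: θ is examinee ability, a_i is the discrimination of item i, b_i its difficulty, and c_i its pseudo-guessing parameter. Some presentations also include a scaling constant D ≈ 1.7 in the exponent, which is omitted in this sketch.

```latex
P_i(\theta) = c_i + (1 - c_i)\,\frac{1}{1 + e^{-a_i(\theta - b_i)}}
```

With five options per item, as in this test, a purely random guess would succeed with probability 0.2, which gives a rough reference point for interpreting estimated c_i values.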


2020 ◽  
Vol 24 (2) ◽  
Author(s):  
Atin Argianti ◽  
Heri Retnawati

The National Standardized School Exam (USBN) is used to determine students' graduation. This research aims to determine the characteristics of the items of the Math USBN for grade 9 of junior high school (SMP). It is a descriptive-explorative quantitative study. The data consist of the USBN test instrument used at SMP Negeri 3 Pati and the participants' answers, collected through documentation. The USBN instrument was validated by experts, and the characteristics of its items were analyzed using the classical test theory approach. The items of the Math USBN test at SMP Negeri 3 Pati are generally moderately good. Based on the classical test theory approach, the validity result is 0.924, and 56.7% of the items are very valid. The reliability is 0.78, which is categorized as high. Most Math USBN items (83.3%) fall in the easy category. The discrimination index results show that 60% of the items fall in the moderate category. The distractor effectiveness results show that 50% of the items fall in the functional category.

