AKURASI METODE KALIBRASI FIXED PARAMETER: STUDI PADA PERANGKAT UJIAN NASIONAL MATA PELAJARAN MATEMATIKA

Penelitian ini bertujuan untuk (1) mengidentifikasi karakteristik butir-butir tes pada perangkat soal ujian nasional mata pelajaran Matematika tingkat SMP tahun pelajaran 2009/2010 yang dikalibrasi dengan metode kalibrasi fixed parameter, dan (2) mengetahui metode kalibrasi fixed parameter yang paling akurat di antara metode NWU-OEM (no prior weights updating and one expectation-maximization cycle), NWU-MEM (no prior weights updating and multiple expectation-maximization cycles), OWU-OEM (one prior weights updating and one expectation-maximization cycle), OWU-MEM (one prior weights updating and multiple expectation-maximization cycles), dan MWU-MEM (multiple weights updating and multiple expectation-maximization cycles). Penelitian ini menggunakan pendekatan kuantitatif deskriptif. Subjek penelitian adalah data respons ujian nasional mata pelajaran Matematika tingkat SMP tahun pelajaran 2009/2010 dari provinsi DI Yogyakarta. Kriteria akurasi metode adalah nilai fungsi informasi tes dan kesalahan pengukuran. Hasil penelitian adalah sebagai berikut. (1) Statistik parameter butir-butir tes pada perangkat ujian nasional mata pelajaran Matematika tingkat SMP tahun pelajaran 2009/2010 menunjukkan rerata indeks daya beda butir berada pada interval [1,07 sampai 1,14], rerata indeks kesukaran butir [-0,35 sampai -0,20], dan rerata pseudo guessing < 0,25. Nilai theta-nilai kemampuan-pada posisi fungsi informasi butir menjadi maksimal menunjukkan grafik fungsi kelima metode kalibrasi fixed-parameter hampir berimpit. (2) Metode OWU-OEM merupakan metode yang paling akurat dalam mengestimasi parameter butir pada perangkat tes ujian nasional mata pelajaran Matematika tahun pelajaran 2009/2010.Kata kunci: akurasi, kalibrasi, fixed parameter, algoritma, Expectation-Maximization______________________________________________________________THE ACCURACY OF THE FIXED PARAMETER CALIBRATION METHOD:STUDY OF MATHEMATICS NATIONAL EXAMINATION TESTAbstract This study aimed to: (1) identify the characteristics of the test items on the mathematics test of the national examination which are calibrated with the fixed parameter calibration methods, and (2) reveal the most accurate fixed parameter calibration methods among NWU-OEM (no prior weights updating and one expectation-maximization cycle), NWU-MEM (no prior weights updating and multiple expectation-maximization cycles), OWU-OEM (one prior weights updating and one expectation-maximization cycle), OWU-MEM (one prior weights updating and multiple expectation-maximization cycles), and MWU-MEM (multiple weights updating and multiple expectation-maximization cycles) methods. This study used descriptive quantitative approach. The subject is the testee’ responses to the mathematics national examination in junior high school in 2009/2010. The criteria of the accuracy methods are TIF and SEM. The research results are as follows. (1) Item of statistical parameter on Mathematics national examination test in 2009/2010 showed the average of item discrimination on the interval [1.07, 1.14], the average of item difficulty on the interval [-0.35, -0.20], and the average of pseudo guessing is c < 0.25. Theta - ability - score where the item information function maximalist showed the function of five fixed-parameter calibration methods almost coincides. (2) OEM-OWU method is the most accurate in estimating the parameters on mathematics national examination test in 2009/2010. Keywords: Accuracy, Calibration, Fixed Parameter, Algorithm, Expectation-Maximization

Download Full-text

Two IRT Fixed Parameter Calibration Methods for the Bifactor Model

Journal of Educational Measurement ◽

10.1111/jedm.12230 ◽

2019 ◽

Vol 57 (1) ◽

pp. 29-50 ◽

Cited By ~ 1

Author(s):

Kyung Yong Kim

Keyword(s):

Bifactor Model ◽

Parameter Calibration ◽

Calibration Methods ◽

Fixed Parameter

Download Full-text

The quality of an English summative test of a public junior high school, Kupang-NTT

English Language Teaching Educational Journal ◽

10.12928/eltej.v3i2.2311 ◽

2020 ◽

Vol 3 (2) ◽

pp. 133

Author(s):

Thresia Trivict Semiun ◽

Fransiska Densiana Luruk

Keyword(s):

High School ◽

Public School ◽

Junior High School ◽

Content Validity ◽

Item Difficulty ◽

Item Analysis ◽

Item Discrimination ◽

Test Items ◽

Evaluative Research

This study aimed at examining the quality of an English summative test of grade VII in a public school located in Kupang. Particularly, this study examined content validity, reliability, and conducted item analysis including item validity, item difficulty, item discrimination, and distracter effectiveness. This study was descriptive evaluative research with documentation to collect data. The data was analyzed quantitatively except for content validity, which was done qualitatively. Content validity was analyzed by matching the test items with materials stated in the curriculum. The findings revealed that the English summative test had a high content validity. The reliability was estimated by applying the Kuder-Richardson’s formula (K-R20). The result showed that the test was reliable and very good for a classroom test. The item analysis was conducted by using ITEMAN 3.0. and it revealed that the the test was mostly constructed by easy items, most of the items could discriminate the students, most distracters were able to perform well, and the most of items were valid.

Download Full-text

A Comparative Study of IRT Fixed Parameter Calibration Methods

Journal of Educational Measurement ◽

10.1111/j.1745-3984.2006.00021.x ◽

2006 ◽

Vol 43 (4) ◽

pp. 355-381 ◽

Cited By ~ 43

Author(s):

Seonghoon Kim

Keyword(s):

Comparative Study ◽

Parameter Calibration ◽

Calibration Methods ◽

Fixed Parameter

Download Full-text

DIAGNOSIS KESALAHAN SISWA BERBASIS PENSKORAN POLITOMUS MODEL PARTIAL CREDIT PADA MATEMATIKA

Jurnal Penelitian dan Evaluasi Pendidikan ◽

10.21831/pep.v15i2.1099 ◽

2013 ◽

Vol 15 (2) ◽

pp. 308-325 ◽

Cited By ~ 2

Author(s):

Awal Isgiyanto

Keyword(s):

High School Students ◽

Junior High School ◽

Diagnostic Information ◽

Diagnostic Study ◽

Partial Credit Model ◽

Partial Credit ◽

School Students ◽

Test Items ◽

National Examination ◽

Post Hoc

Penelitian ini bertujuan untuk menemukan informasi diagnostik dari kesalahan jawaban peserta pada Ujian Nasional (UN) Matematika. Informasi diagnostik yang ditemukan meli-puti atribut yang mendasari butir soal, ketidaktuntasan atribut, dan jenis kesalahan yang dilakukan oleh peserta. Penelitian ini merupakan diagnosis post-hoc, yang digambarkan sebagai pende-katan retrofitting. Analisis butir soal dan respons butir pada UN mata pelajaran matematika untuk menemukan informasi diag-nostik pada kategori isi, proses, dan keterampilan siswa SMP di Bantul Yogyakarta 2007/2008. Hasil penelitian (1) atribut yang mendasari butir soal pada matematika ada 47 atribut, meliputi 4 atribut isi, 36 atribut proses, dan 7 atribut keterampilan, (2) ketidaktuntasan atribut isi, proses, dan keterampilan yang ter-tinggi pada geometri dan pengukuran, (3) jenis kesalahan ter-tinggi pada bilangan, aljabar, dan geometri dan pengukuran adalah kesalahan konsep, dan jenis kesalahan tertinggi pada sta-tistika dan peluang adalah kesalahan interpretasi bahasa, dan (4) penemuan informasi diagnostik data UN Matematika dapat dilakukan melalui mekanisme identifikasi atribut, pengembang-an rubrik penskoran politomus, perhitungan ketidaktuntasan atribut, dan diagnosis kesalahan peserta tes. Kata kunci: diagnostik, matematika, partial credit model ______________________________________________________________A DIAGNOSIS OF STUDENTS’ MISTAKES BASED ON POLYTOMOUS SCORNG PARTIAL CREDIT MODELS IN MATHEMATICS Abstract This study aims to reveal diagnostic information from the participants’ incorrect answers in the National Examination (NE) of Mathematics. The diagnostic information includes the attributes underlying test items, the attribute exhaustiveness, and the types of mistakes that the participants made. This study was a post-hoc diagnostic study, described as the retrofitting approach. The analysis of test items and item responses in the NE of Mathematics aimed to reveal the diagnostic information in the Junior High School students’ content, process, and skill categories. The results are that (1) there are 47 attributes underlying the mathematics test items, consisting of 4 content attributes, 36 process attributes, and 7 skill attributes, (2) the highest inexhaustiveness of the content, process, and skill attributes is in the topics of geometry and measurement, (3) most mistakes in the topics of numbers, algebra, geometry, and measurement are those of concepts, and most mistakes in statistics and probability are those of language interpretation, and (4) the findings of diagnostic information in the data on the NE of Mathematics can be revealed through the mechanism of identifying the attributes, developing a polytomous scoring rubric, finding out the attribute inexhaustiveness, and diagnosing mistakes the test participants.Keywords: diagnosis, mathematics, partial credit model

Download Full-text

ITEM ANALYSIS OF AN ENGLISH SUMMATIVE TEST

PEJLaC: Pattimura Excellence Journal of Language and Culture ◽

10.30598/pejlac.v1.i1.pp9-18 ◽

2021 ◽

Vol 1 (1) ◽

pp. 9-18

Author(s):

Leni Amelia Suek

Keyword(s):

Junior High School ◽

Item Difficulty ◽

Item Analysis ◽

Discrimination Index ◽

Item Discrimination ◽

Good Test ◽

Test Items ◽

Level Of Knowledge ◽

The Government ◽

Difficulty Index

While almost half of the teachersâ€™ activities are assessing their students, they are not well-prepared with assessment literacy training. Hence, they are unable to produce good tests to measure studentsâ€™ level of knowledge and skills. This study is aimed at analyzing item difficulty and item discrimination of a test made by an English teacher at a junior high school in Kupang. It was descriptive qualitative research and the instruments of the research were test items, answer keys, and studentsâ€™ answer sheets. For the difficulty index, it was revealed that more than half of the test items were easy, while only 2% of the test items were difficult. In terms of the discrimination index, it was found that only 10% of the test items were excellent and most of the test items (46%) were poor. These findings indicated that the English test had a poor item difficulty index and low item discrimination index. Hence, it did not fulfill the criteria of a good test and could not measure studentsâ€™ true ability. It is highly recommended for the teachers to improve the test items and for the government to provide assessment training for the teachers so that they can produce good tests.

Download Full-text

THE IMPACT OF ZONATION SYSTEM POLICY IN THE PPDB ON STUDENTS' OBTAINING AT SMA NEGERI 8 SURABAYA

JURNAL PAJAR (Pendidikan dan Pengajaran) ◽

10.33578/pjr.v4i3.7978 ◽

2020 ◽

Vol 4 (3) ◽

pp. 525

Author(s):

Idawati Idawati

Keyword(s):

Junior High School ◽

Low Income ◽

Educational Background ◽

Quantitative Approach ◽

National Examination ◽

New Students ◽

The Impact ◽

New Student ◽

Academic Year ◽

Descriptive Method

This research was conducted by using a descriptive method with a quantitative approach. The quantitative approach was chosen to be tested theories by examining and measuring variables in the form of relationships, differences, influences, contributions, and the others. The research was carried out by describing the students acquisition data on the new student admission (PPDB) using zoning system based on the academic year 2019-2020 and the student acquisition data on the academic year PPDB 2018-2019 as a comparison. Based on the results of the study, the new students of PPDB using zoning system was considered lower in terms of economic and educational background of parents. There were more parents with less education (elementary & junior high school) in the zoning system than in the rayon system, whereas parents with higher education in the zoning system were fewer than the rayon system. Likewise, in terms of income, there were more people with the low income in the zoning system than in the rayon system, and those having high income were fewer than in the rayon system. The study showed that the intelligence and the result of National Examination Score (NUN) in the zoning system is lower than in the rayon system. The intelligent level of the students in the zoning system is mostly dominated by the scores under 90-109, while in the rayon system were dominated by the scores above 90-109. The National Examination Scores (NUN) in the zoning system were evenly distributed from a range of scores 0 to 30, while in the rayon system the scores were dominated by a range of scores 28-30, with the lowest score 24.

Download Full-text

Analysis on Reading and Writing Section of Test Items in 2009/2010 English National Final Examination (UAN) for Junior High School Students

ETERNAL (English Teaching Journal) ◽

10.26877/eternal.v1i2.183 ◽

2012 ◽

Vol 1 (2) ◽

Author(s):

Musarokah Siti ◽

Dwi Anggani Linggar Bharati

Keyword(s):

High School ◽

High School Students ◽

Junior High School ◽

Junior High ◽

Junior High School Students ◽

Final Examination ◽

School Students ◽

Cognitive Domains ◽

Reading And Writing ◽

Test Items

ABSTRACT This paper attempts to analyze the test items in the English National Final Examination (UAN) for Junior High School Students in the academic year 2009/2010. This study aims at analyzing the compatibility of the test items of package A in reading and writing section of English National Final Examination for Junior High School Students with the standard of graduate competence and the cognitive domains used in the test. The data were collected by using documentary method, while in analyzing the data the writers focused on matching the compatibility of the test items with the Standard of Graduate Competence and identifying the cognitive domain used. From the analysis, it was found that the test items of package A in reading and writing section of English National Final Examination 2009/2010 were in line with the Standard of Graduate Competence (SKL) arranged by the government. The cognitive domains in reading section mostly used level of comprehension. The cognitive domains in writing section used the application and the synthesis level. ?é?á Key words: analysis, test items, national final examination

Download Full-text

KEMAMPUAN GURU BAHASA INDONESIA DALAM MENYELESAIKAN SOAL UKBI DI PEKANBARU

Madah Jurnal Bahasa dan Sastra ◽

10.31503/madah.v5i2.502 ◽

2017 ◽

Vol 5 (2) ◽

pp. 147

Author(s):

Elvina Syahrir

Keyword(s):

High School ◽

Elementary School ◽

Language Proficiency ◽

Junior High School ◽

Proficiency Test ◽

Senior High School ◽

Language Teachers ◽

National Examination ◽

Language Proficiency Test ◽

Bahasa Indonesia

This study was motivated by the low score of the students‘ Indonesian language in national examination that were certainly correlated with the Indonesian language teachers‘ competences. The Indonesian language techers‘ competences can be observed in doing the Indonesian language proficiency test. This study was aimed to determine the Indonesian language teachers‘ capabilities indoing UKBI and to describe about it. The data obtained describe that the Indonesian language teachers of Senior High School (SMA) have the highest scores, then the Indonesian language teachers of Junior High School (SMP), and the last ones were the teachers of elementary school (SD). Moreover, among the Indonesian language techers‘ capabilities of SMA, SMP, and SD in doing UKBI have not shown the significant differences.AbstrakPenelitian ini dilatarbelakangi oleh rendahnya nilai Ujian Nasional (UN) bahasa Indonesia siswa yang tentunya berkorelasi dengan kompetensi guru bahasa Indonesianya. Kompetensi guru bahasa Indonesia dapat terlihat dari Uji Kemahiran Berbahasa Indonesia (UKBI). Penelitian ini bertujuan untuk mengetahui kemampuan guru bahasa dan sastra Indonesia di Pekanbaru dalam menyelesaikan soal UKBI dan untuk mendeskripsikan kemampuan guru dalam menyelesaikan soal UKBI. Dari penganalisisan data diperoleh temuan bahwa nilai rerata UKBI guru SMA memperoleh skor paling tinggi, nilai rerata UKBI guru SMP memperoleh skor sedang, sedangkan nilai rerata UKBI guru SD memperoleh skor paling rendah. Akan tetapi, kemampuan guru bahasa dan sastra Indonesia dari ketiga tingkatan sekolah tersebut (SMA, SMP, SD) dalam menyelesaikan soal UKBI tidak menunjukkan perbedaan yang berarti (tidak signifikan) pada taraf kepercayaan 5 %, hal ini terlihat dari nilai Fhitung lebih kecil dari Ftabel (Fhitung = 0,11 < Ftabel = 4,35)

Download Full-text

KARAKTERISTIK BUTIR SOAL ULANGAN KENAIKAN KELAS SEBAGAI PERSIAPAN BANK SOAL BAHASA INGGRIS

Jurnal Penelitian dan Evaluasi Pendidikan ◽

10.21831/pep.v15i1.1089 ◽

2013 ◽

Vol 15 (1) ◽

pp. 92-114

Author(s):

Nur Hidayanto Pancoro

Keyword(s):

High School ◽

Quantitative Analysis ◽

Junior High School ◽

Quantitative Research ◽

Junior High ◽

Expert Judgment ◽

Item Bank ◽

Test Items ◽

Item Characteristics ◽

Test Information Function

Penelitian ini bertujuan untuk mengembangkan rintisan bank soal Bahasa Inggris SMP kelas VIII pada tingkat provinsi dengan memperhatikan (1) karakteristik soal Ulangan Kenaikan Kelas (UKK); (2) karakteristik butir-butir soal; (3) jumlah butir soal yang memenuhi standar. Metode yang digunakan adalah pengembangan dengan pendekatan kuantitatif deskriptif. Sam-pel sebanyak 6554 respon siswa (lembar jawab). Perangkat tes dianalisis kesesuaian indikatornya dengan Standar Kompetensi (SK) dan Kompetensi Dasar (KD) yang telah ditentukan, serta ditelaah secara kualitatif oleh ahli berdasarkan kriteria kons-truksi, bahasa dan materi. Respons siswa dianalisis secara kuantitatif dengan Iteman 3.00, dan Bilog MG 3.0 3 PL untuk mendapatkan informasi tentang tingkat kesulitan, daya beda, pseudoguessing, serta informasi butir. Bank soal yang dihasilkan masuk kategori baik dilihat dari (1) Hasil analisis kualitatif dari 150 butir soal, sebanyak 105 butir soal (70%) dinyatakan baik. (2) Hasil analisis kuantitatif dengan program Iteman 3.00 diperoleh 115 butir soal tergolong baik. (3) Analisis dengan Bilog MG 3 PL 77 butir soal (66,96%) masuk kategori baik.Kata kunci: bank soal, tingkat kesulitan, daya beda, pseudo guessing, expert judgment, informasi butir dan tes______________________________________________________________ THE ITEM CHARACTERISTICS OF THE FINAL SEMESTER TEST AS A PREPARATION FOR ENGLISH ITEM BANKAbstract This research aims at developing a pilot english test item bank of the 2nd grade junior high school based on: (1) the characterisctics of test items for final semester test; (2) the total number of test items for the 2nd grade junior high school in Yogyakarta Special Province. This descriptive quantitative research used 6554 samples of students’ answer sheets in 2009 and also employs expert judgment to review the construct, language, as well as the materials of the questions. Students’ responses were also analyzed by using Iteman 3.00, Bilog MG 3.0 3 PL. The result of qualitative analysis with expert judgment which shows that 105 out of the 150 test items are categorized as “good”, and the other are “bad”. The quantitative analysis with Iteman 3.00 shows that 115 out of the 150 test items are categorized as “good” and the quantitative analysis with Bilog MG 3.00 3 PL shows that 67% test items are categorized as good. Keywords: item bank, level of difficulty, discrimination index, pseudoguessing, expert judgment, item and test information function

Download Full-text

The utilization of junior high school mathematics national examination data: A conceptual error diagnosis

Research and Evaluation in Education ◽

10.21831/reid.v3i2.18120 ◽

2017 ◽

Vol 3 (2) ◽

pp. 163 ◽

Cited By ~ 1

Author(s):

Kartianom Kartianom ◽

Djemari Mardapi

Keyword(s):

High School ◽

Junior High School ◽

Junior High ◽

Classical Test Theory ◽

School Mathematics ◽

Test Theory ◽

High School Mathematics ◽

Error Diagnosis ◽

National Examination ◽

Quantitative Descriptive

The goal of the research is to gain insights into the characteristics of the items in the mathematics national examination, the attributes on which the items were formulated and the result of a conceptual error diagnosis of the mathematics materials based on the result of the junior high school mathematics national examination. This is quantitative descriptive research. The data were collected from 3,079 grade-nine students of junior high schools who took the National Examination in the academic year of 2015/2016. The sample was established randomly based on the package code of the examination which is P0C5520 with 574 students as the examinees. Documentation method was applied in collecting the data. The result of the research shows that – upon the implementation of the classical test theory – there are 16 items in ‘difficult’ category, 24 in ‘intermediate’ category, and no items in ‘easy’ category. Furthermore, upon the implementation of the item response theory, the result shows that 28 items are in ‘good’ category and 12 items are in ‘poor’ category. In addition, there are 50 attributes on which the Junior High School Mathematics National Examination test (package P0C520) is formulated. Four attributes are content attributes and the rest (46) are process skill attributes. The result of the diagnosis shows that there are 11 types of errors made by the students when trying to complete the content items. Most of the errors are conceptual errors related to the geometric materials especially in the sub-materials of polyhedron, triangles, and quadrangles.

Download Full-text