The quality of an English summative test of a public junior high school, Kupang-NTT

Thresia Trivict Semiun; Fransiska Densiana Luruk

doi:10.12928/eltej.v3i2.2311

English Language Teaching Educational Journal ◽

10.12928/eltej.v3i2.2311 ◽

2020 ◽

Vol 3 (2) ◽

pp. 133

Author(s):

Thresia Trivict Semiun ◽

Fransiska Densiana Luruk

Keyword(s):

High School ◽

Public School ◽

Junior High School ◽

Content Validity ◽

Item Difficulty ◽

Item Analysis ◽

Item Discrimination ◽

Test Items ◽

Evaluative Research

This study examined the quality of an English summative test for grade VII in a public school in Kupang. Specifically, it examined content validity and reliability and conducted an item analysis covering item validity, item difficulty, item discrimination, and distracter effectiveness. This was descriptive evaluative research that used documentation to collect data. The data were analyzed quantitatively, except for content validity, which was analyzed qualitatively by matching the test items with the materials stated in the curriculum. The findings revealed that the English summative test had high content validity. Reliability was estimated with the Kuder-Richardson formula (K-R20); the result showed that the test was reliable and very good for a classroom test. The item analysis, conducted with ITEMAN 3.0, revealed that the test consisted mostly of easy items, that most items could discriminate among students, that most distracters performed well, and that most items were valid.
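As an illustration of the reliability estimate named in this abstract, the following is a minimal sketch of the K-R20 formula for dichotomously scored items, KR20 = (k / (k − 1)) · (1 − Σ p·q / σ²). It is not the authors' procedure or data: the function name `kr20`, the sample responses, and the use of the population variance of total scores are all assumptions made for the example.

```python
from statistics import pvariance


def kr20(responses):
    """Estimate K-R20 reliability from 0/1 item scores.

    responses: one list per student, each holding 0/1 scores per item.
    Assumption: the population variance of total scores is used.
    """
    n_students = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    total_var = pvariance(totals)
    if n_items < 2 or total_var == 0:
        raise ValueError("need at least two items and non-zero score variance")

    # Sum of p * q over items, where p is the proportion answering correctly.
    pq_sum = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_students
        pq_sum += p * (1 - p)

    return (n_items / (n_items - 1)) * (1 - pq_sum / total_var)


# Hypothetical answer sheet: 4 students, 3 items.
sample = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(round(kr20(sample), 2))
```

Some references use the sample variance instead, which gives a slightly different coefficient on small groups.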

ITEM ANALYSIS OF AN ENGLISH SUMMATIVE TEST

PEJLaC: Pattimura Excellence Journal of Language and Culture ◽

10.30598/pejlac.v1.i1.pp9-18 ◽

2021 ◽

Vol 1 (1) ◽

pp. 9-18

Author(s):

Leni Amelia Suek

Keyword(s):

Junior High School ◽

Item Difficulty ◽

Item Analysis ◽

Discrimination Index ◽

Item Discrimination ◽

Good Test ◽

Test Items ◽

Level Of Knowledge ◽

The Government ◽

Difficulty Index

Although assessing students accounts for almost half of teachers’ activities, teachers are not well prepared through assessment literacy training. Hence, they are unable to produce good tests to measure students’ level of knowledge and skills. This study analyzed the item difficulty and item discrimination of a test made by an English teacher at a junior high school in Kupang. It was descriptive qualitative research, and the research instruments were test items, answer keys, and students’ answer sheets. For the difficulty index, more than half of the test items were easy, while only 2% were difficult. For the discrimination index, only 10% of the test items were excellent, and the largest share (46%) were poor. These findings indicated that the English test had a poor item difficulty index and a low item discrimination index. Hence, it did not fulfill the criteria of a good test and could not measure students’ true ability. It is highly recommended that teachers improve the test items and that the government provide assessment training for teachers so that they can produce good tests.
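The two indices discussed in this abstract can be sketched as follows, assuming the common classical definitions: difficulty as the proportion of students answering an item correctly, and discrimination as the difference in that proportion between upper and lower scoring groups. The function names, the 27% default group size, and the sample data are assumptions for illustration, not details taken from the study.

```python
def item_difficulty(responses, item):
    """Proportion of students who answered the item correctly (0/1 scores)."""
    return sum(row[item] for row in responses) / len(responses)


def item_discrimination(responses, item, fraction=0.27):
    """Upper-group minus lower-group proportion correct for one item.

    Assumption: groups are the top and bottom `fraction` of students
    ranked by total score (27% is a frequently used choice).
    """
    ranked = sorted(responses, key=sum, reverse=True)
    group_size = max(1, round(len(ranked) * fraction))
    upper = ranked[:group_size]
    lower = ranked[-group_size:]
    p_upper = sum(row[item] for row in upper) / group_size
    p_lower = sum(row[item] for row in lower) / group_size
    return p_upper - p_lower


# Hypothetical answer sheet: 4 students, 3 items.
sample = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
for i in range(3):
    print(i, item_difficulty(sample, i), item_discrimination(sample, i, fraction=0.5))
```

The cut-offs that label an item "easy", "poor", or "excellent" vary between sources, so they are left out of the sketch.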

The Analysis of Junior High School Teacher-Made Tests for the Students in Enrekang

FOSTER: Journal of English Language Teaching ◽

10.24256/foster-jelt.v1i2.14 ◽

2020 ◽

Vol 1 (2) ◽

pp. 122-138

Author(s):

Husnani Aliah

Keyword(s):

High School ◽

Junior High School ◽

Junior High ◽

Item Difficulty ◽

Item Analysis ◽

Multiple Choice ◽

Item Bank ◽

Cognitive Domain ◽

Multiple Choice Tests ◽

Choice Tests

The research aimed to find out how teacher-made tests are prepared in Enrekang, the quality of English teacher-made tests according to item analysis, and the cognitive domain levels of the teacher-made tests. Test quality was determined after the tests were used in school examinations. This research was a survey using a descriptive method. The researcher analyzed the data and then described the findings quantitatively. The population was the teachers who teach ninth grade at junior high schools in Enrekang. Simple random sampling was applied, taking four different schools as the sample. The analysis shows that the preparation junior high school teachers follow in constructing teacher-made tests in Enrekang is divided into five main parts. In preparing the test, the procedures were considering the test materials and the proportion of each topic, checking the item bank for items that match the syllabus and indicators, or preparing a test specification. In writing the test, the teachers’ procedures were rewriting items chosen from the internet and textbooks, rewriting items that had been used before and allowing other teachers to verify them, combining items from the item bank and textbooks, or writing new items. In analyzing a test, the procedures were analyzing and revising the test based on item difficulty, predicting the item difficulty and revising the test, or doing nothing to analyze the test. Regarding the time needed to prepare the test, three out of five teachers need only one week to construct multiple-choice tests, while two out of five need two weeks. The teachers also have different ways of adapting tests to students’ ability. Moreover, the item analysis shows that no test is perfectly good; almost all tests need to be revised. In terms of the cognitive domain, only three categories appeared across all tests: knowledge, comprehension, and application. No items belonged to the analysis, synthesis, or evaluation categories.

Utilizing test items analysis to examine the level of difficulty and discriminating power in a teacher-made test

EduLite Journal of English Education Literature and Culture ◽

10.30659/e.6.2.256-269 ◽

2021 ◽

Vol 6 (2) ◽

pp. 256

Author(s):

Sayit Abdul Karim ◽

Suryo Sudiro ◽

Syarifah Sakinah

Keyword(s):

Reading Comprehension ◽

Junior High School ◽

Test Item ◽

English Language ◽

Item Analysis ◽

Quality Level ◽

Test Items ◽

Discriminating Power ◽

Level Of Difficulty

Apart from teaching, English language teachers need to assess their students by giving tests to gauge the students’ achievements. In general, teachers rarely conduct item analysis on their tests. As a result, they have no idea about the quality of the tests distributed to the students. The present study attempts to determine the levels of difficulty (LD) and the discriminating power (DP) of the multiple-choice (MC) test items constructed by an English teacher for a reading comprehension test, using test item analysis. This study employs a qualitative approach. For this purpose, a reading comprehension test of 50 MC items was obtained from the students’ test results. Thirty-five grade eight students took part in the MC test try-out. They were male (15) and female (20) students of junior high school 2 Kempo, in West Nusa Tenggara Province. The findings revealed that 16 items out of 50 test items were rejected due to the poor and worst quality level of difficulty and discriminating index. Meanwhile, 12 items need to be reviewed due to their mediocre quality, and 11 items are claimed to be of good quality. In addition, 11 items out of 50 test items were considered excellent, as their DP scores ranged from around 0.44 to 0.78. The implications of the present study will shed light on the quality of teacher-made test items, especially for MC tests.

Evaluating Quality of Teacher-Developed English Test in Vocational High School: Content Validity and Item Analysis

Education Quarterly Reviews ◽

10.31014/aior.1993.02.02.67 ◽

2019 ◽

Vol 2 (2) ◽

Author(s):

Putu Irmayanti Wiyasa ◽

I Ketut Darma Laksana ◽

Ni Luh Ketut Mas Indrawati

Keyword(s):

High School ◽

Content Validity ◽

Item Analysis ◽

Vocational High School

Validitas Soal-Soal Ujian Nasional Mata Pelajaran Bahasa Indonesia Untuk SMP/ MTs

Jurnal VARIDIKA ◽

10.23917/varidika.v26i1.733 ◽

2015 ◽

Vol 26 (1) ◽

pp. 56-68

Author(s):

Siti Jamilatul Muyasaroh

Keyword(s):

High School ◽

Construct Validity ◽

Junior High School ◽

Content Validity ◽

Research Method ◽

Junior High ◽

Choice Test ◽

National Assessment ◽

Language And Culture ◽

Test Items

The purpose of this research is to determine the level of content validity and construct validity of the questions in the national assessment of the Indonesian language subject for junior high school / MTs. The research applied a qualitative method and qualitative analysis. To support this, the writer used two tools in the data analysis: 1) for content validity, the writer matched the test items with the indicators listed in the SKL (Graduation Standard Competency) of the Indonesian Language subject for the 2010–2011 academic years; 2) for construct validity, the writer used the evaluation format for multiple-choice test items, applying the material aspect, the construct aspect, and the language and culture aspect. The research concluded that the questions of the National Assessment of the Indonesian Language Subject for Junior High School / MTs in the 2010–2012 academic years have high content validity and construct validity. For content validity, all the indicators in the SKL (Graduation Standard Competency) have been applied in the test items. However, the writer found two indicators that are used in four test items, whereas each indicator should be applied in one test item. For construct validity, using the evaluation format for multiple-choice test items, the writer found that 56%–100% of the test items are appropriate with respect to the aspects. Meanwhile, the test items that do not conform to the aspects are 16–44%.

An item analysis on multiple-choice questions: a case of a junior high school English try-out test in Indonesia

LEKSIKA ◽

10.30595/lks.v15i1.8768 ◽

2021 ◽

Vol 15 (1) ◽

pp. 9

Author(s):

Rohmatul Jannah ◽

Didin Nuruddin Hidayat ◽

Nida Husna ◽

Imam Khasbani

Keyword(s):

High School ◽

Junior High School ◽

Junior High ◽

Item Analysis ◽

Multiple Choice ◽

Multiple Choice Questions ◽

High School English ◽

Test Items ◽

Discriminating Power ◽

Trial Testing

The present study analyzes multiple-choice questions obtained from trial testing conducted in a state junior high school in Indonesia. It seeks to reveal the level of difficulty, discriminating power, and distractor efficiency of the selected test items through item analysis. The results show that the levels of difficulty of the question items vary: some questions tended to be easy or moderately difficult, while others were difficult to answer. With regard to discriminating power, some questions were well constructed, while others were ambiguously worded, which can cause the questions to fail to evaluate the students’ ability. The analysis of distractor efficiency shows that the chosen multiple-choice questions were frequently constructed with less effective distractors, which caused more high-achieving students to choose wrong answers.

Developing an assessment instrument of junior high school students’ higher order thinking skills in mathematics

Research and Evaluation in Education ◽

10.21831/reid.v2i1.8268 ◽

2016 ◽

Vol 2 (1) ◽

pp. 92 ◽

Cited By ~ 1

Author(s):

Samritin Samritin ◽

Suryanto Suryanto

Keyword(s):

High School ◽

Construct Validity ◽

Junior High School ◽

Content Validity ◽

Goodness Of Fit ◽

Higher Order Thinking ◽

Thinking Skills ◽

Higher Order Thinking Skills ◽

Test Items ◽

Fit Index

This study is a research and development study. It aims to produce an instrument for assessing junior high school (JHS) students’ higher order thinking skills (HOTS) in mathematics. Its procedure consists of nine steps: (1) constructing the test specification; (2) writing test items; (3) analyzing test items; (4) conducting the first tryout; (5) analyzing the results of the first tryout; (6) revising the test; (7) assembling the test; (8) conducting the second tryout; and (9) analyzing the results of the second tryout. The instrument’s content validity was established through a focus group discussion (FGD) forum and the Delphi technique. The construct validity was established through analysis of the tryout data. The instrument tryout was conducted twice, involving 264 participants in the first tryout and 821 participants in the second tryout. The results of the study indicate that the instrument for assessing JHS students’ HOTS in mathematics has met the validity and reliability criteria. From the results of the content validity analysis, it can be concluded that the instrument is valid, supported by item validity indices above 0.79. From the results of the construct validity analysis, it can be concluded that the instrument is valid, as indicated by the value of χ² = 67.69, with p-value = 0.10, Root Mean Square Error of Approximation (RMSEA) = 0.03, supported by a Goodness of Fit Index (GFI) of 0.97, a Normed Fit Index (NFI) of 0.95, and an Adjusted Goodness of Fit Index (AGFI) of 0.95. The instrument reliability is 0.88. The developed instrument for assessing HOTS in mathematics consists of 12 items, each of which is an essay item. The test items have difficulty indices in a range of 0.30 ≤ Pi ≤ 0.7.

Item Analysis of English Final Semester Test

Indonesian Journal of EFL and Linguistics ◽

10.21462/ijefl.v5i2.302 ◽

2020 ◽

Vol 5 (2) ◽

pp. 491

Author(s):

Amalia Vidya Maharani ◽

Nur Hidayanto Pancoro Setyo Putro

Keyword(s):

Quantitative Research ◽

Item Difficulty ◽

Item Analysis ◽

Low Achievers ◽

Item Discrimination ◽

Good Test ◽

Test Analysis ◽

Difficult Item ◽

Academic Year

Numerous studies have examined item analysis of English tests. However, investigations of the characteristics of a good English final semester test are still rare in several districts in East Java. This research sought to examine the quality of the English final semester test in the academic year 2018/2019 in Ponorogo. A total of 151 samples in the form of students’ answers to the test were analysed for item difficulty, item discrimination, and distractor effectiveness using the Quest program. This descriptive quantitative research revealed that the test did not have a good proportion of easy, medium, and difficult items. For item discrimination, the test had 39 excellent items (97.5%), which meant that it could discriminate between high and low achievers. In addition, the distractors functioned well, since 32 items (80%) had effective distractors. The findings suggest that item analysis is an important process in test construction because it establishes the quality of the test, which directly affects the accuracy of students’ scores.

Pengembangan Instrumen Asesmen Higher Order Thinking Skills (HOTS) pada Mata Pelajaran Bahasa Indonesia SMA dan SMK

DIGLOSIA Jurnal Kajian Bahasa Sastra dan Pengajarannya ◽

10.30872/diglosia.v3i1.24 ◽

2020 ◽

Vol 3 (1) ◽

pp. 102-113

Author(s):

Sutami

Keyword(s):

Item Difficulty ◽

Multiple Choice ◽

Thinking Skills ◽

Assessment Instrument ◽

Item Discrimination ◽

Test Items ◽

Tenth Grade ◽

Multiple Choice Items ◽

Essay Test

This research aims to produce a valid and reliable Indonesian language assessment instrument in the form of HOTS test items and to describe the quality of those items for measuring the HOTS of tenth-grade SMA and SMK students. This was a research and development study adapted from Borg & Gall’s development model, including the following steps: research and information collection, planning, early product development, limited tryout, revising the early product, field tryout, and revising the final product. The results show that the HOTS assessment instrument consists of 40 multiple-choice items and 5 essay items. Based on expert judgment of its materials, construction, and language, the instrument was valid and appropriate for use. The reliability coefficients were 0.88 for the multiple-choice items and 0.79 for the essay items. The multiple-choice items have an average difficulty of 0.57 (average) and an average item discrimination of 0.44 (good), and the distractors function well. The essay items have an average item difficulty of 0.60 (average) and an average item discrimination of 0.45 (good).

Analisis instrumen ujian formatif mata pelajaran pendidikan jasmani olahraga dan kesehatan tingkat SMP

Jurnal Pendidikan Jasmani Indonesia ◽

10.21831/jpji.v13i2.20989 ◽

2017 ◽

Vol 13 (2) ◽

pp. 79-91

Author(s):

Iswanto Iswanto

Keyword(s):

High School ◽

Physical Education ◽

Junior High School ◽

Content Validity ◽

Reliability Estimation ◽

Quality Test ◽

Education And Health ◽

Practical Matter ◽

Level Of Difficulty

[English title: An Analysis of Formative Test Instruments for the Subject of Physical Education and Health in Junior High Schools] This study aims to determine the coverage and quality of formative test instruments for the Physical Education, Sports, and Health (Penjasorkes) subject in terms of cognitive and psychomotor aspects. This is descriptive research. Two types of instruments were used: essay items and practical test items. The quality of the essay items and performance items was assessed through content validity using Aiken’s formula, and reliability was estimated using coefficient alpha and the ICC. The results showed that, of 12 schools, 9 had complete instruments covering both cognitive and psychomotor competencies. In more detail: (1) two schools did not have essay items because they prioritized practice; (2) one school did not have practical test items because it did not have a field. The characteristics of the formative test items for the Penjasorkes subject were as follows: (1) the essay items of 9 schools had a good Aiken index, and the essay items of 1 school had a poor Aiken index; (2) the essay items had a moderate level of difficulty, with reliability that did not meet the requirement; (3) the practical test items of 11 schools had a good Aiken index, with reliability not meeting the requirement in 9 schools and meeting it in 2 schools. Keywords: instrument, formative test, descriptive qualitative and quantitative.
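The content validity index mentioned in this abstract can be illustrated with a minimal sketch of Aiken's V for a single item, V = Σ(r − lo) / (n · (c − 1)), where r is each rater's score, lo is the lowest rating category, c is the number of categories, and n is the number of raters. The function name `aiken_v`, the 1 to 5 rating scale, and the three example ratings are assumptions for illustration; the study's actual rating data are not given in the abstract.

```python
def aiken_v(ratings, lowest=1, highest=5):
    """Aiken's V for one item rated by several experts.

    ratings: integer scores given by each rater on the lowest..highest scale.
    Returns a value between 0 and 1; higher means stronger agreement
    that the item is relevant.
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < lowest or r > highest for r in ratings):
        raise ValueError("rating outside the scale")
    categories = highest - lowest + 1
    score_sum = sum(r - lowest for r in ratings)
    return score_sum / (len(ratings) * (categories - 1))


# Hypothetical ratings from three experts on a 1-5 scale.
print(round(aiken_v([4, 5, 5]), 3))
```

The threshold above which an index counts as "good" depends on the number of raters and categories, so it is not encoded here.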
