scholarly journals The quality of an English summative test of a public junior high school, Kupang-NTT

2020 ◽  
Vol 3 (2) ◽  
pp. 133
Author(s):  
Thresia Trivict Semiun ◽  
Fransiska Densiana Luruk

This study aimed at examining the quality of an English summative test of grade VII in a public school located in Kupang. Particularly, this study examined content validity, reliability, and conducted item analysis including item validity, item difficulty, item discrimination, and distracter effectiveness. This study was descriptive evaluative research with documentation to collect data. The data was analyzed quantitatively except for content validity, which was done qualitatively. Content validity was analyzed by matching the test items with materials stated in the curriculum. The findings revealed that the English summative test had a high content validity. The reliability was estimated by applying the Kuder-Richardson’s formula (K-R20). The result showed that the test was reliable and very good for a classroom test. The item analysis was conducted by using ITEMAN 3.0. and it revealed that the the test was mostly constructed by easy items, most of the items could discriminate the students, most distracters were able to perform well, and the most of items were valid.

Author(s):  
Leni Amelia Suek

While almost half of the teachers’ activities are assessing their students, they are not well-prepared with assessment literacy training. Hence, they are unable to produce good tests to measure students’ level of knowledge and skills. This study is aimed at analyzing item difficulty and item discrimination of a test made by an English teacher at a junior high school in Kupang. It was descriptive qualitative research and the instruments of the research were test items, answer keys, and students’ answer sheets. For the difficulty index, it was revealed that more than half of the test items were easy, while only 2% of the test items were difficult. In terms of the discrimination index, it was found that only 10% of the test items were excellent and most of the test items (46%) were poor. These findings indicated that the English test had a poor item difficulty index and low item discrimination index. Hence, it did not fulfill the criteria of a good test and could not measure students’ true ability. It is highly recommended for the teachers to improve the test items and for the government to provide assessment training for the teachers so that they can produce good tests.


2020 ◽  
Vol 1 (2) ◽  
pp. 122-138
Author(s):  
Husnani Aliah

The research aimed at finding out information about the preparation of constructing teacher-made tests in Enrekang, the quality of English teacher-made test according to item analysis, and the level cognitive domain of the teacher-made test. The test quality was determined after it was used in school examination test. This research employed survey research using descriptive method. The researcher analyzed the data and then described the research finding quantitatively. The population of this research was the teachers who teach in ninth grade at junior high schools in Enrekang. This research applied simple random sampling technique by taking four different schools as sampel. The results of analysis show preparation that junior high school teachers follow in constructing teacher-made tests in Enrekang is divided into five main parts. In preparing the test, the procedures were considering tests’ materials and proportion of each topic, choosing to check the item bank that match to syllabus and indicators, or preparing test specification. In writing test, teachers’ procedures were re-writing chosen test item from internet and textbook, re-writing items that was used before and allowing the other teachers to verify it, combining items from item bank and text book, or making new item. While in analyzing a test, the procedures used by the teachers were analyzing and revising test based on its item difficulty, predicting the item difficulty and revising the test, or doing nothing to analyze the test. About the timing in preparing the test, there are three out of five teachers who need only one week to construct multiple choice tests. Besides, there are two out of five teachers who need two weeks to construct multiple choice tests. While the teachers have different ways in providing test based on students’ ability. Moreover, the item analysis shows that no test is perfectly good. It was found that almost all tests need to be revised. It was also found that there were only three categories works in all tests based on the cognitive domain of the test namely knowledge, comprehension, and application categories. There was no item belong to analysis, synthesis, and evaluation categories.


2021 ◽  
Vol 6 (2) ◽  
pp. 256
Author(s):  
Sayit Abdul Karim ◽  
Suryo Sudiro ◽  
Syarifah Sakinah

Apart from teaching, English language teachers need to assess their students by giving a test to know the students� achievements. In general, teachers are barely conducting item analysis on their tests. As a result, they have no idea about the quality of their test distributed to the students. The present study attempts to figure out the levels of difficulty (LD) and the discriminating power (DP) of the multiple-choice (MC) test item constructed by an English teacher in the reading comprehension test utilizing test item analysis. This study employs a qualitative approach. For this purpose, a test of 50-MC test items of reading comprehension was obtained from the students� test results. Thirty-five students of grade eight took part in the MC test try-out. They are both male (15) and female (20) students of junior high school 2 Kempo, in West Nusa Tenggara Province. The findings revealed that16 items out of 50 test items were rejected due to the poor and worst quality level of difficulty and discriminating index. Meanwhile, 12 items need to be reviewed due to their mediocre quality, and 11 items are claimed to have good quality items. Besides, 11 items out of 50 test items were considered as the excellent quality as their DP scores reached around 0.44 through 0.78. The implications of the present study will shed light on the quality of teacher-made test items, especially for the MC test.


2019 ◽  
Vol 2 (2) ◽  
Author(s):  
Putu Irmayanti Wiyasa ◽  
◽  
I Ketut Darma Laksana ◽  
Ni Luh Ketut Mas Indrawati

2015 ◽  
Vol 26 (1) ◽  
pp. 56-68
Author(s):  
Siti Jamilatul Muyasaroh

The purpose of this research is to find out the level of the content validity and con struct validity of the questions of the national assessment of Indonesian language subject for junior high school / MTs. The Research method applied in this research was qualitative research method. This research employed the qualitative analysis. To support this qualita tive research, the writer used some tools in the data analysis. They are 1) to figure out the content validity, the writer had matched the test items with the indicators listed in the SKL (Graduation Standard Competency) of the Indonesian Language subject 20102011 aca demic years; 2) For the construct validity, the writer used the evaluation format of multiple choice test items by applying material aspect, construct aspect, and language and culture aspect. After the research was conducted, it can be concluded that the questions of the National Assessment of Indonesian Language Subject for Junior High School / MTs in the 20102012 academic years have high content validity and construct validity. The content validity, the entire indicators in the SKL (Graduation Standard Competency) has been ap plied in the test items. However, the writer found that there are two indicators that are used in four test items. In fact, each indicator should be applied in one test item. The construct validity, by using analysis method of the evaluation format of multiplechoice test items, the writer figured out that 56% 100 % test items are appropriate with the aspects. Meanwhile, the test items which are not deal with the aspects are 16 – 44%.


LEKSIKA ◽  
2021 ◽  
Vol 15 (1) ◽  
pp. 9
Author(s):  
Rohmatul Jannah ◽  
Didin Nuruddin Hidayat ◽  
Nida Husna ◽  
Imam Khasbani

The present study aims to analyze multiple-choice questions obtained from a trial testing conducted in a state junior high school in Indonesia. The study seeks to reveal the level of difficulty, discriminating power and distractor efficiency of the selected test items by employing item analysis. The result of the study discovers that levels of difficulty on the question items are varied. Some question tended to be easy and moderately difficult while the others are difficult to answer. It also uncovers that, in regard to discriminating power, some questions are well constructed while the others are ambiguously worded that can potentially cause the questions to fail to evaluate the students’ ability. The analysis on distractor efficiency presents information how the chosen multiple-choice questions were frequently constructed with less effective distractors that caused more high achieving students to choose wrong answers.


2016 ◽  
Vol 2 (1) ◽  
pp. 92 ◽  
Author(s):  
Samritin Samritin ◽  
Suryanto Suryanto

This study is a research and development study. It aims to produce an instrument for assessing junior high school (JHS) students’ higher order thinking skills (HOTS) in mathematics. Its procedure consists of nine steps: (1) Constructing the test specification; (2) writing test items; (3) analyzing test items; (4) conducting the first tryout; (5) analyzing the results of the first try out; (6) revising the test; (7) assembling the test; (8) conducting the second tryout; and (9) analyzing the results of the second tryout. The instrument content validity was obtained through the focus group discussion (FGD) forum, and Delphi technique. The construct validity was found out through the tryout data analysis. The instrument tryout was conducted twice involving 264 participants in the first tryout and 821 participants in the second tryout. The results of the study indicate that the instrument for assessing JHS students’ HOTS in mathematics has met the validity and reliability criteria. From the results of the content validity analysis, it can be concluded that the instrument is valid, and it was supported by the items validity indices above  0.79. From the results of the construct validity analysis, it can be concluded that the instrument is valid, as indicated by the value of χ2 = 67.69, with p-value = 0.10, Root Mean Square Error of Approximation (RMSEA) = 0.03, supported by Goodness of Fit Index (GFI) of 0.97, Normed Fit Index (NFI) of 0.95, and Adjusted Goodness of Fit Index (AGFI) of 0.95. The instrument reliability is 0.88. The developed instrument for assessing HOTS in mathematics consists of 12 items, each of which is of essay test type. The test items have difficulty indices in a range of 0.30 ≤ Pi ≤ 0.7.


2020 ◽  
Vol 5 (2) ◽  
pp. 491
Author(s):  
Amalia Vidya Maharani ◽  
Nur Hidayanto Pancoro Setyo Putro

Numerous studies have been conducted on the item test analysis in English test. However, investigation on the characteristics of a good test of English final semester test is still rare in several districts in East Java. This research sought to examine the quality of the English final semester test in the academic year of 2018/2019 in Ponorogo. A total of 151 samples in the form of students’ answers to the test were analysed based on item difficulty, item discrimination, and distractors’ effectiveness using Quest program. This descriptive quantitative research revealed that the test does not have good proportion among easy, medium, and difficult item. In the item discrimination, the test had 39 excellent items (97.5%) which meant that the test could discriminate among high and low achievers. Besides, the distractors could distract students since there were 32 items (80%) that had effective distractors. The findings of this research provided insights that item analysis became important process in constructing test. It related to find the quality of the test that directly affects the accuracy of students’ score.


2020 ◽  
Vol 3 (1) ◽  
pp. 102-113
Author(s):  
Sutami

This research aims to produce a valid and reliable Indonesian language assessment instrument in form of HOTS test items and it describes the quality of HOTS test items to measure HOTS skill for the tenth grade of SMA and SMK students. This study was a research and development study adapted from Borg & Gall’s development model, including the following steps: research and information collection, planning, early product development, limited try out, revising the early product, field try out, and revising the final product. The research’s result shows that the HOTS assessment instrument in the form of HOTS test consists of 40 multiple choice items and 5 essay test items. Based on the judgment of the materials, construction, and language was valid and appropriate to be used. The reliability coefficients were 0.88 for the multiple-choice items, and 0.79 for essays. The multiple-choice items have the average difficulty 0.57 (average), the average of item discrimination 0.44 (good), and the distractors function well. The essay items have the average of item difficulty 0.60 (average) and the average of item discrimination 0.45 (good)


2017 ◽  
Vol 13 (2) ◽  
pp. 79-91
Author(s):  
Iswanto Iswanto

Penelitian ini bertujuan untuk mengetahui cakupan dan kualitas instrumen ujian formatif mata pelajaran Penjasorkes dilihat dari aspek kognitif dan psikomotor. Penelitian ini merupakan penelitian deskriptif. Ada 2 jenis instrumen digunakan, yakni soal essay dan soal ujian praktik. Kualitas soal essay dan soal unjuk kerja dilihat dari validitas isi dengan menggunakan formula Aiken, hasil estimasi reliabilitasnya menggunakan koefesien  Alpha dan Cronbach  ICC. Hasil penelitian menunjukkan dari 12 sekolah, 9 sekolah memiliki instrumen lengkap, yakni kompetensi kognitif dan psikomotor. Selanjutnya, dapat dijelaskan sebagai berikut: (1) ada 2 sekolah tidak memiliki soal essay, karena mengutamakan praktik. (2) ada 1 sekolah tidak memiliki soal praktik, karena tidak memiliki lapangan. Karakteristik soal ujian formatif mata pelajaran Penjasorkes, yaitu: (1) soal essay 9 sekolah memiliki indeks Aiken tergolong baik, dan soal 1 sekolah memiliki indeks  Aiken tidak baik; (2) soal essay memiliki tingkat kesukaran sedang dengan reliabilitas tidak memenuhi syarat; (3) soal praktik 11 sekolah memiliki indeks Aiken tergolong baik dengan reliabilitas 9 sekolah tidak memenuhi syarat, dan reliabilitas 2 sekolah memenuhi syarat.Kata Kunci: instrumen, ujian formatif, deskriptif kualitatif dan kuantitatif. Ananalysis of Formative Test Instruments for Subjects of Physical Education and Health Among Junior High School  AbstractThis study aims to determine the scope and quality test instruments for Subjects of Physical Education and Health seen formative subjects of cognitive and psychomotor aspects. This research is descriptive. There are 2 types of instruments are used, the essay and exam practice. Quality of essay questions and problems of performance seen from the content validity by the formula Aiken, reliability estimation results of coefficient Alpha and ICC. The result showed from 12 schools, 9 school has a complete instrument, namely the cognitive and psychomotor competency. Furthermore, it can be described as follows: (1) there are two schools which do not have the essay, because the two schools prioritize learning practice. (2) there is one school which has no practical matter, because the school does not have a field facility, because it does not have a field. Characteristics exam for Subjects of Physical Education and Health formative subjects, namely: (1) essay 9 school has an index of Aiken classified is good, and about 1 school has an index of Aiken is not good; (2) The essay has a moderate level of difficulty with reliability ineligible; (3) about the practices of 11 schools have relatively good index of Aiken with reliability 9 schools were not eligible, and the reliability of two schools qualify.


Sign in / Sign up

Export Citation Format

Share Document