Item Analysis of Final Test for the 9th Grade Students of SMPN 44 Surabaya in the Academic Year of 2019/2020

2020 ◽  
Vol 2 (1) ◽  
pp. 34-46
Author(s):  
Siti Fatimah ◽  
Achmad Bernhardo Elzamzami ◽  
Joko Slamet

This research focused on the validity and reliability of the test scores and on item analysis, covering discrimination power and the difficulty index, in order to provide detailed information leading to improved test item construction. The quality of each item was analyzed in terms of item difficulty, item discrimination and distractor analysis. The reliability of the test was computed by applying the Kuder-Richardson Formula 20 (KR-20). The analysis of the 50 test items was computed using Microsoft Office Excel. A descriptive method was applied to describe and examine the data. The research findings showed that the test met the criteria for content validity, although this validity was categorized as low. Meanwhile, the reliability value of the test scores was 0.521010831 (0.52), which was categorized as low reliability and indicated that the test needed revision. Of the 50 items examined, there were 21 items in need of improvement that were classified as “easy” on the difficulty index, and a total of 26 items (52%) fell into the “poor” category for discriminability. This means that more than 50% of the test items need to be revised because they do not meet the criteria. It is suggested that, in order to measure students’ performance effectively, essential improvements be made and items with a “poor” discrimination index be reviewed.
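As an illustration of the reliability statistic named in this abstract, the following is a minimal sketch of KR-20 for dichotomously scored items. The function name `kr20`, the toy `scores` matrix, and the use of the population variance of total scores are assumptions made for this example; the study itself reports only that the computation was done in Microsoft Office Excel.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 for a list of per-student lists of 0/1 item scores."""
    n_students = len(responses)
    n_items = len(responses[0])
    # p = proportion of students answering each item correctly; q = 1 - p.
    p = [sum(row[i] for row in responses) / n_students for i in range(n_items)]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    # Variance of the students' total scores.
    # Assumption: population variance; some texts use the sample variance.
    totals = [sum(row) for row in responses]
    var_total = pvariance(totals)
    return (n_items / (n_items - 1)) * (1 - sum_pq / var_total)


# Hypothetical data: 5 students x 4 items (1 = correct, 0 = incorrect).
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(scores), 2))  # 0.8
```

Values closer to 1 indicate more internally consistent scores; the cut-offs used to label a value such as 0.52 as low are set by each study's own criteria.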

2019 ◽  
Vol 120 (5/6) ◽  
pp. 383-406 ◽  
Author(s):  
Ömer Demir ◽  
Süleyman Sadi Seferoğlu

Purpose The lack of a reliable and valid measurement tool for coding achievement emerges as a major problem in Turkey. Therefore, the purpose of this study is to develop a Scratch-based coding achievement test. Design/methodology/approach Initially, an item pool with 31 items was created. The item pool was classified within the framework of Bayman and Mayer’s (1988) types of coding knowledge to support content validity of the test. Then the item pool was applied to 186 volunteer undergraduates at Hacettepe University during the spring semester of the 2017-2018 academic year. Subsequently, the item analysis was conducted for construct validity of the test. Findings In all, 13 items were discarded from the test, leaving a total of 18 items. Out of the 18-item version of the coding achievement test, 4, 5 and 9 items measured syntactic, conceptual and strategic knowledge, respectively, among the types of coding knowledge. Furthermore, average item discrimination index (0.531), average item difficulty index (0.541) and Cronbach Alpha reliability coefficient (0.801) of the test were calculated. Practical implications Scratch users, especially those who are taking introductory courses at Turkish universities, could benefit from a reliable and valid coding achievement test developed in this study. Originality/value This paper has theoretical and practical value, as it provides detailed developmental stages of a reliable and valid Scratch-based coding achievement test.


Author(s):  
Tesalonika Br Karo ◽  
Viator Lumbanraja ◽  
Novalina Sembiring

The purpose of this research is to describe the ability of the eleventh-grade students of SMA Deli Murni Bandar Baru to use countable and uncountable nouns. The population of this research was the eleventh-grade students, with 58 students taken as the sample. The instrument for collecting data was a test on countable and uncountable nouns. A tryout test was conducted to determine the validity, reliability, and item difficulty of the test items. The results showed that 5 students (15%) belonged to the high category, 24 students (73%) to the moderate category, and 4 students (12%) to the low category. The mean score was 61.39, and only 24% of the students did well on the test, with 12 students scoring above 75. This means that the eleventh-grade students of SMA Deli Murni Bandar Baru are not yet able to use countable and uncountable nouns. The total number of incorrect answers made by the students in using countable and uncountable nouns was 502. The percentage of students’ mistakes in the uncountable multiple-choice items, covering indefinite and quantifier uncountable nouns, was 33%; in the countable multiple-choice items, covering singular, regular, and irregular countable nouns, 34%; and in the countable essay items, covering regular and irregular countable nouns, 33%. Based on the findings and conclusions, some suggestions are offered to English teachers, English students, and other researchers. English teachers in particular are advised to improve students' ability to use countable and uncountable nouns.


2020 ◽  
Vol 5 (1) ◽  
pp. 610-615
Author(s):  
Anisa Fitriani Lailamsyah ◽  
Fitri Apriyanti

This research analyzes the English summative test for the tenth-grade students of SMA Muhammadiyah 3 Jakarta in the 2013/2014 academic year. Both quantitative and qualitative methods were used to analyze the items of the English summative test for senior high school. The quantitative method was used to analyze the facility value and discriminating power of each item. The qualitative method was used to describe and analyze the quality of the test items. Keywords: Item Analysis, English Summative Test, Tenth Grade Student


2020 ◽  
Vol 3 (2) ◽  
pp. 133
Author(s):  
Thresia Trivict Semiun ◽  
Fransiska Densiana Luruk

This study aimed to examine the quality of an English summative test for grade VII in a public school located in Kupang. In particular, it examined content validity and reliability and conducted an item analysis covering item validity, item difficulty, item discrimination, and distracter effectiveness. This was descriptive evaluative research that used documentation to collect data. The data were analyzed quantitatively, except for content validity, which was analyzed qualitatively by matching the test items with the materials stated in the curriculum. The findings revealed that the English summative test had high content validity. Reliability was estimated by applying the Kuder-Richardson formula (K-R20). The result showed that the test was reliable and very good for a classroom test. The item analysis was conducted using ITEMAN 3.0, and it revealed that the test consisted mostly of easy items, most of the items could discriminate among the students, most distracters performed well, and most of the items were valid.


2013 ◽  
Vol 66 (1) ◽  
Author(s):  
Ong Eng Tek ◽  
Mohd Al-Junaidi Mohamad

This study aims to develop a valid and reliable multiple-choice test referred to as Test of Basic and Integrated Process Skills (T-BIPS) for secondary schools to measure the acquisition of a full range of 12 science process skills (SPS), namely 7 basic SPS and 5 integrated SPS. This study involves two phases. Phase one entails the generation of test items according to a set of item objectives, and the establishment of the content and face validities as well as response objectivity in a qualitative manner through the use of panel experts. Phase two involves validating the psychometric properties of the instrument using field testing data from 104 Form 4 students of top, average and bottom sets in urban and rural schools. The final set of T-BIPS consists of 60 items: 28 items for basic SPS (with the KR-20 reliability of 0.86) and 32 items for integrated SPS (with the KR-20 reliability of 0.89). The mean item difficulty index is 0.60, ranging between 0.37 and 0.75, while the mean item discrimination index is 0.52, ranging between 0.20 and 0.77. The results of item analysis indicate that T-BIPS with the appropriate psychometric characteristics is an acceptable, valid and reliable test to measure the acquisition of science process skills. 


Author(s):  
Horhon Lumbantoruan ◽  
Sri Minda Murni ◽  
Isli Iriani Indiah Pane

The objective of this study is to determine the quality of the English final test designed for the second semester of third-grade students of SMAN 1 Pagaran in the academic year 2016/2017. It describes whether or not the test items have the characteristics of a good test in terms of validity, reliability, difficulty level, and discriminating power. The test consists of 35 multiple-choice items. The research design used in this study was descriptive qualitative research. To determine the discriminating power of the test, the writer chose the top 31% of students as the upper group and the bottom 31% as the lower group. The results show that 18 items (51%) meet the criteria of validity and 17 items (49%) are invalid. The test is reliable, with a coefficient of 0.676. The test has an unacceptable index of difficulty, since 15 items (43%) are too difficult and only 5 items (14%) are easy. For the discriminating power index, the writer found 7 items (20%) with negative values that have to be discarded, 6 poor items (17%), 8 satisfactory items (22%), 13 good items (38%), and 1 excellent item (3%). In conclusion, the English final test designed for the second semester of third-grade students of SMAN 1 Pagaran in the academic year 2016/2017 does not meet the criteria of an effective and acceptable test. Keywords: Validity, Reliability, Level of Difficulty, Discrimination Power
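The upper-group and lower-group procedure mentioned in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the function name `discrimination_index`, the toy `answers` matrix, and the rounding of the group size are hypothetical, and the lower group is taken to be the lowest-scoring 31% of students.

```python
def discrimination_index(responses, item, fraction=0.31):
    """Upper-lower discrimination index D for one 0/1-scored item.

    responses: list of per-student lists of 0/1 item scores.
    fraction: share of students placed in each extreme group.
    """
    # Rank students from highest to lowest total score.
    ranked = sorted(responses, key=sum, reverse=True)
    # Assumption: group size is the rounded share of the sample, at least 1.
    n_group = max(1, round(len(ranked) * fraction))
    upper = ranked[:n_group]
    lower = ranked[-n_group:]
    upper_correct = sum(row[item] for row in upper)
    lower_correct = sum(row[item] for row in lower)
    # D = (correct in upper group - correct in lower group) / group size.
    return (upper_correct - lower_correct) / n_group


# Hypothetical data: 10 students x 4 items (1 = correct, 0 = incorrect).
answers = [
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
for i in range(4):
    print(i, round(discrimination_index(answers, i), 2))
```

A negative D means that more low scorers than high scorers answered the item correctly, which is why such items are usually the first candidates for removal.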


2020 ◽  
Vol 5 (2) ◽  
pp. 491
Author(s):  
Amalia Vidya Maharani ◽  
Nur Hidayanto Pancoro Setyo Putro

Numerous studies have been conducted on item analysis of English tests. However, investigations of the characteristics of a good English final semester test are still rare in several districts in East Java. This research sought to examine the quality of the English final semester test in the academic year of 2018/2019 in Ponorogo. A total of 151 samples in the form of students’ answers to the test were analysed for item difficulty, item discrimination, and distractor effectiveness using the Quest program. This descriptive quantitative research revealed that the test did not have a good proportion of easy, medium, and difficult items. For item discrimination, the test had 39 excellent items (97.5%), which meant that the test could discriminate between high and low achievers. In addition, the distractors functioned well, since 32 items (80%) had effective distractors. The findings suggest that item analysis is an important process in test construction, because it establishes the quality of the test, which directly affects the accuracy of students’ scores.


1992 ◽  
Vol 21 (2) ◽  
pp. 151-160 ◽  
Author(s):  
Michael G. Aamodt ◽  
Teige McShane

The current study reported the results of a meta-analytic investigation of the effects on test scores and test completion times of three aspects of writing test items: the number of answers in multiple-choice exams, the order of item difficulty, and the organization of items by content. The results of meta-analysis indicated that three-choice questions are slightly easier than four-choice questions (d = .90) and take significantly less time to complete (d = −.61). Exams beginning with easier items and then moving to more difficult items are slightly easier than exams with randomly ordered items (d = .11) or exams beginning with difficult items (d = .22). Exams in which the items are organized by content are slightly easier than exams containing randomly ordered items (d = .04). All of the above effect sizes are small.


2005 ◽  
Vol 11 (2) ◽  
pp. 93-96 ◽  
Author(s):  
Amanda D Heidgerken ◽  
Adam B Lewin ◽  
Gary R Geffken ◽  
Kenneth M Gelfand ◽  
Eric A Storch ◽  
...  

An educational Website was designed by the Florida Initiative in Telehealth and Education group, and an online diabetes education test was developed using a sample of 60 children and young adults aged 8–22 years, all of whom had diabetes. The 31 items were analysed for item difficulty. Eight test items were eliminated as being unsuitable. The test was then used in 67 prospective diabetes counsellors (23 men, 44 women) who volunteered for a summer camp. Camp counsellors ranged in age from 17 to 33 years (mean 22 years, SD 3). The counsellors' mean pre-test scores were 80% and their mean post-test scores were 92%. There was a significant improvement (P < 0.001) of approximately 1.25 questions from pre- to post-test. This supports the use of the online educational Website for training individuals working with children with diabetes. The Website may prove to be useful for online education in other areas of diabetes management.


2020 ◽  
Vol 12 (2-2) ◽  
Author(s):  
Nor Aisyah Saat

Item analysis is the process of examining student responses to individual test items in order to get a clear picture of the quality of each item and of the overall test. Teachers are encouraged to perform item analysis for each administered test in order to determine which items should be retained, modified, or discarded. This study aims to analyse the items in 2 summative examination question papers using classical test theory (CTT). The instruments used were the SPM Mathematics Trial Examination Questions 1 2019, which involved 50 Form 5 students, and the SPM Mathematics Trial Examination Question 1 2019, which involved 20 students. The SPM Mathematics Trial Examination Question paper 1 contains 40 objective questions, while the SPM Mathematics Trial Examination paper 1 contains 25 subjective questions. The data obtained were analysed using Microsoft Excel based on the formulas for the item difficulty index and the discrimination index. This analysis can help teachers better understand the difficulty level of the items used. Finally, based on the item analysis results, the items were classified as good, good but improved, marginal or weak items.
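For readers unfamiliar with the classical test theory difficulty index referred to here, a minimal sketch follows. The function name `difficulty_index`, the `max_score` parameter, and the sample scores are assumptions for illustration; the study states only that the formulas were applied in Microsoft Excel. For objective items the index is the proportion of students answering correctly, and for subjective items the sketch uses the common extension of mean score divided by the maximum possible score.

```python
def difficulty_index(item_scores, max_score=1):
    """Difficulty index of one item: mean score divided by the maximum score.

    For 0/1-scored objective items (max_score=1) this is the proportion
    of students who answered correctly. Higher values mean easier items.
    """
    return sum(item_scores) / (len(item_scores) * max_score)


# Hypothetical objective item answered by 5 students (1 = correct).
objective_item = [1, 1, 0, 1, 0]
print(difficulty_index(objective_item))  # 0.6

# Hypothetical subjective item marked out of 4 for 4 students.
subjective_item = [4, 2, 3, 1]
print(difficulty_index(subjective_item, max_score=4))  # 0.625
```

The thresholds that turn this index into labels such as good, marginal, or weak are defined by each study and are not reproduced here.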

