Validity and reliability of eight-grade digital culture test in light of item response theory

2021 ◽  
Vol 16 (4) ◽  
pp. 1816-1835
Author(s):  
Moen Salman Alnasraween ◽  
Ayat Mohammad Almughrabi ◽  
Raeda Mofid Ammari ◽  
Mohammad Saleh Alkaramneh

The purpose of this study is to construct a digital culture test in light of Item Response Theory (IRT) and to investigate its psychometric properties. The study sample consisted of 650 male and female eighth-grade students from the Directorate of Education and Teaching of Salt District. The descriptive approach was used. The results showed that the items have acceptable discrimination indices and are adequately spread along the difficulty continuum. The validity and reliability of the test were verified using several methods, including content validity and internal consistency. Most of the test items fit the assumptions of the two-parameter logistic model. The results also showed statistically significant differences in mean scores on the digital culture test by gender, in favor of female students, and by education sector, in favor of the private sector. Keywords: Digital Culture Test, IRT, Psychometric Properties, Teaching.
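For readers unfamiliar with the two-parameter logistic (2PL) model named in this abstract, a minimal sketch of its item response function follows. The function name and the item parameters are hypothetical and chosen only for illustration; they are not values reported by the study.

```python
import math


def two_pl_probability(theta: float, a: float, b: float) -> float:
    """Probability of a correct response under the 2PL model.

    theta: examinee ability, a: item discrimination, b: item difficulty.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical (discrimination, difficulty) pairs, not taken from the study.
items = [(0.8, -1.0), (1.2, 0.0), (1.6, 1.0)]

for a, b in items:
    probs = [round(two_pl_probability(theta, a, b), 3) for theta in (-2.0, 0.0, 2.0)]
    print(f"a={a}, b={b}: P at theta=-2, 0, 2 -> {probs}")
```

When ability equals the item difficulty, the probability is exactly 0.5, and a larger discrimination makes the curve steeper around that point.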

2021 ◽  
Vol 20 (1) ◽  
pp. 55-62
Author(s):  
Anthony Pius Effiom

This study used an Item Response Theory approach to assess Differential Item Functioning (DIF) and detect item bias in a Mathematics Achievement Test (MAT). The MAT was administered to 1,751 SS2 students in public secondary schools in Cross River State. An instrumentation research design was used to develop and validate a 50-item instrument. Data were analysed using the maximum likelihood estimation technique of the BILOG-MG V3 software. The results revealed that 6% of the items exhibited differential item functioning between male and female students. Based on the analysis, the study observed sex bias in some of the MAT items. DIF analysis, which attempts to eliminate irrelevant factors and sources of bias of any kind so that a test yields valid results, is among the best recent methods. Test developers and policymakers are therefore advised to exercise care in fair test practice by dedicating effort to less biased test development and decision making. Examination bodies should adopt Item Response Theory in educational testing, and test developers should be mindful of test items that can cause bias in response patterns between male and female students or any other sub-group under consideration. Keywords: Assessment, Differential Item Functioning, Validity, Reliability, Test Fairness, Item Bias, Item Response Theory.
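One IRT-based way to express DIF is the area between the item characteristic curves (ICCs) estimated separately for two groups. The sketch below illustrates that idea under a 2PL model with hypothetical parameters; it is not the BILOG-MG procedure the study used, and the function names are assumptions made for this example.

```python
import math


def icc(theta: float, a: float, b: float) -> float:
    """2PL item characteristic curve."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def unsigned_area(ref, focal, lo=-8.0, hi=8.0, steps=1600):
    """Trapezoidal approximation of the area between two ICCs.

    ref and focal are (a, b) tuples for the reference and focal groups.
    """
    width = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        theta = lo + i * width
        gap = abs(icc(theta, *ref) - icc(theta, *focal))
        weight = 0.5 if i in (0, steps) else 1.0  # end points count half
        total += weight * gap * width
    return total


# Hypothetical parameters: same discrimination, difficulty shifted by 0.5.
print(round(unsigned_area((1.0, 0.0), (1.0, 0.5)), 3))
```

With equal discriminations, the area approaches the absolute difference in difficulty, so an item that is harder for one group at the same ability level shows a clearly non-zero area.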


2021 ◽  
Vol 14 (2) ◽  
Author(s):  
Muhamad Ali Misri ◽  
Saifuddin Saifuddin ◽  
Reza Oktiana Akbar ◽  
Nok Rini Kamelia

This research aims to develop and evaluate a higher-order thinking skill (HOTS)-based test on the topic of matrices. The development was carried out in two stages: item development and validation. The first stage comprised reviewing relevant literature about HOTS, designing the test items, having experts review them, and trying out the items. Fifty-one upper secondary school students took part in the tryout. In the second stage, the tryout results were validated with reference to classical test theory and item response theory, covering item characteristics, validity and reliability, item discrimination, and difficulty levels. The validation resulted in five valid test items (r1 = 0.54; r2 = 0.88; r3 = 0.72; r4 = 0.78; r5 = 0.82). The developed test represents the topic, fulfills HOTS criteria, is reliable (rα = 0.85), can differentiate students with higher-order thinking, and has varied difficulty levels.
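Assuming the reliability coefficient rα reported in this abstract denotes Cronbach's alpha (the abstract does not say so explicitly), the following sketch shows how such a coefficient is computed from a respondents-by-items score table. The score table is made up for illustration and is not the study's data.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])  # number of items
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical scores: 5 respondents x 3 items.
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
    [5, 4, 5],
]
print(round(cronbach_alpha(scores), 3))
```

Alpha rises toward 1 as the items covary more strongly relative to their individual variances.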


2021 ◽  
Vol 10 (3) ◽  
pp. 388
Author(s):  
Melissa Alves Braga de Oliveira ◽  
Euclides de Mendonça Filho ◽  
Alicia Carissimi ◽  
Luciene Lima dos Santos Garay ◽  
Marina Scop ◽  
...  

Background: Recent studies with the mood rhythm instrument (MRhI) have shown that the presence of recurrent daily peaks in specific mood symptoms is significantly associated with increased risk of psychiatric disorders. Using a large sample collected in Brazil, Spain, and Canada, we aimed to analyze which MRhI items maintained good psychometric properties across cultures. As a secondary aim, we used network analysis to visualize the strength of the association between the MRhI items. Methods: Adults (n = 1275) aged 18 to 60 years from Spain (n = 458), Brazil (n = 415), and Canada (n = 401) completed the MRhI and the self-reporting questionnaire (SRQ-20). Psychometric analyses followed three steps: factor analysis, item response theory, and network analysis. Results: The factor analysis indicated the retention of three factors that grouped the MRhI items into cognitive, somatic, and affective domains. The item response theory analysis suggested the exclusion of items that displayed a significant divergence in difficulty measures between countries. Finally, the network analysis revealed a structure in which sleepiness plays a central role in connecting the three domains. These analyses enabled a psychometric-based refinement of the MRhI, in which the 11 items with good properties across cultures were kept in a shorter, revised version (MRhI-r). Limitations: Participants were mainly university students and, as we did not conduct a formal clinical assessment, any potential correlations (beyond the validated SRQ) cannot be ascertained. Conclusions: The MRhI-r is a novel tool to investigate self-perceived rhythmicity of mood-related symptoms and behaviors, with good psychometric properties across multiple cultures.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yunsoo Lee ◽  
Ji Hoon Song ◽  
Soo Jung Kim

Purpose: This paper aims to validate the Korean version of the decent work scale and examine the relationship between decent work and work engagement. Design/methodology/approach: After completing translation and back translation, the authors surveyed 266 Korean employees from various organizations via network sampling. They assessed Rasch’s model based on item response theory. In addition, they used classical test theory to evaluate the decent work scale’s validity and reliability. Findings: The authors found that the current version of the decent work scale has good validity, reliability and item difficulty, and decent work has a positive relationship with work engagement. However, based on item response theory, the assessment showed that three of the items are extremely similar to another item within the same dimension, implying that the items are unable to discriminate among individual traits. Originality/value: This study validated the decent work scale in a Korean work environment using Rasch’s (1960) model from the perspective of item response theory.


2013 ◽  
Vol 30 (4) ◽  
pp. 479-486
Author(s):  
Odoisa Antunes de Queiroz ◽  
Ricardo Primi ◽  
Lucas de Francisco Carvalho ◽  
Sônia Regina Fiorim Enumo

Dynamic testing, with an intermediate phase of assistance, measures changes between pretest and post-test, assuming a common metric between them. To test this assumption, we applied Item Response Theory (IRT) to the responses of 69 children to a dynamic cognitive test, the adapted Children's Analogical Thinking Modifiability Test, with 12 items, totaling 828 responses. The purpose was to verify whether the original scale yields the same results as the equated scale obtained by IRT in terms of quantifying change. We followed these steps: 1) anchoring of the pre- and post-test items through a cognitive analysis, finding 3 common items; 2) estimation and comparison of the items' difficulty parameters; 3) equating of the items and estimation of the thetas; 4) comparison of the scales. The Children's Analogical Thinking Modifiability Test metric was similar to that estimated by IRT, but it is necessary to differentiate the difficulty of the pre- and post-test items, adjusting it to samples with high and low performance.
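The anchoring and equating steps listed in this abstract can be illustrated with a simple mean-shift linking on common items. This is a sketch under the assumption of a difficulty-only model; the abstract does not state which linking method the authors used, and the item labels and difficulty values below are hypothetical.

```python
from statistics import mean


def linking_shift(pre_b, post_b, anchors):
    """Constant that moves post-test difficulties onto the pretest scale."""
    pre_mean = mean([pre_b[item] for item in anchors])
    post_mean = mean([post_b[item] for item in anchors])
    return pre_mean - post_mean


def to_pre_scale(post_b, shift):
    """Apply the linking constant to every post-test item difficulty."""
    return {item: b + shift for item, b in post_b.items()}


# Hypothetical difficulty estimates from two separate calibrations.
pre_b = {"A": 0.0, "B": 1.0, "C": 2.0, "P4": -0.5}
post_b = {"A": -0.5, "B": 0.5, "C": 1.5, "Q4": 0.0}
anchors = ["A", "B", "C"]  # three common items, as in the abstract

shift = linking_shift(pre_b, post_b, anchors)
print(shift, to_pre_scale(post_b, shift))
```

Once both item sets share a scale, ability estimates from the pretest and post-test can be compared directly, which is what quantifying change requires.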


Author(s):  
Mehmet Barış Horzum ◽  
Gülden Kaya Uyanik

The aim of this study is to examine the validity and reliability of the Community of Inquiry Scale, commonly used in online learning, by means of Item Response Theory (IRT). For this purpose, Community of Inquiry Scale version 14 was administered via the internet to 1,499 students in the online learning programs of a distance education center at a Turkish state university. The collected data were analyzed with a statistical software package in three respects: checking model assumptions, checking model-data fit, and item analysis. Item and test features of the scale were examined by means of the Graded Response Model (GRM). Before applying this IRT model, the model assumptions were tested on the data gathered from the 1,499 participants, and model-data fit was then examined. Following the affirmative results of these checks, all data were analyzed using the GRM. As a result of the study, the Community of Inquiry Scale adapted to Turkish by Horzum (in press) was found to be reliable and valid by means of Classical Test Theory and Item Response Theory.
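As a rough illustration of the graded response model (GRM) mentioned in this abstract, the sketch below computes category probabilities for a single polytomous item. The discrimination and threshold values are hypothetical, and the four thresholds assume a five-category rating item; neither is taken from the study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    thresholds must be in increasing order; K - 1 thresholds give K categories.
    """
    # Cumulative probabilities of responding in category k or higher,
    # padded with 1.0 (lowest category or above) and 0.0 (above the highest).
    cumulative = (
        [1.0]
        + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
        + [0.0]
    )
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical item: discrimination 1.5 and four ordered thresholds.
probs = grm_category_probs(theta=0.3, a=1.5, thresholds=[-1.5, -0.5, 0.5, 1.5])
print([round(p, 3) for p in probs])
```

Each category probability is the difference between two adjacent cumulative curves, so the probabilities are non-negative and sum to one whenever the thresholds are ordered.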


2019 ◽  
Vol 36 (17) ◽  
pp. 2493-2505 ◽  
Author(s):  
Jana Ranson ◽  
Brooke E. Magnus ◽  
Nancy Temkin ◽  
Sureyya Dikmen ◽  
Joseph T. Giacino ◽  
...  
