IRT Versus Conventional Equating Methods: A Comparative Study of Scale Stability

1983 ◽  
Vol 8 (2) ◽  
pp. 137-156 ◽  
Author(s):  
Nancy S. Petersen ◽  
Linda L. Cook ◽  
Martha L. Stocking

Scale drift for the verbal and mathematical portions of the Scholastic Aptitude Test (SAT) was investigated using linear, equipercentile and item response theory (IRT) equating methods. The linear methods investigated were the Tucker, Levine Equally Reliable and Levine Unequally Reliable models. Three IRT calibration designs were employed. These designs are referred to as (1) concurrent, (2) fixed b’s method, and (3) characteristic curve transformation method. The results of the various equating methods were compared both graphically and analytically. These results indicated that for reasonably parallel tests, linear equating methods perform adequately. However, when tests differ somewhat in content and length, methods based on the three-parameter logistic IRT model lead to greater stability of equating results. Of the conventional equating methods investigated, the Levine Equally Reliable model appears to be the most robust for the type of equating situation used in this study. The IRT method that provided the most stable equating results overall was the concurrent calibration method.

1968 ◽  
Vol 23 (1) ◽  
pp. 119-134 ◽  
Author(s):  
Leon H. Belcher ◽  
Joel T. Campbell

Two word-association lists of 50 words were each administered to 50 Negro college students. 41 words were taken from the Kent-Rosanoff list, 29 from the Palermo-Jenkins list, and 30 were words used in analogy items of the Scholastic Aptitude Test. Comparisons with previous normative studies showed generally similar results. The present study did result in slightly smaller proportions of matching form class primary responses to noun, pronoun, and adverb stimulus words and of opposite responses to “opposite-evoking stimuli.” A number of the responses indicated reading difficulty or misunderstanding of the word.


2014 ◽  
Vol 519-520 ◽  
pp. 636-639
Author(s):  
Bao Long Zhang ◽  
Shao Jing Zhang ◽  
Wei Qi Ding ◽  
Hui Shuang Shi

The fisheye lens is an ultra-wide-angle lens that produces large distortion. To cover a wide field of view, barrel distortion is deliberately introduced into the optical system. In some applications this distortion is unacceptable and must be calibrated out. Most traditional distortion calibration methods rely on a planar calibration target. This paper discusses the design of fisheye lenses, which makes clear how the distortion arises. On this basis, a simple and effective calibration method is presented. Unlike common camera calibration methods, the proposed method uses the lens’s characteristic curve directly and so avoids the errors introduced during calibration tests. Multiple sets of experiments verify that the method is effective and feasible.


2021 ◽  
Author(s):  
Ehsan Shahiri Tabarestani ◽  
Hossein Afzalimehr

Floods are one of the most damaging natural disasters throughout the world. The purpose of this study is to develop a reliable model for identifying flood-susceptible areas. Three multi-criteria decision-making techniques, namely Analytical Hierarchy Process (AHP), Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS), and Multi-Attributive Border Approximation Area Comparison (MABAC), combined with weight of evidence (WOE), were used in Mazandaran Province, Iran. This study applies the MABAC method to flood susceptibility assessment for the first time. First, 160 flood locations were identified in the study area, of which 112 (70%) were selected randomly for modeling and the remaining 48 (30%) were used for validation. Flood susceptibility maps were prepared in a Geographic Information System (GIS) using eight conditioning factors: rainfall, distance from rivers, slope, soil, geology, elevation, drainage density, and land use. The results showed that the area under the receiver operating characteristic curve (AUROC) for the test data of the AHP-WOE, TOPSIS-WOE-AHP, and MABAC-WOE-AHP methods was 75.3%, 91.6%, and 86.1%, respectively, indicating reasonable accuracy of the models. The high accuracy of the proposed new model (MABAC) supports its applicability to preventive measures.
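
For readers unfamiliar with the AUROC figures quoted above, the following minimal sketch computes the statistic in its pairwise (Mann-Whitney) form: the share of flood/non-flood location pairs in which the flood location receives the higher susceptibility score. The function name and the scores are hypothetical, and the abstract does not say how non-flood locations were sampled, so this is only an illustration of the metric, not of the authors' workflow.

```python
def auroc(scores_pos, scores_neg):
    """Area under the ROC curve via pairwise comparison.

    Returns the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half a win.
    """
    if not scores_pos or not scores_neg:
        raise ValueError("need at least one positive and one negative score")
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))

# Hypothetical susceptibility scores, made up for this example
flood_scores = [0.9, 0.8, 0.4]      # validation locations that flooded
non_flood_scores = [0.7, 0.3, 0.2]  # locations with no recorded flood

print(auroc(flood_scores, non_flood_scores))
```

A value of 0.5 corresponds to a model that ranks locations no better than chance, and 1.0 to a model that ranks every flood location above every non-flood location.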


Sensors ◽  
2018 ◽  
Vol 18 (9) ◽  
pp. 2842 ◽  
Author(s):  
Wei Liu ◽  
Bing Liang ◽  
Zhenyuan Jia ◽  
Di Feng ◽  
Xintong Jiang ◽  
...  

High-precision position control is essential in parts manufacturing and assembly, where eddy current displacement sensors (ECDSs) are widely used owing to their non-contact sensing, compact volume, and resistance to harsh conditions. To address the nonlinear characteristics of the sensors, this paper proposes a high-accuracy calibration method for ECDSs based on linearity adjustment, which markedly improves the calibration accuracy and, in turn, the measurement accuracy. After the displacement values are matched with the sensor output voltages, the sensitivity is first adjusted according to the specified output range. Then, weighted support vector adjustment models with the optimal weights of the zero-scale, mid-scale, and full-scale are established to cyclically adjust the linearity of the output characteristic curve. Finally, the final linearity adjustment model is obtained, and both the calibration accuracy and precision are verified with the established calibration system. Experimental results show that the linearity of the ECDS output characteristic curve after adjustment exceeds 99.9%, an increase of 1.9–5.0% over the original. In addition, the measurement accuracy improves from 11–25 μm to 1–10 μm over a range of 6 mm, which provides a reliable guarantee for high-accuracy displacement measurement.


1993 ◽  
Vol 18 (2) ◽  
pp. 131-154 ◽  
Author(s):  
John R. Donoghue ◽  
Nancy L. Allen

This Monte Carlo study examined strategies for forming the matching variable for the Mantel-Haenszel (MH) differential item functioning (DIF) procedure; thin matching on total test score was compared to forms of thick matching, pooling levels of the matching variable. Data were generated using a three-parameter logistic (3PL) item response theory (IRT) model with common guessing parameter. Number of subjects and test length were manipulated, as were the difficulty, discrimination, and presence/absence of DIF in the studied item. Outcome measures were the transformed log-odds Δ̂MH, its standard error, and the MH chi-square statistic. For short tests (5 or 10 items), thin matching yielded very poor results, with a tendency to falsely identify items as possessing DIF against the reference group. The best methods of thick matching yielded outcome measure values closer to the expected value for non-DIF items, as well as a larger value than thin matching when the studied item possessed DIF. Intermediate length tests yielded similar results for thin matching and the best methods of thick matching. The method of thick matching that performed best depended on the measure used to detect DIF. Both difficulty and discrimination of the studied item were found to have a strong effect on the value of Δ̂MH.
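
The transformed log-odds statistic named in the abstract can be sketched as follows, assuming the usual definition: a Mantel-Haenszel common odds ratio pooled across levels of the matching variable, then rescaled by -2.35 times its natural logarithm. Each stratum is one score level under thin matching, or a pooled band of levels under thick matching. The function name and the counts are hypothetical, and the sketch omits the standard error and chi-square statistic.

```python
import math

def mh_delta(tables):
    """Mantel-Haenszel common odds ratio and transformed log-odds.

    Each table is (a, b, c, d) for one level of the matching variable:
      a = reference group correct,  b = reference group incorrect,
      c = focal group correct,      d = focal group incorrect.
    Strata with no examinees must be dropped before calling.
    """
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    alpha = num / den
    return alpha, -2.35 * math.log(alpha)

# Hypothetical counts for three matched score levels, made up for this example
strata = [(30, 20, 20, 30), (40, 10, 30, 20), (45, 5, 40, 10)]

alpha, delta = mh_delta(strata)
print(round(alpha, 3), round(delta, 3))
```

With this sign convention, a negative delta indicates that the studied item is harder for focal-group examinees than for matched reference-group examinees, and a value near zero indicates no DIF.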


Psihologija ◽  
2012 ◽  
Vol 45 (2) ◽  
pp. 189-207 ◽  
Author(s):  
Bojana Dinic ◽  
Bojan Janicic

The aim of this research was to examine the psychometric properties of the Buss-Perry Aggression Questionnaire (AQ) on a Serbian sample, using the IRT model for graded responses. The AQ contains four subscales: Physical aggression, Verbal aggression, Hostility and Anger. The sample included 1272 participants of both genders, aged 18 to 68 years, with an average age of 31.39 (SD = 12.63) years. Results of the IRT analysis suggested that the subscales provided more information in the range of above-average scores, that is, for participants with higher levels of aggressiveness. The exception was the Hostility subscale, which was informative across a wider range of the trait. On the other hand, this subscale contains two items that violate the assumption of homogeneity. Implications for the measurement of aggressiveness are discussed.

