A novel technique of detecting DIF in items using z-score based on IRT model

2021 ◽  
Author(s):  
Ankita Dey

A simple z-score-based technique for detecting differential item functioning (DIF) in items is proposed and derived for the Rasch model, and is compared with traditional DIF detection methods. The technique is applied to the State Level Achievement Survey data set from West Bengal, India, using binary responses of 12518 examinees to 40-item tests.
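The abstract does not give the exact statistic, so the following is only a minimal sketch of one common way to build a z-score for DIF under the Rasch model: standardize the difference between an item's difficulty estimates in a reference group and a focal group. The function names, the input values, and the 1.96 cutoff are illustrative assumptions, not the author's derivation.

```python
import math


def dif_z_score(b_ref, se_ref, b_focal, se_focal):
    """Standardized difference between two group-wise Rasch difficulty estimates.

    Assumes the two estimates are independent and already on a common scale.
    """
    return (b_ref - b_focal) / math.sqrt(se_ref ** 2 + se_focal ** 2)


def flag_dif(z, critical=1.96):
    """Flag an item when |z| exceeds the chosen critical value (assumed cutoff)."""
    return abs(z) > critical


# Hypothetical difficulty estimates and standard errors for one item.
z = dif_z_score(b_ref=0.5, se_ref=0.3, b_focal=-0.3, se_focal=0.4)
print(z, flag_dif(z))
```

In practice the group-wise difficulties would come from separate Rasch calibrations that have been linked to a common scale before they are compared.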


Author(s):  
Michael S. Danielson

The first empirical task is to identify the characteristics of municipalities which US-based migrants have come together to support financially. Using a nationwide, municipal-level data set compiled by the author, the chapter estimates several multivariate statistical models to compare municipalities that did not benefit from the 3x1 Program for Migrants with those that did, and seeks to explain variation in the number and value of 3x1 projects. The analysis shows that migrants are more likely to contribute where migrant civil society has become more deeply institutionalized at the state level and in places with longer histories as migrant-sending places. Furthermore, the results suggest that political factors are at play, as projects have disproportionately benefited states and municipalities where the PAN had a stronger presence, with fewer occurring elsewhere.


2019 ◽  
Vol 9 (6) ◽  
pp. 1128 ◽  
Author(s):  
Yundong Li ◽  
Wei Hu ◽  
Han Dong ◽  
Xueyan Zhang

Using aerial cameras, satellite remote sensing or unmanned aerial vehicles (UAV) equipped with cameras can facilitate search and rescue tasks after disasters. The traditional manual interpretation of huge aerial images is inefficient and could be replaced by machine learning-based methods combined with image processing techniques. Given the development of machine learning, researchers find that convolutional neural networks can effectively extract features from images. Some target detection methods based on deep learning, such as the single-shot multibox detector (SSD) algorithm, can achieve better results than traditional methods. However, the impressive performance of machine learning-based methods depends on large numbers of labeled samples. Given the complexity of post-disaster scenarios, obtaining many samples in the aftermath of disasters is difficult. To address this issue, a damaged building assessment method using SSD with pretraining and data augmentation is proposed in the current study, with the following highlights. (1) Objects can be detected and classified into undamaged buildings, damaged buildings, and ruins. (2) A convolution auto-encoder (CAE) that consists of VGG16 is constructed and trained using unlabeled post-disaster images. As a transfer learning strategy, the weights of the SSD model are initialized using the weights of the CAE counterpart. (3) Data augmentation strategies, such as image mirroring, rotation, Gaussian blur, and Gaussian noise processing, are utilized to augment the training data set. As a case study, aerial images of Hurricane Sandy in 2012 were used to validate the proposed method’s effectiveness. Experiments show that the pretraining strategy improves overall accuracy by 10% compared with the SSD trained from scratch. These experiments also demonstrate that using data augmentation strategies can improve mAP and mF1 by 72% and 20%, respectively. Finally, the method is further verified on a second data set from Hurricane Irma, and the results indicate that the proposed method is feasible.
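The four augmentation strategies named in the abstract (mirroring, rotation, Gaussian blur, Gaussian noise) can be sketched on a tiny grayscale image stored as a list of rows. This is an illustrative, stdlib-only sketch: the 3x3 kernel, the 90-degree rotation, the noise level, and all function names are assumptions rather than the settings used in the paper.

```python
import random


def mirror(img):
    """Flip a 2-D grayscale image (list of rows) left to right."""
    return [row[::-1] for row in img]


def rotate90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def gaussian_blur3(img):
    """Blur with the 3x3 kernel [[1,2,1],[2,4,2],[1,2,1]] / 16, replicating edges."""
    kernel = [[1, 2, 1], [2, 4, 2], [1, 2, 1]]
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            acc = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy = min(h - 1, max(0, y + dy))  # clamp to the image border
                    xx = min(w - 1, max(0, x + dx))
                    acc += kernel[dy + 1][dx + 1] * img[yy][xx]
            row.append(acc / 16.0)
        out.append(row)
    return out


def add_gaussian_noise(img, sigma, rng):
    """Add zero-mean Gaussian noise and clip pixel values to the range 0..255."""
    return [[min(255.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in row]
            for row in img]


def augment(img, rng, sigma=5.0):
    """Return one augmented variant per strategy (sigma is an assumed noise level)."""
    return [mirror(img), rotate90(img), gaussian_blur3(img),
            add_gaussian_noise(img, sigma, rng)]


variants = augment([[10, 20], [30, 40]], random.Random(0))
print(len(variants))
```

For object detection, geometric transforms such as mirroring and rotation also require transforming the bounding-box labels, which this sketch leaves out.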


Electronics ◽  
2021 ◽  
Vol 10 (11) ◽  
pp. 1285
Author(s):  
Mohammed Al-Sarem ◽  
Faisal Saeed ◽  
Zeyad Ghaleb Al-Mekhlafi ◽  
Badiea Abdulkarem Mohammed ◽  
Tawfik Al-Hadhrami ◽  
...  

Security attacks on legitimate websites to steal users’ information, known as phishing attacks, have been increasing. This kind of attack does not just affect individuals’ or organisations’ websites. Although several detection methods for phishing websites have been proposed using machine learning, deep learning, and other approaches, their detection accuracy still needs to be enhanced. This paper proposes an optimized stacking ensemble method for phishing website detection. The optimisation was carried out using a genetic algorithm (GA) to tune the parameters of several ensemble machine learning methods, including random forests, AdaBoost, XGBoost, Bagging, GradientBoost, and LightGBM. The optimized classifiers were then ranked, and the best three models were chosen as base classifiers of a stacking ensemble method. The experiments were conducted on three phishing website datasets that consisted of both phishing websites and legitimate websites: the Phishing Websites Data Set from UCI (Dataset 1), the Phishing Dataset for Machine Learning from Mendeley (Dataset 2), and the Datasets for Phishing Websites Detection from Mendeley (Dataset 3). The experimental results showed an improvement using the optimized stacking ensemble method, where the detection accuracy reached 97.16%, 98.58%, and 97.39% for Dataset 1, Dataset 2, and Dataset 3, respectively.


2021 ◽  
Author(s):  
Hansi Hettiarachchi ◽  
Mariam Adedoyin-Olowe ◽  
Jagdev Bhogal ◽  
Mohamed Medhat Gaber

Social media is becoming a primary medium to discuss what is happening around the world. Therefore, the data generated by social media platforms contain rich information which describes the ongoing events. Further, the timeliness associated with these data is capable of facilitating immediate insights. However, considering the dynamic nature and high volume of data production in social media data streams, it is impractical to filter the events manually and therefore, automated event detection mechanisms are invaluable to the community. Apart from a few notable exceptions, most previous research on automated event detection has focused only on statistical and syntactical features in data and has neglected the underlying semantics, which are important for effective information retrieval from text since they represent the connections between words and their meanings. In this paper, we propose a novel method termed Embed2Detect for event detection in social media by combining the characteristics of word embeddings and hierarchical agglomerative clustering. The adoption of word embeddings gives Embed2Detect the capability to incorporate powerful semantic features into event detection and overcome a major limitation inherent in previous approaches. We evaluated our method on two recent real social media data sets which represent the sports and political domains and also compared the results to several state-of-the-art methods. The obtained results show that Embed2Detect is capable of effective and efficient event detection and that it outperforms recent event detection methods. For the sports data set, Embed2Detect achieved a 27% higher F-measure than the best-performing baseline, and for the political data set, the increase was 29%.
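To make the clustering ingredient concrete, the sketch below runs average-linkage hierarchical agglomerative clustering over a handful of toy word vectors using cosine distance. It is not the Embed2Detect pipeline, whose details the abstract does not give: the two-dimensional vectors, the stopping threshold, and the function names are all hypothetical.

```python
import math


def cosine_distance(u, v):
    """1 - cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def agglomerative_clusters(vectors, threshold):
    """Average-linkage agglomerative clustering over a {word: vector} dict.

    Repeatedly merges the two closest clusters and stops once the closest
    pair is farther apart than `threshold`.
    """
    clusters = [[word] for word in vectors]

    def linkage(c1, c2):
        dists = [cosine_distance(vectors[a], vectors[b]) for a in c1 for b in c2]
        return sum(dists) / len(dists)

    while len(clusters) > 1:
        dist, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if dist > threshold:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Toy 2-D "embeddings" (made up for illustration, not learned from data).
toy_vectors = {
    "goal": [1.0, 0.1],
    "penalty": [0.9, 0.2],
    "vote": [0.1, 1.0],
    "election": [0.2, 0.9],
}
print(agglomerative_clusters(toy_vectors, threshold=0.3))
```

With real embeddings the vectors would be learned from the text stream itself, and the resulting groups of semantically close words are what an event detector would then inspect.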


2021 ◽  
Author(s):  
Masaki Uto

Performance assessment, in which human raters assess examinee performance in a practical task, often involves the use of a scoring rubric consisting of multiple evaluation items to increase the objectivity of evaluation. However, even when using a rubric, assigned scores are known to depend on characteristics of the rubric’s evaluation items and the raters, thus decreasing ability measurement accuracy. To resolve this problem, item response theory (IRT) models that can estimate examinee ability while considering the effects of these characteristics have been proposed. These IRT models assume unidimensionality, meaning that a rubric measures one latent ability. In practice, however, this assumption might not be satisfied because a rubric’s evaluation items are often designed to measure multiple sub-abilities that constitute a targeted ability. To address this issue, this study proposes a multidimensional IRT model for rubric-based performance assessment. Specifically, the proposed model is formulated as a multidimensional extension of a generalized many-facet Rasch model. Moreover, a No-U-Turn variant of the Hamiltonian Markov chain Monte Carlo algorithm is adopted as a parameter estimation method for the proposed model. The proposed model is useful not only for improving the ability measurement accuracy, but also for detailed analysis of rubric quality and rubric construct validity. The study demonstrates the effectiveness of the proposed model through simulation experiments and application to real data.


2017 ◽  
Vol 17 (4) ◽  
pp. 850-868 ◽  
Author(s):  
William Soo Lon Wah ◽  
Yung-Tsang Chen ◽  
Gethin Wyn Roberts ◽  
Ahmed Elamin

Analyzing changes in vibration properties (e.g. natural frequencies) of structures as a result of damage has been heavily used by researchers for damage detection of civil structures. These changes, however, are not only caused by damage of the structural components, but they are also affected by the varying environmental conditions the structures are faced with, such as the temperature change, which limits the use of most damage detection methods presented in the literature that did not account for these effects. In this article, a damage detection method capable of distinguishing between the effects of damage and of the changing environmental conditions affecting damage sensitivity features is proposed. This method eliminates the need to form the baseline of the undamaged structure using damage sensitivity features obtained from a wide range of environmental conditions, as conventionally has been done, and utilizes features from two extreme and opposite environmental conditions as baselines. To allow near real-time monitoring, subsequent measurements are added one at a time to the baseline to create new data sets. Principal component analysis is then introduced for processing each data set so that patterns can be extracted and damage can be distinguished from environmental effects. The proposed method is tested using a two-dimensional truss structure and validated using measurements from the Z24 Bridge which was monitored for nearly a year, with damage scenarios applied to it near the end of the monitoring period. The results demonstrate the robustness of the proposed method for damage detection under changing environmental conditions. The method also works despite the nonlinear effects produced by environmental conditions on damage sensitivity features. Moreover, since each measurement is allowed to be analyzed one at a time, near real-time monitoring is possible. 
The method can also track damage progression, which makes it advantageous for monitoring damage evolution.
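The idea of adding one measurement at a time to a baseline taken from two extreme environmental conditions, then running principal component analysis on each resulting data set, can be sketched as follows. This is a rough stdlib-only illustration, not the authors' procedure: it assumes the first principal component captures the environmental trend and uses the new measurement's distance from that axis as a stand-in damage indicator. The frequency values and function names are made up.

```python
import math


def first_component(rows, iters=200):
    """Return (mean, unit first principal component) using power iteration.

    Assumes the data have non-zero variance and that the starting vector of
    ones is not orthogonal to the leading component.
    """
    n, d = len(rows), len(rows[0])
    mean = [sum(r[k] for r in rows) / n for k in range(d)]
    centered = [[r[k] - mean[k] for k in range(d)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return mean, v


def residual_after_adding(baseline, new_row):
    """Append one measurement to the baseline, redo PCA on the new data set,
    and return the new row's distance from the first-component axis."""
    mean, v = first_component(baseline + [new_row])
    c = [x - m for x, m in zip(new_row, mean)]
    score = sum(a * b for a, b in zip(c, v))
    resid = [a - score * b for a, b in zip(c, v)]
    return math.sqrt(sum(x * x for x in resid))


# Hypothetical natural frequencies (Hz) measured at two extreme temperatures.
baseline = [[10.0, 20.0], [10.1, 20.2],   # cold extreme
            [9.0, 18.0], [9.1, 18.2]]     # hot extreme
print(residual_after_adding(baseline, [9.5, 19.0]))  # follows the temperature trend
print(residual_after_adding(baseline, [9.5, 18.0]))  # second frequency has dropped
```

A measurement that only reflects a temperature change stays close to the first-component axis, while one whose frequencies shift in a different pattern leaves a larger residual; a real application would need a threshold calibrated on measurement noise.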


2018 ◽  
Vol 45 (4) ◽  
pp. 441-459 ◽  
Author(s):  
Sue Thomas ◽  
Ryan Treffers ◽  
Nancy F. Berglas ◽  
Laurie Drabble ◽  
Sarah C. M. Roberts

As U.S. states legalize marijuana and as governmental attention is paid to the “opioid crisis,” state policies pertaining to drug use during pregnancy are increasingly important. Little is known about the scope of state policies targeting drug use during pregnancy, how they have evolved, and how they compare to alcohol use during pregnancy policies. Method: Our 46-year original data set of statutes and regulations in U.S. states covers the entirety of state-level legislation in this policy domain. Data were obtained through original legal research and from the National Institute on Alcohol Abuse and Alcoholism’s Alcohol Policy Information System. Policies were analyzed individually as well as by classification as punitive toward or supportive of women. Results: The number of states with drug use during pregnancy policies has increased from 1 in 1974 to 43 in 2016. Policies started as punitive. By the mid- to late 1980s, supportive policies emerged, and mixed policy environments dominated in the 2000s. Overall, drug/pregnancy policy environments have become less supportive over time. Comparisons of drug laws to alcohol laws show that the policy trajectories started in opposite directions, but by 2016, the results were the same: Punitive policies were more prevalent than supportive policies across states. Moreover, there is a great deal of overlap between drug use during pregnancy policies and alcohol/pregnancy policies. Conclusion: This study breaks new ground. More studies are needed that explore the effects of these policies on alcohol and other drug use by pregnant women and on birth outcomes.

