scholarly journals Constructing a Scale of Attitudes toward School Science Using the General Graded Unfolding Model

Author(s):  
Taghreed Hijazi ◽  
Zaid Bani Ata

The present study aimed at constructing an attitude scale toward school science using the generalized graded unfolding model (GGUM). A 47-item scale (24 positive, 23 negative) with 4-point response format was used to measure attitudes toward science among 9th  (n=424) and 10th (n=420) grade students in 38 sections distributed randomly over 22 schools in Irbid district. Respondents selected one of four options to represent their level of agreement with each item. The findings support the hypothesis that the data form a single unidimensional unfolding model. Furthermore, the findings showed that the GGUM didn’t fit the data of 7 items, leaving the final scale with 40 items, where accurate estimates of these item parameters were derived and the GGUM was appropriate. Cronbach's alpha for the internal consistency, and the test retest reliability coefficients of the final scale were 0.932 and 0.875, respectively. 

2021 ◽  
pp. 014662162110404
Author(s):  
Naidan Tu ◽  
Bo Zhang ◽  
Lawrence Angrave ◽  
Tianjun Sun

Over the past couple of decades, there has been an increasing interest in adopting ideal point models to represent noncognitive constructs, as they have been demonstrated to better measure typical behaviors than traditional dominance models do. The generalized graded unfolding model ( GGUM) has consistently been the most popular ideal point model among researchers and practitioners. However, the GGUM2004 software and the later developed GGUM package in R can only handle unidimensional models despite the fact that many noncognitive constructs are multidimensional in nature. In addition, GGUM2004 and the GGUM package often yield unreasonable estimates of item parameters and standard errors. To address these issues, we developed the new open-source bmggum R package that is capable of estimating both unidimensional and multidimensional GGUM using a fully Bayesian approach, with supporting capabilities of stabilizing parameterization, incorporating person covariates, estimating constrained models, providing fit diagnostics, producing convergence metrics, and effectively handling missing data.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Piotr Przymuszała ◽  
Magdalena Cerbin-Koczorowska ◽  
Patrycja Marciniak-Stępak ◽  
Łucja Zielińska-Tomczak ◽  
Martyna Piszczek ◽  
...  

Abstract Background The Communication Skills Attitude Scale (CSAS) is a recognized tool for assessment of attitudes towards communication learning. In the original version, it consists of 26 items divided on theoretical assumptions into two subscales: Positive and Negative Attitudes Scales. However, the evidence for its structure seems unsatisfactory, and a simple division into positive and negative attitudes may be insufficient to describe attitudes of medical students towards communication learning. Moreover, the existing evidence of the test-retest reliability of the CSAS seems limited. Consequently, this study aimed to provide more evidence on its psychometric properties while validating the CSAS questionnaire in a cohort of Polish medical students. Methods The CSAS was translated, adapted into Polish, and validated in a cohort of 389 Polish medical students. Statistical analysis involved, among others, parallel analysis to determine the number of factors, confirmatory factor analysis to compare the proposed model with theory-based ones, and test-retest reliability analysis. Results Conducted analysis revealed that in the examined population, the CSAS should rather consist of four than two subscales. Proposed four subscales addressed perceived outcomes of communication learning, positive and negative attitudes towards it (affective components), and factors motivating students to learn communication (a cognitive component of attitudes). Results of test-retest reliability were satisfactory for individual items and subscales. Conclusions This study presented a valid and reliable version of the Communication Skills Attitude Scale for Polish medical students and confirmed previous assumptions that CSAS may also be appropriate for assessment of affective and cognitive components of attitudes. Future research should, based on Ajzen’s Theory of Planned Behavior, make attempts to develop a tool assessing not only attitudes but also subjective norms and perceived behavioral control.


2021 ◽  
pp. 014662162110517
Author(s):  
Seang-Hwane Joo ◽  
Philseok Lee ◽  
Stephen Stark

Collateral information has been used to address subpopulation heterogeneity and increase estimation accuracy in some large-scale cognitive assessments. The methodology that takes collateral information into account has not been developed and explored in published research with models designed specifically for noncognitive measurement. Because the accurate noncognitive measurement is becoming increasingly important, we sought to examine the benefits of using collateral information in latent trait estimation with an item response theory model that has proven valuable for noncognitive testing, namely, the generalized graded unfolding model (GGUM). Our presentation introduces an extension of the GGUM that incorporates collateral information, henceforth called Explanatory GGUM. We then present a simulation study that examined Explanatory GGUM latent trait estimation as a function of sample size, test length, number of background covariates, and correlation between the covariates and the latent trait. Results indicated the Explanatory GGUM approach provides scoring accuracy and precision superior to traditional expected a posteriori (EAP) and full Bayesian (FB) methods. Implications and recommendations are discussed.


2003 ◽  
Vol 11 (1) ◽  
pp. 1-15 ◽  
Author(s):  
Kelly Thompson ◽  
Eleonora Gullone

AbstractRecognizing the importance of increasing the levels of children's humane behavior toward animals other than humans relates to the developing of valid and reliable measures of such behavior. This study reports the psychometric properties of the Children's Treatment of Animals Questionnaire (CTAQ), which assesses children's humane behavior toward nonhuman animals. The findings, based on self-reports by 61 elementary school children (25 boys; 36 girls), showed that the 13-item scale has adequate internal consistency. In addition, comparing two administrations of the scale over a five-week period demonstrated good test-retest reliability. The scale's convergent validity was demonstrated with significant correlations between responses on the CTAQ and two previously validated measures of empathy. The study concluded that the CTAQ is a valid and reliable measure for assessing the degree to which children's behavior toward nonhuman animals is humane. Determining the sensitivity of the measure to change (following humane education) and the predictive validity of the measure (identification of children who are cruel to animals) will require further research.


2019 ◽  
Vol 43 (6) ◽  
pp. 717 ◽  
Author(s):  
Helen Corbett ◽  
Kristen Pearson ◽  
Leila Karimi ◽  
Wen Kwang Lim

Objective The aim of this study was to investigate the psychometric properties of a multisource review survey tool for medical consultants in an Australian health care setting. Methods Two sets of survey data from a convenience sample of medical consultants were analysed using SPSS, comprising self-assessment data from 73 consultants and data from 734 peer reviewers. The 20-question survey consisted of three subscales, plus an additional global question for reviewers. Analysis included the reliability coefficient (α) of the scale and the three subscales, inter-rater reliability or agreement and validity of the model, correlation between the single global question, the total performance score and the three survey subfactors (Pearson’s), interrater agreement (rWG(J)), the optimal number of peer reviewers required and model-based reliability (ρ). Results The global question, total performance score and the three subfactors were strongly correlated (general scale r=0.81, clinical subscale r=0.78, humanistic subscale r =0.74, management subscale r=0.75; two-tailed P<0.01 for all). The scale showed very good internal consistency, except for the five-question management subscale. Model-based reliability was excellent (ρ=0.93). Confirmatory factor analysis showed the model fit using the 20-item scale was not satisfactory (minimum discrepancy/d.f.=7.70; root mean square error of approximation=0.10; comparative fit index=0.79; Tucker–Lewis index=0.76). A modified 13-item model provided a good fit. Using the 20-item scale, a 99% level of agreement could be achieved with eight to 10 peer reviewers; for the same level of agreement, the number of reviewers increased to >10 using a revised 13-item scale. Conclusions Overall, the 20-item multisource review survey tool showed good internal consistency reliability for both self and peer ratings; however, further investigation using a larger dataset is needed to analyse the robustness of the model and to clarify the role that a single global question may play in future multisource review processes. What is known about the topic? Defining and measuring skills and behaviours that reflect competence in the health setting have proven to be complex, and this has resulted in the development of specific multisource feedback surveys for individual medical specialities. Because little literature exists on multisource reviews in an Australian context, a pilot study of a revised survey tool was undertaken at an Australian tertiary hospital. What does this paper add? The aim of this study was to investigate the psychometric properties of a generic tool (used across specialities) by assessing the validity, reliability and interrater reliability of the scale and to consider the contribution of a single global question to the overall multisource feedback process. This study provides evidence of the validity and reliability of the survey tool under investigation. The strong correlation between the global item, the total performance score and the three subfactors suggests that this is an area requiring further investigation to determine the role that a robust single global question like this may play in future multisource review surveys. Our five-question management skills subscale provides answers to questions relevant to the specific organisation surveyed, and we anticipate that it may serve to stimulate further exploration in this area. What are the implications for practitioners? The survey tool may provide a valid and reliable basis for performance review of medical consultants in an Australian healthcare setting.


AAOHN Journal ◽  
2006 ◽  
Vol 54 (5) ◽  
pp. 201-209 ◽  
Author(s):  
Arnold B. de Castro ◽  
Barbara Curbow ◽  
Jacqueline Agnew ◽  
Jennifer A. Haythornthwaite ◽  
Sheila T. Fitzgerald

This analysis examines the applicability of the emotional labor scale from the Emotions at Work Scale (EWS) through the assessment of its psychometric properties in a sample of young workers. Factor analysis and test—retest reliability were conducted on a 13-item scale measuring emotional labor. The EWS 13-item emotional labor scale was refined to 9 items. Two subscales were delineated: 5 items measured surface acting and 4 items measured deep acting, each with a mean inter-item correlation of 0.33. Cronbach's alpha was .96 for the 9-item scale, and .71 and .67 for the surface acting and deep acting subscales, respectively. Test—retest reliability was 0.64 for surface acting and 0.51 for deep acting during a mean interval of 3 months. Emotional labor can be quantitatively measured among young workers using the derived 9-item scale, although additional studies further evaluating its use should be conducted.


2020 ◽  
pp. 019394592094225
Author(s):  
Chen X. Chen ◽  
Tabitha Murphy ◽  
Susan Ofner ◽  
Lilian Yahng ◽  
Peter Krombach ◽  
...  

Dysmenorrhea affects most reproductive-age women and increases the risk of future pain. To evaluate dysmenorrhea interventions, validated outcome measures are needed. In this two-phase study, we developed and tested the dysmenorrhea symptom interference scale. During the scale-development phase ( n = 30), we created a nine-item scale based on qualitative data from cognitive interviews. During the scale-testing phase ( n = 686), we evaluated reliability, validity, and responsiveness to change. The scale measures how dysmenorrhea symptoms interfere with physical, mental, and social activities. Internal consistency was strong with Cronbach’s α > 0.9. Test–retest reliability was acceptable ( r = 0.8). The scale showed satisfactory content validity, construct validity (supported by confirmatory factor analysis), concurrent validity, and responsiveness to change. The minimally important difference was 0.3 points on a scale with a possible total score ranging from 1 to 5. This new psychometrically sound scale can be used in research and clinical practice to facilitate the measurement and management of dysmenorrhea.


Sign in / Sign up

Export Citation Format

Share Document