Constructing a Scale of Attitudes toward School Science Using the General Graded Unfolding Model

The present study aimed at constructing an attitude scale toward school science using the generalized graded unfolding model (GGUM). A 47-item scale (24 positive, 23 negative) with 4-point response format was used to measure attitudes toward science among 9th (n=424) and 10th (n=420) grade students in 38 sections distributed randomly over 22 schools in Irbid district. Respondents selected one of four options to represent their level of agreement with each item. The findings support the hypothesis that the data form a single unidimensional unfolding model. Furthermore, the findings showed that the GGUM didn’t fit the data of 7 items, leaving the final scale with 40 items, where accurate estimates of these item parameters were derived and the GGUM was appropriate. Cronbach's alpha for the internal consistency, and the test retest reliability coefficients of the final scale were 0.932 and 0.875, respectively.

Download Full-text

Markov Chain Monte Carlo Estimation of Item Parameters for the Generalized Graded Unfolding Model

Applied Psychological Measurement ◽

10.1177/0146621605282772 ◽

2006 ◽

Vol 30 (3) ◽

pp. 216-232 ◽

Cited By ~ 30

Author(s):

Jimmy de la Torre ◽

Stephan Stark ◽

Oleksandr S. Chernyshenko

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chain Monte Carlo ◽

Unfolding Model ◽

Item Parameters ◽

Generalized Graded Unfolding Model ◽

Monte Carlo Estimation

Download Full-text

bmggum: An R Package for Bayesian Estimation of the Multidimensional Generalized Graded Unfolding Model With Covariates

Applied Psychological Measurement ◽

10.1177/01466216211040488 ◽

2021 ◽

pp. 014662162110404

Author(s):

Naidan Tu ◽

Bo Zhang ◽

Lawrence Angrave ◽

Tianjun Sun

Keyword(s):

Ideal Point ◽

R Package ◽

Point Model ◽

Unfolding Model ◽

The Past ◽

Item Parameters ◽

Generalized Graded Unfolding Model ◽

Constrained Models ◽

Fully Bayesian ◽

Noncognitive Constructs

Over the past couple of decades, there has been an increasing interest in adopting ideal point models to represent noncognitive constructs, as they have been demonstrated to better measure typical behaviors than traditional dominance models do. The generalized graded unfolding model ( GGUM) has consistently been the most popular ideal point model among researchers and practitioners. However, the GGUM2004 software and the later developed GGUM package in R can only handle unidimensional models despite the fact that many noncognitive constructs are multidimensional in nature. In addition, GGUM2004 and the GGUM package often yield unreasonable estimates of item parameters and standard errors. To address these issues, we developed the new open-source bmggum R package that is capable of estimating both unidimensional and multidimensional GGUM using a fully Bayesian approach, with supporting capabilities of stabilizing parameterization, incorporating person covariates, estimating constrained models, providing fit diagnostics, producing convergence metrics, and effectively handling missing data.

Download Full-text

Constructing a Scale of Attitudes toward School Science Using the General Graded Unfolding Model

Journal of Educational and Psychological Studies [JEPS] ◽

10.12816/0002603 ◽

2013 ◽

Vol 7 (4) ◽

pp. 237-248

Author(s):

Zaid Bani Ata ◽

Taghreed Hijazi

Keyword(s):

School Science ◽

Attitudes Toward School ◽

Unfolding Model

Download Full-text

Affective and cognitive components of students’ attitudes towards communication learning - validation of the Communication Skills Attitude Scale in a cohort of polish medical students

BMC Medical Education ◽

10.1186/s12909-021-02626-7 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Piotr Przymuszała ◽

Magdalena Cerbin-Koczorowska ◽

Patrycja Marciniak-Stępak ◽

Łucja Zielińska-Tomczak ◽

Martyna Piszczek ◽

...

Keyword(s):

Medical Students ◽

Communication Skills ◽

Behavioral Control ◽

Attitude Scale ◽

Future Research ◽

Perceived Behavioral Control ◽

Negative Attitudes ◽

Retest Reliability ◽

Cognitive Components ◽

Test Retest Reliability

Abstract Background The Communication Skills Attitude Scale (CSAS) is a recognized tool for assessment of attitudes towards communication learning. In the original version, it consists of 26 items divided on theoretical assumptions into two subscales: Positive and Negative Attitudes Scales. However, the evidence for its structure seems unsatisfactory, and a simple division into positive and negative attitudes may be insufficient to describe attitudes of medical students towards communication learning. Moreover, the existing evidence of the test-retest reliability of the CSAS seems limited. Consequently, this study aimed to provide more evidence on its psychometric properties while validating the CSAS questionnaire in a cohort of Polish medical students. Methods The CSAS was translated, adapted into Polish, and validated in a cohort of 389 Polish medical students. Statistical analysis involved, among others, parallel analysis to determine the number of factors, confirmatory factor analysis to compare the proposed model with theory-based ones, and test-retest reliability analysis. Results Conducted analysis revealed that in the examined population, the CSAS should rather consist of four than two subscales. Proposed four subscales addressed perceived outcomes of communication learning, positive and negative attitudes towards it (affective components), and factors motivating students to learn communication (a cognitive component of attitudes). Results of test-retest reliability were satisfactory for individual items and subscales. Conclusions This study presented a valid and reliable version of the Communication Skills Attitude Scale for Polish medical students and confirmed previous assumptions that CSAS may also be appropriate for assessment of affective and cognitive components of attitudes. Future research should, based on Ajzen’s Theory of Planned Behavior, make attempts to develop a tool assessing not only attitudes but also subjective norms and perceived behavioral control.

Download Full-text

The Explanatory Generalized Graded Unfolding Model: Incorporating Collateral Information to Improve the Latent Trait Estimation Accuracy

Applied Psychological Measurement ◽

10.1177/01466216211051717 ◽

2021 ◽

pp. 014662162110517

Author(s):

Seang-Hwane Joo ◽

Philseok Lee ◽

Stephen Stark

Keyword(s):

Large Scale ◽

Latent Trait ◽

Theory Model ◽

Estimation Accuracy ◽

Cognitive Assessments ◽

Collateral Information ◽

Unfolding Model ◽

Generalized Graded Unfolding Model ◽

Scoring Accuracy ◽

Trait Estimation

Collateral information has been used to address subpopulation heterogeneity and increase estimation accuracy in some large-scale cognitive assessments. The methodology that takes collateral information into account has not been developed and explored in published research with models designed specifically for noncognitive measurement. Because the accurate noncognitive measurement is becoming increasingly important, we sought to examine the benefits of using collateral information in latent trait estimation with an item response theory model that has proven valuable for noncognitive testing, namely, the generalized graded unfolding model (GGUM). Our presentation introduces an extension of the GGUM that incorporates collateral information, henceforth called Explanatory GGUM. We then present a simulation study that examined Explanatory GGUM latent trait estimation as a function of sample size, test length, number of background covariates, and correlation between the covariates and the latent trait. Results indicated the Explanatory GGUM approach provides scoring accuracy and precision superior to traditional expected a posteriori (EAP) and full Bayesian (FB) methods. Implications and recommendations are discussed.

Download Full-text

Computerized Classification Testing Under the Generalized Graded Unfolding Model

Educational and Psychological Measurement ◽

10.1177/0013164410391575 ◽

2011 ◽

Vol 71 (1) ◽

pp. 114-128 ◽

Cited By ~ 2

Author(s):

Wen-Chung Wang ◽

Chen-Wei Liu

Keyword(s):

Unfolding Model ◽

Generalized Graded Unfolding Model

Download Full-text

The Children's Treatment of Animals Questionnaire (CTAQ): A Psychometric Investigation

Society and Animals ◽

10.1163/156853003321618800 ◽

2003 ◽

Vol 11 (1) ◽

pp. 1-15 ◽

Cited By ~ 15

Author(s):

Kelly Thompson ◽

Eleonora Gullone

Keyword(s):

Convergent Validity ◽

Reliable Measure ◽

Good Test ◽

Humane Education ◽

Self Reports ◽

Nonhuman Animals ◽

Test Retest Reliability ◽

Psychometric Investigation ◽

Adequate Internal Consistency ◽

Item Scale

AbstractRecognizing the importance of increasing the levels of children's humane behavior toward animals other than humans relates to the developing of valid and reliable measures of such behavior. This study reports the psychometric properties of the Children's Treatment of Animals Questionnaire (CTAQ), which assesses children's humane behavior toward nonhuman animals. The findings, based on self-reports by 61 elementary school children (25 boys; 36 girls), showed that the 13-item scale has adequate internal consistency. In addition, comparing two administrations of the scale over a five-week period demonstrated good test-retest reliability. The scale's convergent validity was demonstrated with significant correlations between responses on the CTAQ and two previously validated measures of empathy. The study concluded that the CTAQ is a valid and reliable measure for assessing the degree to which children's behavior toward nonhuman animals is humane. Determining the sensitivity of the measure to change (following humane education) and the predictive validity of the measure (identification of children who are cruel to animals) will require further research.

Download Full-text

Improving the utility of multisource feedback for medical consultants in a tertiary hospital: a study of the psychometric properties of a survey tool

Australian Health Review ◽

10.1071/ah17219 ◽

2019 ◽

Vol 43 (6) ◽

pp. 717 ◽

Cited By ~ 3

Author(s):

Helen Corbett ◽

Kristen Pearson ◽

Leila Karimi ◽

Wen Kwang Lim

Keyword(s):

Psychometric Properties ◽

Tertiary Hospital ◽

Multisource Feedback ◽

Performance Score ◽

Survey Tool ◽

Peer Reviewers ◽

Good Internal Consistency ◽

Total Performance ◽

Item Scale ◽

Level Of Agreement

Objective The aim of this study was to investigate the psychometric properties of a multisource review survey tool for medical consultants in an Australian health care setting. Methods Two sets of survey data from a convenience sample of medical consultants were analysed using SPSS, comprising self-assessment data from 73 consultants and data from 734 peer reviewers. The 20-question survey consisted of three subscales, plus an additional global question for reviewers. Analysis included the reliability coefficient (α) of the scale and the three subscales, inter-rater reliability or agreement and validity of the model, correlation between the single global question, the total performance score and the three survey subfactors (Pearson’s), interrater agreement (rWG(J)), the optimal number of peer reviewers required and model-based reliability (ρ). Results The global question, total performance score and the three subfactors were strongly correlated (general scale r=0.81, clinical subscale r=0.78, humanistic subscale r =0.74, management subscale r=0.75; two-tailed P<0.01 for all). The scale showed very good internal consistency, except for the five-question management subscale. Model-based reliability was excellent (ρ=0.93). Confirmatory factor analysis showed the model fit using the 20-item scale was not satisfactory (minimum discrepancy/d.f.=7.70; root mean square error of approximation=0.10; comparative fit index=0.79; Tucker–Lewis index=0.76). A modified 13-item model provided a good fit. Using the 20-item scale, a 99% level of agreement could be achieved with eight to 10 peer reviewers; for the same level of agreement, the number of reviewers increased to >10 using a revised 13-item scale. Conclusions Overall, the 20-item multisource review survey tool showed good internal consistency reliability for both self and peer ratings; however, further investigation using a larger dataset is needed to analyse the robustness of the model and to clarify the role that a single global question may play in future multisource review processes. What is known about the topic? Defining and measuring skills and behaviours that reflect competence in the health setting have proven to be complex, and this has resulted in the development of specific multisource feedback surveys for individual medical specialities. Because little literature exists on multisource reviews in an Australian context, a pilot study of a revised survey tool was undertaken at an Australian tertiary hospital. What does this paper add? The aim of this study was to investigate the psychometric properties of a generic tool (used across specialities) by assessing the validity, reliability and interrater reliability of the scale and to consider the contribution of a single global question to the overall multisource feedback process. This study provides evidence of the validity and reliability of the survey tool under investigation. The strong correlation between the global item, the total performance score and the three subfactors suggests that this is an area requiring further investigation to determine the role that a robust single global question like this may play in future multisource review surveys. Our five-question management skills subscale provides answers to questions relevant to the specific organisation surveyed, and we anticipate that it may serve to stimulate further exploration in this area. What are the implications for practitioners? The survey tool may provide a valid and reliable basis for performance review of medical consultants in an Australian healthcare setting.

Download Full-text

Measuring Emotional Labor among Young Workers

AAOHN Journal ◽

10.1177/216507990605400503 ◽

2006 ◽

Vol 54 (5) ◽

pp. 201-209 ◽

Cited By ~ 7

Author(s):

Arnold B. de Castro ◽

Barbara Curbow ◽

Jacqueline Agnew ◽

Jennifer A. Haythornthwaite ◽

Sheila T. Fitzgerald

Keyword(s):

Factor Analysis ◽

Psychometric Properties ◽

Emotional Labor ◽

Young Workers ◽

Surface Acting ◽

Retest Reliability ◽

Deep Acting ◽

Cronbach's Alpha ◽

Test Retest Reliability ◽

Item Scale

This analysis examines the applicability of the emotional labor scale from the Emotions at Work Scale (EWS) through the assessment of its psychometric properties in a sample of young workers. Factor analysis and test—retest reliability were conducted on a 13-item scale measuring emotional labor. The EWS 13-item emotional labor scale was refined to 9 items. Two subscales were delineated: 5 items measured surface acting and 4 items measured deep acting, each with a mean inter-item correlation of 0.33. Cronbach's alpha was .96 for the 9-item scale, and .71 and .67 for the surface acting and deep acting subscales, respectively. Test—retest reliability was 0.64 for surface acting and 0.51 for deep acting during a mean interval of 3 months. Emotional labor can be quantitatively measured among young workers using the derived 9-item scale, although additional studies further evaluating its use should be conducted.

Download Full-text

Development and Testing of the Dysmenorrhea Symptom Interference (DSI) Scale

Western Journal of Nursing Research ◽

10.1177/0193945920942252 ◽

2020 ◽

pp. 019394592094225

Author(s):

Chen X. Chen ◽

Tabitha Murphy ◽

Susan Ofner ◽

Lilian Yahng ◽

Peter Krombach ◽

...

Keyword(s):

Concurrent Validity ◽

Qualitative Data ◽

Reproductive Age ◽

Two Phase ◽

Reproductive Age Women ◽

Phase Study ◽

Confirmatory Factor ◽

Test Retest Reliability ◽

Item Scale ◽

Responsiveness To Change

Dysmenorrhea affects most reproductive-age women and increases the risk of future pain. To evaluate dysmenorrhea interventions, validated outcome measures are needed. In this two-phase study, we developed and tested the dysmenorrhea symptom interference scale. During the scale-development phase ( n = 30), we created a nine-item scale based on qualitative data from cognitive interviews. During the scale-testing phase ( n = 686), we evaluated reliability, validity, and responsiveness to change. The scale measures how dysmenorrhea symptoms interfere with physical, mental, and social activities. Internal consistency was strong with Cronbach’s α > 0.9. Test–retest reliability was acceptable ( r = 0.8). The scale showed satisfactory content validity, construct validity (supported by confirmatory factor analysis), concurrent validity, and responsiveness to change. The minimally important difference was 0.3 points on a scale with a possible total score ranging from 1 to 5. This new psychometrically sound scale can be used in research and clinical practice to facilitate the measurement and management of dysmenorrhea.

Download Full-text