scholarly journals The Self-Reflection and Insight Scale: Applying Item Response Theory to Craft an Efficient Short Form

2020 ◽  
Author(s):  
Paul Silvia

The human ability for self-consciousness—the capacity to reflect on oneself and to think about one’s thoughts, experiences, and actions—is central to understanding personality and motivation. The present research examined the psychometric properties of the Self-reflection and Insight Scale (SRIS), a prominent self-report scale for measuring individual differences in private self-consciousness. Using tools from Rasch and item response theory models, the SRIS was evaluated using responses from a large sample of young adults (n = 1192). The SRIS had many strengths, including essentially zero gender-based differential item functioning (DIF), but a cluster of poor performing items was identified based on item misfit, high local dependence, and low item difficulty and discrimination. Based on the IRT analyses, a concise 12-item scale, evenly balanced between self-reflection and insight, was crafted. The short SRIS showed strong dimensionality, reliability, item fit, and local independence as well as essentially no gender DIF. Taken together, the many psychometric strengths of the SRIS support its popularity, and the short form will be useful for research and applied contexts where an efficient, concise version is needed.

2021 ◽  
Author(s):  
Paul Silvia ◽  
Rebekah Rodriguez ◽  
James C. Kaufman ◽  
Roni Reiter-Palmon ◽  
Jeb S. Puryear

The original 90-item Creative Behavior Inventory (CBI) was a landmark self-report scale in creativity research, and the 28-item brief form developed nearly 20 years ago is a popular measure of everyday creativity. Relatively little is known, however, about the psychometric properties of this widely used scale. In the current research, we conduct a detailed psychometric investigation into the 28-item CBI by applying methods from item response theory using a sample of 2,082 adults. Our investigation revealed several strengths of the current scale: excellent reliability, suitable dimensionality, appropriate item difficulty, and reasonably good item discrimination. Several areas for improvement were highlighted as well: (1) the four-point response scale should probably have fewer options; (2) a handful of items showed gender-based differential item functioning, indicating some item bias; and (3) local dependence statistics revealed clusters of items that are probably redundant. These analyses support the continued use of the CBI for assessing engagement in everyday creative behaviors and suggest that the CBI could benefit from thoughtful revision.


2020 ◽  
Vol 34 (1) ◽  
pp. 40-63 ◽  
Author(s):  
Miles Bore ◽  
Kristin R. Laurens ◽  
Megan J. Hobbs ◽  
Melissa J. Green ◽  
Stacy Tzoumakis ◽  
...  

Prior investigations indicate that the five core personality dimensions (the “Big Five”) are measurable by middle childhood. The aim of this research was to examine the psychometric properties of a short-form self-report measure of the Big Five personality dimensions in children that would be suitable for administration online in large population-based studies. Twenty-five questionnaire items in English, derived from the 65-item Big Five Questionnaire for Children in Italian (Barbaranelli, Caprara, Rabasca, & Pastorelli, 2003), were completed online by 27,415 Australian children in Year 6 (mean age 11.92 years). An item response theory approach evaluated the psychometric properties and resolved a 20-item short-form questionnaire. Exploratory and confirmatory factor analyses supported the Big Five structure. Construct validity was demonstrated via correlations between Big Five scores and the Strengths and Difficulties Questionnaire subscales (Goodman, 2001). The 20 items provide a brief, reliable, and valid child self-report measure of the Big Five personality dimensions.


2020 ◽  
Vol 10 (11) ◽  
pp. 173
Author(s):  
Paul J. Silvia ◽  
Rebekah M. Rodriguez

The Humor Styles Questionnaire (HSQ) is one of the most popular self-report scales in humor research. The present research conducted a forward-looking psychometric analysis grounded in Rasch and item response theory models, which have not been applied to the HSQ thus far. Regarding strengths, the analyses found very good evidence for reliability and dimensionality and essentially zero gender-based differential item functioning, indicating no gender bias in the items. Regarding opportunities for future development, the analyses suggested that (1) the seven-point rating scale performs poorly relative to a five-point scale; (2) the affiliative subscale is far too easy to endorse and much easier than the other subscales; (3) the four subscales show problematic variation in their readability and proportion of reverse-scored items; and (4) a handful of items with poor discrimination and high local dependence are easy targets for scale revision. Taken together, the findings suggest that the HSQ, as it nears the two-decade mark, has many strengths but would benefit from light remodeling.


2013 ◽  
Vol 41 (1) ◽  
pp. 153-158 ◽  
Author(s):  
James F. Fries ◽  
James Witter ◽  
Matthias Rose ◽  
David Cella ◽  
Dinesh Khanna ◽  
...  

Objective.Patient-reported outcome (PRO) questionnaires record health information directly from research participants because observers may not accurately represent the patient perspective. Patient-reported Outcomes Measurement Information System (PROMIS) is a US National Institutes of Health cooperative group charged with bringing PRO to a new level of precision and standardization across diseases by item development and use of item response theory (IRT).Methods.With IRT methods, improved items are calibrated on an underlying concept to form an item bank for a “domain” such as physical function (PF). The most informative items can be combined to construct efficient “instruments” such as 10-item or 20-item PF static forms. Each item is calibrated on the basis of the probability that a given person will respond at a given level, and the ability of the item to discriminate people from one another. Tailored forms may cover any desired level of the domain being measured. Computerized adaptive testing (CAT) selects the best items to sharpen the estimate of a person’s functional ability, based on prior responses to earlier questions. PROMIS item banks have been improved with experience from several thousand items, and are calibrated on over 21,000 respondents.Results.In areas tested to date, PROMIS PF instruments are superior or equal to Health Assessment Questionnaire and Medical Outcome Study Short Form-36 Survey legacy instruments in clarity, translatability, patient importance, reliability, and sensitivity to change.Conclusion.Precise measures, such as PROMIS, efficiently incorporate patient self-report of health into research, potentially reducing research cost by lowering sample size requirements. The advent of routine IRT applications has the potential to transform PRO measurement.


2021 ◽  
pp. 019394592110159
Author(s):  
Wen Liu ◽  
Lilian Dindo ◽  
Katherine Hadlandsmyth ◽  
George Jay Unick ◽  
M. Bridget Zimmerman ◽  
...  

Little research has compared item functioning of the Patient-Reported Outcomes Measurement Information System (PROMIS®) anxiety short form 6a and the generalized anxiety disorder 7-item scale using item response theory models. This was a secondary analysis of self-reported assessments from 67 at-risk U.S. military veterans. The two measures performed comparably well with data fitting adequately to models, acceptable item discriminations, and item and test information curves being unimodal and symmetric. The PROMIS® anxiety short form 6a performed better in that item difficulty estimates had a wider range and distributed more evenly and all response categories had less floor effect, while the third category in most items of the generalized anxiety disorder 7-item scale were rarely used. While both measures may be appropriate, findings provided preliminary information supporting use of the PROMIS® anxiety short form 6a as potentially preferable, especially for veterans with low-to-moderate anxiety. Further testing is needed in larger, more diverse samples.


2020 ◽  
Author(s):  
Paul Silvia ◽  
Rebekah Rodriguez

The Humor Styles Questionnaire (HSQ) is one of the most popular self-report scales in humor research. The present research conducted a forward-looking psychometric analysis grounded in Rasch and item response theory models, which have not been applied to the HSQ thus far. Regarding strengths, the analyses found very good evidence for reliability and dimensionality and essentially zero gender-based differential item functioning, indicating no gender bias in the items. Regarding opportunities for future development, the analyses suggested that (1) the 7-point rating scale performs poorly relative to a 5-point scale; (2) the affiliative subscale is far too easy to endorse and much easier than the other subscales; (3) the four subscales show problematic variation in their readability and proportion of reverse-scored items; and (4) a handful of items with poor discrimination and high local dependence are easy targets for scale revision. Taken together, the findings suggest that the HSQ, as it nears the two-decade mark, has many strengths but would benefit from light remodeling.


2021 ◽  
Vol 12 ◽  
Author(s):  
Amy Shaw ◽  
Melissa Kapnek ◽  
Neil A. Morelli

Applying the graded response model within the item response theory framework, the present study analyzes the psychometric properties of Karwowski’s creative self-efficacy (CSE) scale. With an ethnically diverse sample of US college students, the results suggested that the six items of the CSE scale were well fitted to a latent unidimensional structure. The scale also had adequate measurement precision or reliability, high levels of item discrimination, and an appropriate range of item difficulty. Gender-based differential item functioning analyses confirmed that there were no differences in the measurement results of the scale concerning gender. Additionally, openness to experience was found to be positively related to the CSE scale scores, providing some support for the scale’s convergent validity. Collectively, these results confirmed the psychometric soundness of the CSE scale for measuring CSE and also identified avenues for future research.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yunsoo Lee ◽  
Ji Hoon Song ◽  
Soo Jung Kim

Purpose This paper aims to validate the Korean version of the decent work scale and examine the relationship between decent work and work engagement. Design/methodology/approach After completing translation and back translation, the authors surveyed 266 Korean employees from various organizations via network sampling. They assessed Rasch’s model based on item response theory. In addition, they used classical test theory to evaluate the decent work scale’s validity and reliability. Findings The authors found that the current version of the decent work scale has good validity, reliability and item difficulty, and decent work has a positive relationship with work engagement. However, based on item response theory, the assessment showed that three of the items are extremely similar to another item within the same dimension, implying that the items are unable to discriminate among individual traits. Originality/value This study validated the decent work scale in a Korean work environment using Rasch’s (1960) model from the perspective of item response theory.


Sign in / Sign up

Export Citation Format

Share Document