Psychometrics of art: Validation of RizbA, a quantitative rating instrument for pictorial expression

2021 ◽  
Author(s):  
Kerstin Schoch ◽  
Thomas Ostermann

Although art has been subject to psychological research for some time, the artwork itself has received little attention in quantitative research. The rating instrument for two-dimensional pictorial works (RizbA) fills this gap by providing a tool for formal picture analysis. This study validates the questionnaire on 294 images created by 147 non-artists. In an online test-retest study, the material was rated by 880 (T1) and 475 (T2) experts using RizbA. Statistical quality criteria, principal component analysis (PCA), and indices of factor similarity were computed. The overall test's capacity of differentiation yields a partial eta-squared of .28 (T1) and .33 (T2). Test-retest reliability is .93. PCA reveals an eight-factor solution. Tucker's coefficients of congruence range between |0.82| and |1.00|. Intraclass correlation coefficients are .81 (T1) and .84 (T2). Results indicate generalizability to amateurs' works. As the first reliable tool for picture analysis, RizbA allows a more detailed examination of art and its correlates.

2020 ◽  
Vol 7 (2) ◽  
pp. 373-410
Author(s):  
Kerstin Schoch ◽  
Thomas Ostermann

Abstract In empirical art psychology and creativity research, most studies focus on the psychological correlates of art. Only a few go beyond treating artworks as categorical data (e.g. abstract vs. representational) and consider artworks in detail. In part this is due to the lack of reliable quantitative measurements. The rating instrument for two-dimensional pictorial works (RizbA) makes a difference to current research designs. The current study validates the questionnaire on a representative sample of contemporary visual art, consisting of 318 images depicting works by artists from different cultural areas dated to the 21st century. In a randomized test-retest design, the pictorial material was rated by 506 (T1) and 238 (T2) art experts using RizbA. Statistical quality criteria, such as item difficulty, capacity of differentiation, test-retest reliability, and intraclass correlation, were calculated. Principal component analysis (PCA) and indices of factor similarity were computed. The overall test’s capacity for differentiation yields partial eta-squared of .31 (T1) and .40 (T2). Test-retest reliability is .86. PCA reveals an eight-factor solution, which is largely consistent across both measurement points. Tucker’s coefficient of congruence ranges between |.71| and |1.00|. Intraclass correlation coefficients are .86 (T1) and .73 (T2). This study indicates generalizability of the questionnaire to contemporary artworks. Although a conclusion on the factors’ structure cannot be drawn yet, results are very promising. As the first reliable quantitative tool for formal picture analysis, RizbA allows more detailed examination of visual art and its psychological correlates. This broadens research methodology by giving art greater weight in art psychology and creativity research.
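The abstract above reports Tucker's coefficient of congruence as its index of factor similarity between T1 and T2. As a rough illustration only, the coefficient for one pair of factors is the sum of products of the two loading vectors divided by the square root of the product of their sums of squares. The sketch below assumes that standard definition; the function name and the loading values are hypothetical and are not taken from the study.

```python
import math


def tucker_congruence(loadings_a, loadings_b):
    """Tucker's coefficient of congruence between two factor-loading vectors."""
    if len(loadings_a) != len(loadings_b):
        raise ValueError("loading vectors must have the same length")
    cross = sum(a * b for a, b in zip(loadings_a, loadings_b))
    norm = math.sqrt(
        sum(a * a for a in loadings_a) * sum(b * b for b in loadings_b)
    )
    if norm == 0:
        raise ValueError("loading vectors must not be all zeros")
    return cross / norm


# Hypothetical loadings of four items on the "same" factor at T1 and T2.
t1_loadings = [0.80, 0.70, 0.10, -0.20]
t2_loadings = [0.75, 0.72, 0.05, -0.25]
phi = tucker_congruence(t1_loadings, t2_loadings)
print(f"congruence = {phi:.3f}")
```

The coefficient is insensitive to the overall scale of the loadings and flips sign when a factor is reflected, which is presumably why the abstract reports absolute values.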


1997 ◽  
Vol 64 (5) ◽  
pp. 270-276 ◽  
Author(s):  
Johanne Desrosiers ◽  
Annie Rochette ◽  
Réjean Hébert ◽  
Gina Bravo

Several dexterity tests have been developed, including the Minnesota Rate of Manipulation Test (MRMT) and a new version, the Minnesota Manual Dexterity Test (MMDT). The objectives of the study were: a) to verify the test-retest reliability of the MMDT; b) to compare the MRMT and the MMDT; c) to study the concurrent validity of the MMDT; and d) to establish reference values for elderly people with the MMDT. Two hundred and forty-seven community-living healthy elderly were evaluated with the MMDT, and two other dexterity tests, the Box and Block Test (BBT) and the Purdue Pegboard (PP). Thirty-five of them were evaluated twice with the MMDT and 44 were evaluated with both the MMDT and MRMT. The results show that the test-retest reliability of the MMDT is acceptable to high (intraclass correlation coefficients of 0.79 to 0.87, depending on the subtest) and the validity of the test is demonstrated by significant correlations between the MMDT, the BBT and the PP (0.63 to 0.67). There is a high correlation (0.85 to 0.95) between the MMDT and the MRMT in spite of different results. The reference values will help occupational therapists to differentiate better between real dexterity difficulties and those that may be attributed to normal aging.


2008 ◽  
Vol 22 (6) ◽  
pp. 737-744 ◽  
Author(s):  
I-Ping Hsueh ◽  
Miao-Ju Hsu ◽  
Ching-Fan Sheu ◽  
Su Lee ◽  
Ching-Lin Hsieh ◽  
...  

Objective. To provide empirical justification for selecting motor scales for stroke patients, the authors compared the psychometric properties (validity, responsiveness, test-retest reliability, and smallest real difference [SRD]) of the Fugl-Meyer Motor Scale (FM), the simplified FM (S-FM), the Stroke Rehabilitation Assessment of Movement instrument (STREAM), and the simplified STREAM (S-STREAM). Methods. For the validity and responsiveness study, 50 inpatients were assessed with the FM and the STREAM at admission and discharge to a rehabilitation department. The scores of the S-FM and the S-STREAM were retrieved from their corresponding scales. For the test-retest reliability study, a therapist administered both scales on a different sample of 60 chronic patients on 2 occasions. Results. Only the S-STREAM had no notable floor or ceiling effects at admission and discharge. The 4 motor scales had good concurrent validity (rho ≥ .91) and satisfactory predictive validity (rho = .72-.77). The scales showed responsiveness (effect size d ≥ 0.34; standardized response mean ≥ 0.95; P < .0001), with the S-STREAM most responsive. The test-retest agreements of the scales were excellent (intraclass correlation coefficients ≥ .96). The SRD of the 4 scales was 10% of their corresponding highest score, indicating acceptable level of measurement error. The upper extremity and the lower extremity subscales of the 4 showed similar results. Conclusions. The 4 motor scales showed acceptable levels of reliability, validity, and responsiveness in stroke patients. The S-STREAM is recommended because it is short, responsive to change, and able to discriminate patients with severe or mild stroke.
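The responsiveness indices named in the abstract above (effect size d and standardized response mean) are commonly defined from paired admission and discharge scores. The sketch below assumes those common definitions: mean change divided by the standard deviation of the admission scores for the effect size, and mean change divided by the standard deviation of the change scores for the standardized response mean. The abstract does not state which formulas the authors used, and the function name and scores here are hypothetical.

```python
import statistics


def responsiveness(admission, discharge):
    """Return (effect size, standardized response mean) for paired scores."""
    if len(admission) != len(discharge):
        raise ValueError("admission and discharge must be paired")
    changes = [after - before for before, after in zip(admission, discharge)]
    mean_change = statistics.mean(changes)
    # Effect size: mean change relative to the spread of baseline scores.
    effect_size = mean_change / statistics.stdev(admission)
    # Standardized response mean: mean change relative to the spread of changes.
    srm = mean_change / statistics.stdev(changes)
    return effect_size, srm


# Hypothetical motor-scale scores for three patients.
es, srm = responsiveness([10, 20, 30], [12, 24, 33])
print(f"effect size = {es:.2f}, SRM = {srm:.2f}")
```

Because the two indices use different denominators, they can rank the same set of scales differently.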


2021 ◽  
Vol 12 ◽  
Author(s):  
Wei Xia ◽  
William Ho Cheung Li ◽  
Tingna Liang ◽  
Yuanhui Luo ◽  
Laurie Long Kwan Ho ◽  
...  

Objectives: This study conducted a linguistic and psychometric evaluation of the Chinese Counseling Competencies Scale-Revised (CCS-R). Methods: The Chinese CCS-R was created from the original English version using a standard forward-backward translation process. The psychometric properties of the Chinese CCS-R were examined in a cohort of 208 counselors-in-training by two independent raters. Fifty-three counselors-in-training were asked to undergo another counseling performance evaluation for the test-retest. The confirmatory factor analysis (CFA) was conducted for the Chinese CCS-R, followed by internal consistency, test-retest reliability, inter-rater reliability, convergent validity, and concurrent validity. Results: The results of the CFA supported the factorial validity of the Chinese CCS-R, with adequate construct replicability. The scale had a McDonald's omega of 0.876, and intraclass correlation coefficients of 0.63 and 0.90 for test-retest reliability and inter-rater reliability, respectively. Significantly positive correlations were observed between the Chinese CCS-R score and scores of performance checklist (Pearson's γ = 0.781), indicating a large convergent validity, and knowledge on drug abuse (Pearson's γ = 0.833), indicating a moderate concurrent validity. Conclusion: The results support that the Chinese CCS-R is a valid and reliable measure of the counseling competencies. Practice implication: The CCS-R provides trainers with a reliable tool to evaluate counseling students' competencies and to facilitate discussions with trainees about their areas for growth.


1999 ◽  
Vol 8 (4) ◽  
pp. 254-261 ◽  
Author(s):  
J Powers ◽  
SJ Bennett

BACKGROUND: Dyspnea, or difficult breathing, is common in patients receiving mechanical ventilation; however, dyspnea is not routinely or systematically measured. OBJECTIVE: The primary purpose of this methodological study was to evaluate the test-retest reliability of 5 dyspnea rating scales and the criterion validity of 4 dyspnea rating scales in patients receiving mechanical ventilation. The secondary purpose was to examine the correlations between each of these 5 rating scales and physiological measures of respiratory function. METHODS: The convenience sample consisted of 28 patients on mechanical ventilation during their hospitalization in the intensive care units of a large, inner-city hospital. Patients rated their dyspnea twice at 30-minute intervals on the visual analogue scale, the vertical analogue dyspnea scale, the modified Borg scale, the numerical scale, and the faces scale. Test-retest reliability was computed by using the intraclass correlation coefficient. Criterion validity was evaluated by using the Spearman rank-order correlation coefficient. RESULTS: The 5 rating scales had acceptable test-retest reliabilities, with intraclass correlation coefficients ranging from 0.81 to 0.97. Criterion validity of the 4 scales also was acceptable, with Spearman rank-order correlation coefficients from 0.76 to 0.96. The rating scales were not correlated with most of the physiological variables. At least half of the patients reported moderate to severe dyspnea. CONCLUSION: The scales showed acceptable reliability and validity, and they will be useful in quantifying dyspnea experienced by patients receiving mechanical ventilation. Further work is needed to evaluate the extent and the severity of dyspnea in such patients in order to evaluate the effectiveness of interventions.


2002 ◽  
Vol 82 (4) ◽  
pp. 364-371 ◽  
Author(s):  
Douglas P Gross ◽  
Michele C Battié

Abstract Background and Purpose. Functional capacity evaluations (FCEs) are measurement tools used in predicting readiness to return to work following injury. The interrater and test-retest reliability of determinations of maximal safe lifting during kinesiophysical FCEs were examined in a sample of people who were off work and receiving workers' compensation. Subjects. Twenty-eight subjects with low back pain who had plateaued with treatment were enrolled. Five occupational therapists, trained and experienced in kinesiophysical methods, conducted testing. Methods. A repeated-measures design was used, with raters testing subjects simultaneously, yet independently. Subjects were rated on 2 occasions, separated by 2 to 4 days. Analyses included intraclass correlation coefficients (ICCs) and 95% confidence intervals. Results. The ICC values for interrater reliability ranged from .95 to .98. Test-retest values ranged from .78 to .94. Discussion and Conclusion. Inconsistencies in subjects' performance across sessions were the greatest source of FCE measurement variability. Overall, however, test-retest reliability was good and interrater reliability was excellent.


2006 ◽  
Vol 86 (8) ◽  
pp. 1107-1117 ◽  
Author(s):  
Olaf Verschuren ◽  
Tim Takken ◽  
Marjolijn Ketelaar ◽  
Jan Willem Gorter ◽  
Paul JM Helders

Abstract Background and Purpose. The purpose of this study was to examine the reliability and validity of data obtained with 2 newly developed shuttle run tests (SRT-I and SRT-II) to measure aerobic power in children with cerebral palsy (CP) who were classified at level I or II on the Gross Motor Function Classification System (GMFCS). The SRT-I was developed for children at GMFCS level I, and the SRT-II was developed for children at GMFCS level II. Subjects. Twenty-five children and adolescents with CP (10 female, 15 male; mean age=11.9 years, SD=2.9), classified at GMFCS level I (n=14) or level II (n=11), participated in the study. Methods. To assess test-retest reliability of data for the 10-m shuttle run tests, the subjects performed the same test within 2 weeks. To examine validity, the shuttle run tests were compared with a GMFCS level–based treadmill test designed to measure peak oxygen uptake. Results. Statistical analyses revealed test-retest reliability for exercise time (number of levels completed) (intraclass correlation coefficients of .97 for the SRT-I and .99 for the SRT-II) and reliability for peak heart rate attained during the final level (intraclass correlation coefficients of .87 for the SRT-I and .94 for the SRT-II). High correlations were found for the relationship between data for both shuttle run tests and data for the treadmill test (r=.96 for both). Discussion and Conclusion. The results suggest that both 10-m shuttle run tests yield reliable and valid data. Moreover, the shuttle run tests have advantages over a treadmill test for children with CP who are able to walk and run (GMFCS level I or II). [Verschuren O, Takken T, Ketelaar M, et al. Reliability and validity of data for 2 newly developed shuttle run tests in children with cerebral palsy. Phys Ther. 2006;86:1107–1117.]


2018 ◽  
Vol 6 (s2) ◽  
pp. S252-S263 ◽  
Author(s):  
Lisa M. Barnett ◽  
Owen Makin

Assessing young children’s perceptions is commonly done one on one with an interviewer. An app enables several children to complete the scale at once. The objective was to describe an app to assess children’s perceptions of movement competence and then present consistency of child responses. The Pictorial Scale of Perceived Movement Skill Competence (PMSC) has fundamental movement skill (FMS; e.g., catch) and play items (e.g., cycling). The PMSC Android app has the same items and images, but children complete it independently with audio. Intraclass correlation coefficients (ICC) assessed i) test-retest reliability using the PMSC app on 18 items in 42 children (M = 6.8 yrs) and ii) consistency between measures for 13 FMS items in 44 children (M = 8.5 yrs). Over time (M = 6.9 days, SD = 0.35) the full PMSC had good consistency (ICC = 0.79, 95% CI 0.64–0.88) and the FMS items had moderate consistency (ICC = 0.68, 95% CI 0.47–0.81). There was good agreement between the app and interview for FMS items (ICC = 0.86, 95% CI 0.76–0.92). Locomotor items were less consistent. The PMSC app can generally be recommended. Future research could investigate how different forms of digital assessment affect children’s perception.


2020 ◽  
Vol 47 (4) ◽  
pp. 479-486
Author(s):  
Yuki Kondo ◽  
Kyota Bando ◽  
Yosuke Ariake ◽  
Wakana Katsuta ◽  
Kyoko Todoroki ◽  
...  

BACKGROUND: The Balance Evaluation Systems Test (BESTest) and its two abbreviated versions have been confirmed as reliable measures of balance characteristics. However, they have not been used in persons with spinocerebellar ataxia (SCA). OBJECTIVE: We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BESTest and its abbreviated versions in persons with mild to moderate spinocerebellar ataxia. METHODS: The BESTest was performed in 20 persons with SCA at baseline and one month later. The scores of the abbreviated versions of the BESTest were determined from the BESTest scores. The intraclass correlation coefficient (1,1) was used as a measure of relative reliability. Furthermore, we calculated the MDC in the BESTest and its abbreviated versions. RESULTS: The intraclass correlation coefficients (1,1) and MDC at the 95% confidence level were 0.92 and 8.7 (8.1%), 0.91 and 4.1 (14.5%), and 0.81 and 5.2 (21.6%) for the Balance, Mini-Balance, and Brief-Balance Evaluation Systems Tests, respectively. CONCLUSIONS: The BESTest and its abbreviated versions had high test-retest reliability. The MDC values of the BESTest could enable clinicians and researchers to interpret changes in the balance of patients with SCA more precisely.
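The two statistics reported in the abstract above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: ICC(1,1) is computed from a one-way random-effects ANOVA, and the MDC at the 95% level uses the widely cited formula 1.96 × √2 × SEM with SEM = SD × √(1 − ICC). The abstract does not say which standard deviation entered the SEM, so the `sd` argument, the function names, and all scores below are hypothetical.

```python
import math


def icc_1_1(scores):
    """ICC(1,1) from a list of per-subject tuples of k repeated measurements."""
    n = len(scores)
    k = len(scores[0])
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need >= 2 subjects with the same number (>= 2) of trials")
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    subject_means = [sum(row) / k for row in scores]
    # Between-subjects and within-subjects mean squares (one-way ANOVA).
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(scores, subject_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


def mdc95(sd, icc):
    """Minimal detectable change at the 95% level, assuming SEM = sd * sqrt(1 - icc)."""
    sem = sd * math.sqrt(1.0 - icc)
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical baseline and one-month scores for four persons.
example_scores = [(10, 12), (20, 18), (30, 31), (40, 39)]
icc = icc_1_1(example_scores)
print(f"ICC(1,1) = {icc:.3f}, MDC95 = {mdc95(10.0, icc):.2f}")
```

A change smaller than the MDC cannot be distinguished from measurement error at the chosen confidence level, which is how the abstract's MDC values would be read in practice.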

