Test-Retest Reliability of the HEXACO-PI-R

2021 ◽  
Author(s):  
Samuel Henry ◽  
Isabel Thielmann ◽  
Tom Booth ◽  
René Mõttus

Despite the widespread use of the HEXACO as a descriptive taxonomy of human personality, there remains very limited information on the test-retest reliability of commonly used tools to measure the six traits. We report 12-day test-retest reliability (rTT) of the 100-item HEXACO-PI-R (HEXACO-100) at the level of domains, facets and items. We compare test-retest estimates to internal consistency for domains and facets, and to cross-rater agreement (rCA) for all levels of measurement. Median rTTs were r = .65, .81, and .88 (n = 416) for items, facets, and domains, respectively. Facets’ rCAs were highly correlated with rTTs but not with internal consistency estimates. We conclude that the HEXACO-100 demonstrates rTT similar to other contemporary measures, and that rTT data should be routinely collected for scales.

PLoS ONE ◽  
2022 ◽  
Vol 17 (1) ◽  
pp. e0262465
Author(s):  
Sam Henry ◽  
Isabel Thielmann ◽  
Tom Booth ◽  
René Mõttus

Despite the widespread use of the HEXACO model as a descriptive taxonomy of personality traits, there remains limited information on the test-retest reliability of its commonly-used inventories. Studies typically report internal consistency estimates, such as alpha or omega, but there are good reasons to believe that these do not accurately assess reliability. We report 13-day test-retest correlations of the 100- and 60-item English HEXACO Personality Inventory-Revised (HEXACO-100 and HEXACO-60) domains, facets, and items. In order to test the validity of test-retest reliability, we then compare these estimates to correlations between self- and informant-reports (i.e., cross-rater agreement), a widely-used validity criterion. Median estimates of test-retest reliability were .88, .81, and .65 (N = 416) for domains, facets, and items, respectively. Facets’ and items’ test-retest reliabilities were highly correlated with their cross-rater agreement estimates, whereas internal consistencies were not. Overall, the HEXACO Personality Inventory-Revised demonstrates test-retest reliability similar to other contemporary measures. We recommend that short-term retest reliability should be routinely calculated to assess reliability.
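
The two kinds of estimate contrasted in this abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function names `pearson_r` and `cronbach_alpha` and the toy scores are assumptions made for illustration. Test-retest reliability is computed as the Pearson correlation between two administrations, and internal consistency as Cronbach's alpha from a single administration.

```python
import math
from statistics import mean, variance


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists
    (e.g. the same scale at time 1 and time 2)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def cronbach_alpha(rows):
    """Cronbach's alpha; `rows` holds one list of k item scores per respondent."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical toy data, not taken from the study.
time1 = [3.0, 4.5, 2.0, 5.0, 3.5]
time2 = [3.2, 4.4, 2.5, 4.8, 3.1]
print(round(pearson_r(time1, time2), 3))
```

Alpha depends only on how items covary within one session, whereas the retest correlation depends on how stable scores are across sessions, which is why the two can diverge.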


PLoS ONE ◽  
2021 ◽  
Vol 16 (11) ◽  
pp. e0258123
Author(s):  
Nour Amin Elsahoryi ◽  
Gina Trakman ◽  
Ayah Al Kilani

Background Nutrition knowledge (NK) is a modifiable determinant of diet intake and can positively influence athletic performance. This study aimed to (1) adapt and translate a validated general and sports NK questionnaire into Arabic, (2) assess the NK of Jordanian sportspeople, and (3) evaluate the relationship between NK and various sociodemographic factors. Methods The Abridged Nutrition for Sport Knowledge Questionnaire (ANSKQ) was translated into Arabic using forward-backward translation and underwent pilot testing and psychometric validation (internal consistency, test-retest reliability, inter-rater agreement) using a convenience sample of 30 individuals. Following ANSKQ validation, athletes from 50 sport institutes in Jordan were invited (via email) to complete the Arabic ANSKQ online. Differences in NK based on demographics were analysed using t-test or ANOVA for continuous variables and chi-square tests for categorical variables. The ability of demographic factors to predict NK score-category (poor/good/average/excellent) was assessed using multivariate logistic regression. Results The Arabic ANSKQ had excellent internal consistency (Cronbach’s alpha = 0.92), test-retest reliability (Pearson r = 0.926) and inter-rater agreement (Cohen’s k statistic = 0.89). A total of 3636 eligible participants completed the Arabic ANSKQ. Participants were mostly athletes (91.4%), female (68.0%), had normal BMI (50.6%), and played high-intensity sports (59.6%). Most participants (88.3%) had poor NK (<50%). There were statistically significant differences in NK score based on participant role (athlete vs coach), age, gender, BMI, nationality, smoking, years playing sport, sport frequency, sport intensity, and nutrition training. Multivariate modelling showed participant role, BMI, education level, sport frequency and nutrition training were predictors of NK category.
Conclusions Jordanian sportspeople have poor NK and may benefit from increased nutrition training.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Amin Kordi Yoosefinejad ◽  
Fatemeh Karjalian ◽  
Marzieh Momennasab ◽  
Shahrokh Ezzatzadegan Jahromi

Abstract Background Hemodialysis is considered a major therapeutic method for patients with chronic kidney disease. Pruritus is a common complaint of hemodialysis patients. The 5-D pruritus scale is among the most common tools to evaluate several dimensions of itch. Psychometric properties of the 5-D scale have not been evaluated in Persian-speaking patients undergoing hemodialysis; hence, the objective of this study was to assess the reliability and validity of the Persian version of the scale. Methods Ninety hemodialysis patients (men: 50, women: 40, mean age: 54.4 years) participated in this cross-sectional study. The final Persian version of the 5-D scale was given to the participants. One-third of the participants completed the scale twice, 3–7 days apart, to evaluate test-retest reliability. Other psychometric properties, including internal consistency, absolute reliability, convergent, discriminative and construct validity, and floor/ceiling effects, were also evaluated. Results The Persian 5-D scale has strong test-retest reliability (ICC = 0.98) and internal consistency (Cronbach’s alpha = 0.99). Standard error of measurement and minimal detectable change were 0.33 and 0.91, respectively. Regarding convergent validity, the scale had moderate correlation with a numeric rating scale (r = 0.67) and an itch-related quality of life questionnaire (r = 0.59). Exploratory factor analysis revealed two factors within the scale. No floor or ceiling effect was found for the scale. Conclusion The Persian version of the 5-D itch scale is a brief instrument with acceptable reliability and validity. Therefore, the scale could be used by experts, nurses, and other health service providers to evaluate pruritus among Persian-speaking hemodialysis patients.
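
The abstract does not say how the standard error of measurement (SEM) and minimal detectable change (MDC) were derived. The sketch below uses the conventional formulas, SEM = SD × √(1 − ICC) and MDC95 = 1.96 × √2 × SEM, as an assumption; the function names are hypothetical. Under that assumption, the reported SEM of 0.33 gives an MDC of about 0.91, which matches the reported value.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the score SD and a reliability
    coefficient (conventional formula, assumed rather than taken from the paper)."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * math.sqrt(2.0) * sem


# SEM reported in the abstract; the result rounds to the reported MDC.
print(round(mdc95(0.33), 2))
```

The √2 term reflects that a change score combines the measurement error of two administrations.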


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Liyuan Cui ◽  
Yaxin Zhu ◽  
Jinglou Qu ◽  
Liming Tie ◽  
Ziqi Wang ◽  
...  

Abstract Background Critical thinking disposition helps medical students and professionals overcome the effects of personal values and beliefs when exercising clinical judgment. The lack of effective instruments to measure critical thinking disposition in medical students has become an obstacle for training and evaluating students in undergraduate programs in China. The aim of this study was to evaluate the psychometric properties of the CTDA test. Methods A total of 278 students participated in this study and responded to the CTDA test. Cronbach’s α coefficient, internal consistency, test-retest reliability, floor effects and ceiling effects were measured to assess the reliability of the questionnaire. Construct validity of the pre-specified three-domain structure of the CTDA was evaluated by exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The convergent validity and discriminant validity were also analyzed. Results Cronbach’s alpha coefficient for the entire questionnaire was calculated to be 0.92, all of the domains showed acceptable internal consistency (0.81–0.86), and the test-retest reliability indicated acceptable intra-class correlation coefficients (ICCs) (0.93, p < 0.01). The EFA and the CFA demonstrated that the three-domain model fitted the data adequately. The test showed satisfactory convergent and discriminant validity. Conclusions The CTDA is a reliable and valid questionnaire to evaluate the disposition of medical students towards critical thinking in China and can reasonably be applied in critical thinking programs and medical education research.


2021 ◽  
Author(s):  
Qi Zhang ◽  
Ke Zhang ◽  
Miao Li ◽  
Jiaxin Gu ◽  
Xintong Li ◽  
...  

Abstract Objectives To examine the validity and reliability of the Mandarin version of the Treatment Burden Questionnaire (TBQ) among stroke patients. Background Stroke patients need long-term management of symptoms and life situation, and treatment burden has recently emerged as a new concept that can influence health outcomes during the rehabilitation process. Methods Convenience sampling was used to recruit 187 stroke patients in a tertiary hospital in Tianjin for a formal investigation. Item analysis, reliability and validity tests were carried out. The reliability test included internal consistency and test–retest reliability. The validity test covered content, structural and convergent validity. Results Of the 187 completed questionnaires, 180 (96.3%) were suitable for analysis. According to the experts’ evaluation, the I-CVI of each item ranged from 0.833 to 1.000, and the S-CVI was 0.967. The exploratory factor analysis yielded three factors with a cumulative explained variance of 53.054%. Convergent validity was demonstrated using Morisky’s Medication Adherence Scale 8 (r = –0.450, P < 0.01). All correlations between items and global scores ranged from 0.403 to 0.638. Internal consistency reliability and test–retest reliability were found to be acceptable, as indicated by a Cronbach’s α of 0.824 and an intraclass correlation coefficient of 0.846, respectively. Conclusions The Mandarin TBQ had acceptable validity and reliability. Using the TBQ to assess the treatment burden of stroke survivors may benefit health resource allocation and help tailor therapeutic interventions to construct minimally disruptive care.


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Marco Monticone ◽  
Cristiano Sconza ◽  
Igor Portoghese ◽  
Tomohiko Nishigami ◽  
Benedict M. Wand ◽  
...  

Abstract Background and aim Growing attention is being given to utilising physical function measures to better understand and manage knee osteoarthritis (OA). The Fremantle Knee Awareness Questionnaire (FreKAQ), a self-reported measure of body-perception specific to the knee, has never been validated in Italian patients. The aims of this study were to culturally adapt and validate the Italian version of the FreKAQ (FreKAQ-I), to allow for its use with Italian-speaking patients with painful knee OA. Methods The FreKAQ-I was developed by means of forward–backward translation, a final review by an expert committee and a test of the pre-final version to evaluate its comprehensibility. The psychometric testing included: internal structural validity by Rasch analysis; construct validity by assessing hypotheses of FreKAQ correlations with the knee injury and osteoarthritis outcome score (KOOS), a pain intensity numerical rating scale (PI-NRS), the pain catastrophising scale (PCS), and the Hospital anxiety and depression score (HADS) (Pearson’s correlations); known-group validity by evaluating the ability of FreKAQ scores to discriminate between two groups of participants with different clinical profiles (Mann–Whitney U test); reliability by internal consistency (Cronbach’s alpha) and test–retest reliability (intraclass correlation coefficient, ICC2.1); and measurement error by calculating the minimum detectable change (MDC). Results It took one month to develop a consensus-based version of the FreKAQ-I. The questionnaire was administered to 102 subjects with painful knee OA and was well accepted. Internal structural validity confirmed the substantial unidimensionality of the FreKAQ-I: variance explained was 53.3%, the unexplained variance in the first contrast showed an eigenvalue of 1.8, and no local dependence was detected. 
Construct validity was good as all of the hypotheses were met; correlations: KOOS (rho = 0.38–0.51), PI-NRS (rho = 0.35–0.37), PCS (rho = 0.47) and HADS (Anxiety rho = 0.36; Depression rho = 0.43). Regarding known-groups validity, FreKAQ scores were significantly different between groups of participants demonstrating high and low levels of pain intensity, pain catastrophising, anxiety, depression and the four KOOS subscales (p ≤ 0.004). Internal consistency was acceptable (α = 0.74) and test–retest reliability was excellent (ICC = 0.92, CI 0.87–0.94). The MDC95 was 5.22 scale points. Conclusion The FreKAQ-I is unidimensional, reliable and valid in Italian patients with painful knee OA. Its use is recommended for clinical and research purposes.
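
The abstract names ICC2.1 (two-way random effects, absolute agreement, single measurement) as its test-retest statistic. A minimal sketch of that coefficient, computed from two-way ANOVA mean squares, is given below. The function name `icc_2_1` and the small ratings matrix are illustrative assumptions, not data from the study; in a test-retest design each column would be one administration.

```python
def icc_2_1(scores):
    """ICC(2,1) from an n-subjects x k-occasions table (list of rows)."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Sums of squares for subjects (rows), occasions (columns) and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Illustrative ratings: 6 subjects (rows) by 4 occasions or raters (columns).
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))
```

Because the column mean square enters the denominator, systematic shifts between occasions lower this coefficient, unlike a consistency-type ICC.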


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Widjane Sheila Ferreira Goncalves ◽  
Rebecca Byrne ◽  
Pedro Israel Cabral de Lira ◽  
Marcelo Tavares Viana ◽  
Stewart G. Trost

Abstract Background Childhood obesity has increased remarkably in low- and middle-income countries (LMICs). Movement behaviors (physical activity, screen time, and sleep) are crucial in the development of overweight and obesity in young children. Yet, few studies have investigated the relationship between children’s movement behaviors and parenting practices because validated measures for use among families from LMICs are lacking. This study evaluated the psychometric properties of previously validated measures of young children’s physical activity, screen time, and sleep and parenting practices, translated and culturally adapted to Brazilian families. Methods A total of 78 parent-child dyads completed an interviewer-administered survey twice within 7 days. Child physical activity, sedentary time and sleep were concurrently measured using a wrist-worn accelerometer. Internal consistency and test-retest reliability were assessed using McDonald’s omega and intraclass correlation coefficients (ICCs), respectively. Concurrent validity was evaluated by calculating Spearman correlations between parent-reported child behaviors and accelerometer-measured behaviors. Results Seventeen of the 19 parenting practices scales exhibited acceptable internal consistency reliability (Ω ≥ 0.70). Test-retest reliability ICCs were acceptable and ranged from 0.82 to 0.99. Parent-reported child physical activity was positively correlated with objectively measured total movement (rho = 0.29–0.46, p < .05) and energetic play (rho = 0.29–0.40, p < .05). Parent-reported child screen time was positively correlated with objectively measured sedentary time (rho = 0.26, p < .05), and inversely correlated with total movement (rho = −0.39 to −0.41, p < .05) and energetic play (rho = −0.37 to −0.41, p < .05). Parent-reported night-time sleep duration was significantly correlated with accelerometer-measured sleep duration on weekdays (rho = 0.29, p < .05), but not weekends.
Conclusions Measurement tools to assess children’s movement behaviors and parenting practices, translated and culturally adapted for use in Brazilian families, exhibited acceptable evidence of concurrent validity, internal consistency, and test-retest reliability.


Author(s):  
Helmut Schröder ◽  
Isaac Subirana ◽  
Julia Wärnberg ◽  
María Medrano ◽  
Marcela González-Gross ◽  
...  

Abstract Background Validation of self-reported tools, such as physical activity (PA) questionnaires, is crucial. The aim of this study was to determine test-retest reliability, internal consistency, and the concurrent, construct, and predictive validity of the short semi-quantitative Physical Activity Unit 7 item Screener (PAU-7S), using accelerometry as the reference measurement. The effect of linear calibration on PAU-7S validity was tested. Methods A randomized sample of 321 healthy children aged 8–16 years (149 boys, 172 girls) from the nationwide representative PASOS study completed the PAU-7S before and after wearing an accelerometer for at least 7 consecutive days. Weight, height, and waist circumference were measured. Cronbach alpha was calculated for internal consistency. Test-retest reliability was determined by intra-class correlation (ICC). Concurrent validity was assessed by ICC and Spearman correlation coefficient between moderate to vigorous PA (MVPA) derived by the PAU-7S and by accelerometer. Concordance between both methods was analyzed by absolute agreement, weighted kappa, and Bland-Altman statistics. Multiple linear regression models were fitted for construct validity and predictive validity was determined by leave-one-out cross-validation. Results The PAU-7S overestimated MVPA by 18%, compared to accelerometers (106.5 ± 77.0 vs 95.2 ± 33.2 min/day, respectively). A Cronbach alpha of 0.76 showed an acceptable internal consistency of the PAU-7S. Test-retest reliability was good (ICC 0.71 p < 0.001). Spearman correlation and ICC coefficients of MVPA derived by the PAU-7S and accelerometers increased from 0.31 to 0.62 and 0.20 to 0.62, respectively, after calibration of the PAU-7S. Between-methods concordance improved from a weighted kappa of 0.24 to 0.50 after calibration. A slight reduction in ICC, from 0.62 to 0.60, yielded good predictive validity. 
Multiple linear regression models showed an inverse association of MVPA with standardized body mass index (β − 0.162; p < 0.077) and waist to height ratio (β − 0.010; p < 0.014). All validity dimensions were somewhat stronger in boys compared to girls. Conclusion The PAU-7S shows good test-retest reliability and acceptable internal consistency. All dimensions of validity increased from poor/fair to moderate/good after calibration. The PAU-7S is a valid instrument for measuring MVPA in children and adolescents. Trial registration ISRCTN34251612.
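
The Bland-Altman statistics mentioned in the Methods summarise concordance as a mean difference (bias) with 95% limits of agreement. The sketch below shows that computation under the usual bias ± 1.96 SD convention. The function name `bland_altman` and the sample values are hypothetical and do not reproduce the study's data.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample SD of the paired differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical MVPA values in min/day: questionnaire vs accelerometer.
questionnaire = [110.0, 95.0, 130.0, 80.0, 120.0]
accelerometer = [100.0, 90.0, 105.0, 85.0, 98.0]
bias, lower, upper = bland_altman(questionnaire, accelerometer)
print(round(bias, 1), round(lower, 1), round(upper, 1))
```

A positive bias here would mean the questionnaire reports more activity than the accelerometer, which is the direction of the overestimation described in the Results.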


Hand Therapy ◽  
2021 ◽  
pp. 175899832110345
Author(s):  
E Lanfranchi ◽  
T Fairplay ◽  
P Arcuri ◽  
M Lando ◽  
F Marinelli ◽  
...  

Introduction Several general hand functional assessment tools for Dupuytren’s disease have been reported, but none of the patient-reported-outcome measures specific to Dupuytren’s disease-associated disabilities are available in the Italian language. The purpose of this study was to culturally adapt the Unité Rhumatologique des Affections de la Main (URAM) into Italian (URAM-I) and determine its measurement properties. Methods Cross-cultural adaptation was performed according to the current guidelines. Construct validity (convergent and divergent validity) was measured by comparing the URAM-I with the Patient-Rated Wrist/Hand Evaluation (PRWHE-I), Short-Form 36 (SF-36-I) scale and finger range of motion, respectively. Factor analysis was used to investigate the URAM-I’s internal structure. Reliability was assessed by internal consistency (Cronbach’s alpha) and test-retest reliability by Intra-Class Correlation Coefficient (ICC). Results This study included 96 patients (males = 85%, age = 66.8 ± 9.3). Due to the cultural adaptation, we divided the original item #1 into two separate items, thus generating the URAM-I(10). Convergent validity analysis showed a strong positive (r = 0.67), significant (p < 0.01) Pearson’s correlation with the PRWHE-I. Divergent validity analysis showed a weak, negative (r < 0.3) and not significant correlation with the SF-36-I subscales, except for the physical pain subscale (r = −0.21, p < 0.05). Factor analysis revealed a 2-factor, 4-item solution that explained 76% of the total variance. The URAM-I(10) demonstrated high internal consistency (α = 0.94) and high test-retest reliability (ICC = 0.97). Conclusion The URAM-I(10) demonstrates moderate construct validity, high internal consistency and test-retest reliability, and showed a 2-factor internal structure. Its evaluative use can be suggested for the Italian Dupuytren’s population.


Scientifica ◽  
2017 ◽  
Vol 2017 ◽  
pp. 1-6 ◽  
Author(s):  
Mostafa Sadeghi ◽  
Homayoun Sadeghi-Bazargani ◽  
Shahrokh Amiri

Background. The Barkley Adult Attention Deficit/Hyperactivity Disorder (ADHD) Rating Scale-IV (BAARS-IV) has demonstrated good psychometric properties. The BAARS-IV includes 27 questions on the symptoms of adult ADHD. The purpose of the present study was to investigate the psychometric properties of the Persian version of the BAARS-IV among older adults in Tabriz City. Method. This cross-sectional study was conducted in Tabriz City, in the west of Iran, in 2015 and enrolled 121 older adults. We translated and adapted the BAARS-IV and examined its concurrent validity, internal consistency, and test-retest reliability. Result. The BAARS-IV demonstrated good internal consistency and test-retest reliability. Correlations between the BAARS-IV and the CAARS-S: SV were high, supporting concurrent validity. Cronbach’s alpha for the overall scale and subscales stood at 0.89, 0.81, 0.66, 0.56, and 0.82, respectively. Conclusion. The Persian BAARS-IV showed acceptable reliability and validity. BAARS-IV was determined to be composed of internally consistent and psychometrically sound items.

