scholarly journals Psychometric Properties of the NIH Toolbox Cognition Battery in Healthy Older Adults: Reliability, Validity, and Agreement with Standard Neuropsychological Tests

2019 ◽  
Vol 25 (08) ◽  
pp. 857-867 ◽  
Author(s):  
Emmi P. Scott ◽  
Anne Sorrell ◽  
Andreana Benitez

AbstractObjective:Few independent studies have examined the psychometric properties of the NIH Toolbox Cognition Battery (NIHTB-CB) in older adults, despite growing interest in its use for clinical purposes. In this paper we report the test–retest reliability and construct validity of the NIHTB-CB, as well as its agreement or concordance with traditional neuropsychological tests of the same construct to determine whether tests could be used interchangeably.Methods:Sixty-one cognitively healthy adults ages 60–80 completed “gold standard” (GS) neuropsychological tests, NIHTB-CB, and brain MRI. Test–retest reliability, convergent/discriminant validity, and agreement statistics were calculated using Pearson’s correlations, concordance correlation coefficients (CCC), and root mean square deviations.Results:Test–retest reliability was acceptable (CCC = .73 Fluid; CCC = .85 Crystallized). The NIHTB-CB Fluid Composite correlated significantly with cerebral volumes (r’s = |.35−.41|), and both composites correlated highly with their respective GS composites (r’s = .58−.84), although this was more variable for individual tests. Absolute agreement was generally lower (CCC = .55 Fluid; CCC = .70 Crystallized) due to lower precision in fluid scores and systematic overestimation of crystallized composite scores on the NIHTB-CB.Conclusions:These results support the reliability and validity of the NIHTB-CB in healthy older adults and suggest that the fluid composite tests are at least as sensitive as standard neuropsychological tests to medial temporal atrophy and ventricular expansion. However, the NIHTB-CB may generate different estimates of performance and should not be treated as interchangeable with established neuropsychological tests.

2012 ◽  
Vol 2012 ◽  
pp. 1-7 ◽  
Author(s):  
Alison Douglas ◽  
Lori Letts ◽  
Kevin Eva ◽  
Julie Richardson

Objectives. Defining and validating a measure of safety contributes to further validation of clinical measures. The objective was to define and examine the psychometric properties of the outcome “incidents of harm.”Methods. The Incident of Harm Caregiver Questionnaire was administered to caregivers of older adults discharged from hospital by telephone. Caregivers completed daily logs for one month and medical charts were examined.Results. Test-retest reliability (n=38) was high for the occurrence of an incident of harm (yes/no; kappa = 1.0) and the type of incident (agreement = 100%). Validation against daily logs found no disagreement regarding occurrence or types of incidents. Validation with medical charts found no disagreement regarding incident occurrence and disagreement in half regarding incident type.Discussion. The data support the Incident of Harm Caregiver Questionnaire as a reliable and valid estimation of incidents for this sample and are important to researchers as a method to measure safety when validating clinical measures.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Amin Kordi Yoosefinejad ◽  
Fatemeh Karjalian ◽  
Marzieh Momennasab ◽  
Shahrokh Ezzatzadegan Jahromi

Abstract Background Hemodialysis is considered a major therapeutic method for patients with chronic kidney disease. Pruritus is a common complaint of hemodialysis patients. The 5-D pruritus scale is amongst the most common tools to evaluate several dimensions of itch. Psychometric properties of the 5-D scale have not been evaluated in Persian speaking population with hemodialysis; hence, the objective of this study was to assess reliability and validity of the Persian version of the scale. Methods Ninety hemodialysis patients (men: 50, women: 40, mean age: 54.4 years) participated in this cross-sectional study. The final Persian version of 5-D scale was given to the participants. Tests Compared: One-third of the participants completed the scale twice within 3–7 days apart to evaluate test- retest reliability. Other psychometric properties including internal consistency, absolute reliability, convergent, discriminative and construct validity, floor/ceiling effects were also evaluated. Results The Persian 5-D scale has strong test-retest reliability (ICC= 0.98) and internal consistency (Cronbach’s alpha= 0.99). Standard error of measurement and minimal detectable change were 0.33 and 0.91, respectively. Regarding convergent validity, the scale had moderate correlation with numeric rating scale (r =0.67) and quality of life questionnaire related to itch (r = 0.59). Exploratory factor analysis revealed two factors within the scale. No floor or ceiling effect was found for the scale. Conclusion The Persian version of 5-D the itching scale is a brief instrument with acceptable reliability and validity. Therefore, the scale could be used by experts, nurses, and other health service providers to evaluate pruritus among Persian speaking hemodialysis patients.


Author(s):  
Thomas Finkenzeller ◽  
Björn Krenn ◽  
Sabine Würth ◽  
Günter Amesberger

AbstractThe design fluency test (DFT) has been reported to predict successful sports performance of soccer players and has therefore been in the spotlight of sport psychology research. There is, however, a lack of research regarding the psychometric properties of the DFT in elite sports. Thus, the aim of this research was to provide findings of test–retest reliability, practice effects and the diagnostic power of the DFT. Multiple studies of youth and adult elite athletes, as well as nonathlete students, were conducted in applied settings. Test–retest relationship demonstrated poor to acceptable short-term and long-term correlations. Furthermore, significant changes between test and retest were obtained in some variables that differed among samples. The differential value of the DFT was corroborated by significant differences between adolescent students and adolescent elite soccer players. Regarding the prospective value, significant partial correlation coefficients were found between DFT scores and volleyball performance in adult elite players. Although our research partially confirmed previous findings on the differential and prospective power of the DFT, the findings on test–retest reliability indicate that the DFT cannot be recommended for application in sports. The psychometric properties—in particular the findings on test–retest reliability—of the DFT have to be improved before research can be carried out on the application for the selection of team sport athletes and for the prediction of future success in team sports. Further research is needed to develop a scientific instrument for the assessment of game intelligence.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Liyuan Cui ◽  
Yaxin Zhu ◽  
Jinglou Qu ◽  
Liming Tie ◽  
Ziqi Wang ◽  
...  

Abstract Background Critical thinking disposition helps medical students and professionals overcome the effects of personal values and beliefs when exercising clinical judgment. The lack of effective instruments to measure critical thinking disposition in medical students has become an obstacle for training and evaluating students in undergraduate programs in China. The aim of this study was to evaluate the psychometric properties of the CTDA test. Methods A total of 278 students participated in this study and responded to the CTDA test. Cronbach’s α coefficient, internal consistency, test-retest reliability, floor effects and ceiling effects were measured to assess the reliability of the questionnaire. Construct validity of the pre-specified three-domain structure of the CTDA was evaluated by explanatory factor analysis (EFA) and confirmatory factor analysis (CFA). The convergent validity and discriminant validity were also analyzed. Results Cronbach’s alpha coefficient for the entire questionnaire was calculated to be 0.92, all of the domains showed acceptable internal consistency (0.81–0.86), and the test-retest reliability indicated acceptable intra-class correlation coefficients (ICCs) (0.93, p < 0.01). The EFA and the CFA demonstrated that the three-domain model fitted the data adequately. The test showed satisfactory convergent and discriminant validity. Conclusions The CTDA is a reliable and valid questionnaire to evaluate the disposition of medical students towards critical thinking in China and can reasonably be applied in critical thinking programs and medical education research.


2020 ◽  
Author(s):  
Liyuan Cui ◽  
Yaxin Zhu ◽  
Jinglou Qu ◽  
Liming Tie ◽  
Ziqi Wang ◽  
...  

Abstract Background: Critical thinking disposition helps medical students and professionals overcome the effects of personal values and beliefs when exercising clinical judgment. The lack of effective instruments to measure critical thinking disposition in medical students has become an obstacle for training and evaluating students in undergraduate programs in China. The aim of this study was to evaluate the psychometric properties of the CTDA test.Methods: A total of 278 students participated in this study and responded to the CTDA test. Cronbach’s a coefficient, internal consistency, test-retest reliability, floor effects and ceiling effects were measured to assess the reliability of the questionnaire. Construct validity of the pre-specified three-domain structure of the CTDA was evaluated by explanatory factor analysis (EFA) and confirmatory factor analysis (CFA). The convergent validity and discriminant validity were also analyzed.Results: Cronbach’s alpha coefficient for the entire questionnaire was calculated to be 0.92, all of the domains showed acceptable internal consistency (0.81-0.86), and the test-retest reliability indicated acceptable intra-class correlation coefficients (ICCs) (0.93, p<0.01). The EFA and the CFA demonstrated that the three-domain model fitted the data adequately. The test showed satisfactory convergent and discriminant validity.Conclusions: The CTDA is a reliable and valid questionnaire to evaluate the disposition of medical students towards critical thinking in China and can reasonably be applied in critical thinking programs and medical education research.


2008 ◽  
Vol 16 (3) ◽  
pp. 292-315 ◽  
Author(s):  
Dawn P. Gill ◽  
Gareth R. Jones ◽  
GuangYong Zou ◽  
Mark Speechley

The purpose of this study was to develop a brief physical activity interview for older adults (Phone-FITT) and evaluate its test–retest reliability and validity. Summary scores were derived for household, recreational, and total PA. Reliability was evaluated in a convenience sample from a fall-prevention study (N= 43, 79.4 ± 2.9 years, 51% male), and validity, in a random sample of individuals in older adult exercise programs (N= 48, 77.4 ± 4.7 years, 25% male). Mean time to complete the Phone-FITT was 10 min for participants sampled from exercise programs. Evaluation of test–retest reliability indicated substantial to almost perfect agreement for all scores, with intraclass correlation coefficients (95% confidence intervals) ranging from .74 (.58–.85) to .88 (.8–.94). For validity, Spearman’s rho correlations of Phone-FITT scores with accelerometer counts ranged from .29 (.01–.53) to .57 (.34–.73). Correlations of Phone-FITT recreational scores with age and seconds to complete a self-paced step test ranged from –.29 (–.53 to –.01) to –.45 (–.68 to –.14). This study contributes preliminary evidence of the reliability and validity of the Phone-FITT.


2015 ◽  
Vol 12 (5) ◽  
pp. 727-732 ◽  
Author(s):  
Keith P. Gennuso ◽  
Charles E. Matthews ◽  
Lisa H. Colbert

Background:The purpose of this study was to examine the reliability and validity of 2 currently available physical activity surveys for assessing time spent in sedentary behavior (SB) in older adults.Methods:Fifty-eight adults (≥65 years) completed the Yale Physical Activity Survey for Older Adults (YPAS) and Community Health Activities Model Program for Seniors (CHAMPS) before and after a 10-day period during which they wore an ActiGraph accelerometer (ACC). Intraclass correlation coefficients (ICC) examined test-retest reliability. Overall percent agreement and a kappa statistic examined YPAS validity. Lin’s concordance correlation, Pearson correlation, and Bland-Altman analysis examined CHAMPS validity.Results:Both surveys had moderate test-retest reliability (ICC: YPAS = 0.59 (P < .001), CHAMPS = 0.64 (P < .001)) and significantly underestimated SB time. Agreement between YPAS and ACC was low (κ = −0.0003); however, there was a linear increase (P < .01) in ACC-derived SB time across YPAS response categories. There was poor agreement between ACC-derived SB and CHAMPS (Lin’s r = .005; 95% CI, −0.010 to 0.020), and no linear trend across CHAMPS quartiles (P = .53).Conclusions:Neither of the surveys should be used as the sole measure of SB in a study; though the YPAS has the ability to rank individuals, providing it with some merit for use in correlational SB research.


2007 ◽  
Vol 15 (2) ◽  
pp. 184-194 ◽  
Author(s):  
Marissa E. Mendelsohn ◽  
Denise M. Connelly ◽  
Tom J. Overend ◽  
Robert J. Petrella

Although popular in clinical settings, little is known about the utility of all-extremity semirecumbent exercise machines for research. Twenty-one community-dwelling older adults performed two exercise trials (three 4-min stages at increasing workloads) to evaluate the reliability and validity of exercise responses to submaximal all-extremity semirecumbent exercise (BioStep). Exercise responses were measured directly (Cosmed K4b2) and indirectly through software on the BioStep. Test–retest reliability (ICC2,1) was moderate to high across all three stages for directly measured METs (.92, .87, and .88) and HR (.91, .83, and .86). Concurrent criterion validity between the K4b2and BioStep MET values was moderate to very good across the three stages on both Day 1 (r= .86, .71, and .83) and Day 2 (r= .73, .87, and .72). All-extremity semirecumbent submaximal exercise elicited reliable and valid responses in our sample of older adults and thus can be considered a viable exercise mode.


Sign in / Sign up

Export Citation Format

Share Document