Psychometric Properties of the NIH Toolbox Cognition Battery in Healthy Older Adults: Reliability, Validity, and Agreement with Standard Neuropsychological Tests

Abstract Objective: Few independent studies have examined the psychometric properties of the NIH Toolbox Cognition Battery (NIHTB-CB) in older adults, despite growing interest in its clinical use. We report the test–retest reliability and construct validity of the NIHTB-CB, as well as its agreement (concordance) with traditional neuropsychological tests of the same constructs, to determine whether the tests could be used interchangeably. Methods: Sixty-one cognitively healthy adults ages 60–80 completed “gold standard” (GS) neuropsychological tests, the NIHTB-CB, and brain MRI. Test–retest reliability, convergent/discriminant validity, and agreement statistics were calculated using Pearson’s correlations, concordance correlation coefficients (CCC), and root mean square deviations. Results: Test–retest reliability was acceptable (CCC = .73 Fluid; CCC = .85 Crystallized). The NIHTB-CB Fluid Composite correlated significantly with cerebral volumes (r’s = |.35−.41|), and both composites correlated highly with their respective GS composites (r’s = .58−.84), although correlations were more variable for individual tests. Absolute agreement was generally lower (CCC = .55 Fluid; CCC = .70 Crystallized) due to lower precision in fluid scores and systematic overestimation of crystallized composite scores on the NIHTB-CB. Conclusions: These results support the reliability and validity of the NIHTB-CB in healthy older adults and suggest that the fluid composite tests are at least as sensitive as standard neuropsychological tests to medial temporal atrophy and ventricular expansion. However, the NIHTB-CB may generate different estimates of performance and should not be treated as interchangeable with established neuropsychological tests.
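The difference between correlation and absolute agreement can be illustrated with a minimal Python sketch of Lin's concordance correlation coefficient. This is not the authors' analysis code; the function name and the toy scores are hypothetical. The example shows that a constant offset between two instruments leaves the scores perfectly correlated yet lowers the CCC, which is the kind of effect the abstract attributes to systematic overestimation.

```python
from statistics import fmean


def concordance_correlation(x, y):
    """Lin's concordance correlation coefficient for paired scores.

    Uses population (1/n) variances and covariance, as in Lin's definition:
    CCC = 2*s_xy / (s_x^2 + s_y^2 + (mean_x - mean_y)^2).
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and contain at least 2 values")
    mx, my = fmean(x), fmean(y)
    sxx = fmean([(a - mx) ** 2 for a in x])
    syy = fmean([(b - my) ** 2 for b in y])
    sxy = fmean([(a - mx) * (b - my) for a, b in zip(x, y)])
    return 2 * sxy / (sxx + syy + (mx - my) ** 2)


# Hypothetical scores: the second instrument reads 2 points higher throughout.
reference = [1.0, 2.0, 3.0, 4.0, 5.0]
shifted = [score + 2.0 for score in reference]

ccc_identical = concordance_correlation(reference, reference)
ccc_shifted = concordance_correlation(reference, shifted)
print(ccc_identical, ccc_shifted)
```

With these toy values the identical pair gives a CCC of 1, while the shifted pair gives 0.5 even though the two series are perfectly linearly related.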


Measurement of Harm Outcomes in Older Adults after Hospital Discharge: Reliability and Validity

Journal of Aging Research ◽

10.1155/2012/150473 ◽

2012 ◽

Vol 2012 ◽

pp. 1-7 ◽

Cited By ~ 3

Author(s):

Alison Douglas ◽

Lori Letts ◽

Kevin Eva ◽

Julie Richardson

Keyword(s):

Older Adults ◽

Psychometric Properties ◽

Hospital Discharge ◽

Reliability And Validity ◽

Retest Reliability ◽

Incident Type ◽

Data Support ◽

Caregiver Questionnaire ◽

Test Retest Reliability ◽

Clinical Measures

Objectives. Defining and validating a measure of safety contributes to further validation of clinical measures. The objective was to define and examine the psychometric properties of the outcome “incidents of harm.” Methods. The Incident of Harm Caregiver Questionnaire was administered by telephone to caregivers of older adults discharged from hospital. Caregivers completed daily logs for one month, and medical charts were examined. Results. Test-retest reliability (n = 38) was high for the occurrence of an incident of harm (yes/no; kappa = 1.0) and the type of incident (agreement = 100%). Validation against daily logs found no disagreement regarding occurrence or types of incidents. Validation with medical charts found no disagreement regarding incident occurrence and disagreement in half regarding incident type. Discussion. The data support the Incident of Harm Caregiver Questionnaire as a reliable and valid estimation of incidents for this sample and are important to researchers as a method to measure safety when validating clinical measures.


Reliability and validity of the Persian version of 5-D itching scale among patients with chronic kidney disease

BMC Nephrology ◽

10.1186/s12882-020-02220-x ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Amin Kordi Yoosefinejad ◽

Fatemeh Karjalian ◽

Marzieh Momennasab ◽

Shahrokh Ezzatzadegan Jahromi

Keyword(s):

Chronic Kidney Disease ◽

Kidney Disease ◽

Psychometric Properties ◽

Internal Consistency ◽

Reliability And Validity ◽

Hemodialysis Patients ◽

Life Questionnaire ◽

Retest Reliability ◽

Persian Version ◽

Test Retest Reliability

Abstract Background: Hemodialysis is considered a major therapeutic method for patients with chronic kidney disease. Pruritus is a common complaint of hemodialysis patients. The 5-D pruritus scale is among the most common tools for evaluating several dimensions of itch. The psychometric properties of the 5-D scale have not been evaluated in Persian-speaking hemodialysis patients; hence, the objective of this study was to assess the reliability and validity of the Persian version of the scale. Methods: Ninety hemodialysis patients (men: 50, women: 40, mean age: 54.4 years) participated in this cross-sectional study. The final Persian version of the 5-D scale was given to the participants. One-third of the participants completed the scale twice, 3–7 days apart, to evaluate test-retest reliability. Other psychometric properties, including internal consistency, absolute reliability, convergent, discriminative, and construct validity, and floor/ceiling effects, were also evaluated. Results: The Persian 5-D scale has strong test-retest reliability (ICC = 0.98) and internal consistency (Cronbach’s alpha = 0.99). Standard error of measurement and minimal detectable change were 0.33 and 0.91, respectively. Regarding convergent validity, the scale had moderate correlation with a numeric rating scale (r = 0.67) and an itch-related quality of life questionnaire (r = 0.59). Exploratory factor analysis revealed two factors within the scale. No floor or ceiling effect was found for the scale. Conclusion: The Persian version of the 5-D itching scale is a brief instrument with acceptable reliability and validity. Therefore, the scale could be used by experts, nurses, and other health service providers to evaluate pruritus among Persian-speaking hemodialysis patients.
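The reported standard error of measurement (0.33) and minimal detectable change (0.91) are consistent with the conventional 95% formula, MDC95 = 1.96 × √2 × SEM. The abstract does not say which formula the authors used, so the short Python sketch below is an assumption offered only as a plausibility check; the function name is hypothetical.

```python
import math


def minimal_detectable_change(sem, z=1.96):
    """Conventional MDC: z * sqrt(2) * SEM.

    The sqrt(2) factor reflects measurement error in both the test and the
    retest score; z = 1.96 corresponds to a 95% confidence level.
    """
    if sem < 0:
        raise ValueError("sem must be non-negative")
    return z * math.sqrt(2) * sem


# SEM value taken from the abstract; the formula itself is assumed.
mdc95 = minimal_detectable_change(0.33)
print(round(mdc95, 2))
```

Under this assumption the computed value rounds to 0.91, matching the figure in the abstract.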


The design fluency test: a reliable and valid instrument for the assessment of game intelligence?

German Journal of Exercise and Sport Research ◽

10.1007/s12662-020-00697-0 ◽

2021 ◽

Author(s):

Thomas Finkenzeller ◽

Björn Krenn ◽

Sabine Würth ◽

Günter Amesberger

Keyword(s):

Psychometric Properties ◽

Team Sports ◽

Correlation Coefficients ◽

Soccer Players ◽

Scientific Instrument ◽

Retest Reliability ◽

Adolescent Students ◽

Design Fluency ◽

Test Retest Reliability ◽

Fluency Test

Abstract The design fluency test (DFT) has been reported to predict successful sports performance of soccer players and has therefore been in the spotlight of sport psychology research. There is, however, a lack of research regarding the psychometric properties of the DFT in elite sports. Thus, the aim of this research was to provide findings on the test–retest reliability, practice effects, and diagnostic power of the DFT. Multiple studies of youth and adult elite athletes, as well as nonathlete students, were conducted in applied settings. Test–retest analyses showed poor to acceptable short-term and long-term correlations. Furthermore, significant changes between test and retest were found in some variables, and these variables differed among samples. The differential value of the DFT was corroborated by significant differences between adolescent students and adolescent elite soccer players. Regarding the prospective value, significant partial correlation coefficients were found between DFT scores and volleyball performance in adult elite players. Although our research partially confirmed previous findings on the differential and prospective power of the DFT, the findings on test–retest reliability indicate that the DFT cannot be recommended for application in sports. The psychometric properties of the DFT, in particular its test–retest reliability, have to be improved before research can be carried out on its application to the selection of team sport athletes and the prediction of future success in team sports. Further research is needed to develop a scientific instrument for the assessment of game intelligence.


Psychometric properties of the critical thinking disposition assessment test amongst medical students in China: a cross-sectional study

BMC Medical Education ◽

10.1186/s12909-020-02437-2 ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Liyuan Cui ◽

Yaxin Zhu ◽

Jinglou Qu ◽

Liming Tie ◽

Ziqi Wang ◽

...

Keyword(s):

Factor Analysis ◽

Critical Thinking ◽

Medical Students ◽

Psychometric Properties ◽

Internal Consistency ◽

Discriminant Validity ◽

Retest Reliability ◽

Critical Thinking Disposition ◽

Thinking Disposition ◽

Test Retest Reliability

Abstract Background: Critical thinking disposition helps medical students and professionals overcome the effects of personal values and beliefs when exercising clinical judgment. The lack of effective instruments to measure critical thinking disposition in medical students has become an obstacle for training and evaluating students in undergraduate programs in China. The aim of this study was to evaluate the psychometric properties of the CTDA test. Methods: A total of 278 students participated in this study and responded to the CTDA test. Cronbach’s α coefficient, internal consistency, test-retest reliability, floor effects and ceiling effects were measured to assess the reliability of the questionnaire. Construct validity of the pre-specified three-domain structure of the CTDA was evaluated by exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The convergent validity and discriminant validity were also analyzed. Results: Cronbach’s alpha coefficient for the entire questionnaire was calculated to be 0.92, all of the domains showed acceptable internal consistency (0.81–0.86), and the test-retest reliability indicated acceptable intra-class correlation coefficients (ICCs) (0.93, p < 0.01). The EFA and the CFA demonstrated that the three-domain model fitted the data adequately. The test showed satisfactory convergent and discriminant validity. Conclusions: The CTDA is a reliable and valid questionnaire to evaluate the disposition of medical students towards critical thinking in China and can reasonably be applied in critical thinking programs and medical education research.
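For readers unfamiliar with the internal-consistency statistic reported here, the following minimal Python sketch computes Cronbach's alpha from a respondents-by-items score table using the standard formula, alpha = k/(k−1) × (1 − Σ item variances / variance of total scores). It is an illustration only, not the study's code; the function name and the tiny response matrix are hypothetical and unrelated to the CTDA data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses sample (n-1) variances for both the items and the total scores.
    """
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    if any(len(r) != k for r in rows):
        raise ValueError("every respondent must answer the same number of items")
    item_columns = list(zip(*rows))  # transpose to one tuple per item
    sum_item_variances = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum_item_variances / total_variance)


# Hypothetical responses: 4 respondents, 3 items on a 1-5 scale.
responses = [
    [2, 3, 2],
    [4, 4, 5],
    [3, 3, 3],
    [5, 4, 5],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Alpha approaches 1 when items rise and fall together across respondents and drops toward 0 when item scores vary independently of one another.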


Psychometric Properties of the Critical Thinking Disposition Assessment Test Amongst Medical Students in China: A Cross-Sectional Study

10.21203/rs.3.rs-16612/v2 ◽

2020 ◽

Author(s):

Liyuan Cui ◽

Yaxin Zhu ◽

Jinglou Qu ◽

Liming Tie ◽

Ziqi Wang ◽

...

Keyword(s):

Factor Analysis ◽

Critical Thinking ◽

Medical Students ◽

Psychometric Properties ◽

Internal Consistency ◽

Discriminant Validity ◽

Retest Reliability ◽

Critical Thinking Disposition ◽

Thinking Disposition ◽

Test Retest Reliability

Abstract Background: Critical thinking disposition helps medical students and professionals overcome the effects of personal values and beliefs when exercising clinical judgment. The lack of effective instruments to measure critical thinking disposition in medical students has become an obstacle for training and evaluating students in undergraduate programs in China. The aim of this study was to evaluate the psychometric properties of the CTDA test. Methods: A total of 278 students participated in this study and responded to the CTDA test. Cronbach’s α coefficient, internal consistency, test-retest reliability, floor effects and ceiling effects were measured to assess the reliability of the questionnaire. Construct validity of the pre-specified three-domain structure of the CTDA was evaluated by exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The convergent validity and discriminant validity were also analyzed. Results: Cronbach’s alpha coefficient for the entire questionnaire was calculated to be 0.92, all of the domains showed acceptable internal consistency (0.81–0.86), and the test-retest reliability indicated acceptable intra-class correlation coefficients (ICCs) (0.93, p < 0.01). The EFA and the CFA demonstrated that the three-domain model fitted the data adequately. The test showed satisfactory convergent and discriminant validity. Conclusions: The CTDA is a reliable and valid questionnaire to evaluate the disposition of medical students towards critical thinking in China and can reasonably be applied in critical thinking programs and medical education research.


Test-Retest Reliability and Minimal Detectable Change of the Instrumented Modified Clinical Test of Sensory Interaction on Balance in Healthy, Older Adults

Journal of Geriatric Physical Therapy ◽

10.1519/jpt.0000000000000274 ◽

2020 ◽

Vol Publish Ahead of Print ◽

Author(s):

Seth Watson ◽

Elaine Trudelle-Jackson

Keyword(s):

Older Adults ◽

Minimal Detectable Change ◽

Clinical Test ◽

Detectable Change ◽

Sensory Interaction ◽

Retest Reliability ◽

Healthy Older Adults ◽

Test Retest Reliability


The Phone-FITT: A Brief Physical Activity Interview for Older Adults

Journal of Aging and Physical Activity ◽

10.1123/japa.16.3.292 ◽

2008 ◽

Vol 16 (3) ◽

pp. 292-315 ◽

Cited By ~ 40

Author(s):

Dawn P. Gill ◽

Gareth R. Jones ◽

GuangYong Zou ◽

Mark Speechley

Keyword(s):

Physical Activity ◽

Older Adults ◽

Intraclass Correlation ◽

Reliability And Validity ◽

Step Test ◽

Exercise Programs ◽

Perfect Agreement ◽

Convenience Sample ◽

Retest Reliability ◽

Test Retest Reliability

The purpose of this study was to develop a brief physical activity interview for older adults (Phone-FITT) and evaluate its test–retest reliability and validity. Summary scores were derived for household, recreational, and total PA. Reliability was evaluated in a convenience sample from a fall-prevention study (N = 43, 79.4 ± 2.9 years, 51% male), and validity, in a random sample of individuals in older adult exercise programs (N = 48, 77.4 ± 4.7 years, 25% male). Mean time to complete the Phone-FITT was 10 min for participants sampled from exercise programs. Evaluation of test–retest reliability indicated substantial to almost perfect agreement for all scores, with intraclass correlation coefficients (95% confidence intervals) ranging from .74 (.58–.85) to .88 (.80–.94). For validity, Spearman’s rho correlations of Phone-FITT scores with accelerometer counts ranged from .29 (.01–.53) to .57 (.34–.73). Correlations of Phone-FITT recreational scores with age and seconds to complete a self-paced step test ranged from –.29 (–.53 to –.01) to –.45 (–.68 to –.14). This study contributes preliminary evidence of the reliability and validity of the Phone-FITT.


Repeated computerized cognitive testing: Performance shifts and test–retest reliability in healthy older adults

Journal of Clinical and Experimental Neuropsychology ◽

10.1080/13803395.2018.1526888 ◽

2018 ◽

Vol 41 (2) ◽

pp. 179-191 ◽

Cited By ~ 4

Author(s):

Naomi White ◽

Larnee Flannery ◽

Alice McClintock ◽

Liana Machado

Keyword(s):

Older Adults ◽

Cognitive Testing ◽

Retest Reliability ◽

Healthy Older Adults ◽

Testing Performance ◽

Test Retest Reliability


Reliability and Validity of 2 Self-Report Measures to Assess Sedentary Behavior in Older Adults

Journal of Physical Activity and Health ◽

10.1123/jpah.2013-0546 ◽

2015 ◽

Vol 12 (5) ◽

pp. 727-732 ◽

Cited By ~ 22

Author(s):

Keith P. Gennuso ◽

Charles E. Matthews ◽

Lisa H. Colbert

Keyword(s):

Physical Activity ◽

Older Adults ◽

Sedentary Behavior ◽

Linear Trend ◽

Pearson Correlation ◽

Intraclass Correlation ◽

Reliability And Validity ◽

Self Report ◽

Retest Reliability ◽

Test Retest Reliability

Background: The purpose of this study was to examine the reliability and validity of 2 currently available physical activity surveys for assessing time spent in sedentary behavior (SB) in older adults. Methods: Fifty-eight adults (≥65 years) completed the Yale Physical Activity Survey for Older Adults (YPAS) and Community Health Activities Model Program for Seniors (CHAMPS) before and after a 10-day period during which they wore an ActiGraph accelerometer (ACC). Intraclass correlation coefficients (ICC) examined test-retest reliability. Overall percent agreement and a kappa statistic examined YPAS validity. Lin’s concordance correlation, Pearson correlation, and Bland-Altman analysis examined CHAMPS validity. Results: Both surveys had moderate test-retest reliability (ICC: YPAS = 0.59 (P < .001), CHAMPS = 0.64 (P < .001)) and significantly underestimated SB time. Agreement between YPAS and ACC was low (κ = −0.0003); however, there was a linear increase (P < .01) in ACC-derived SB time across YPAS response categories. There was poor agreement between ACC-derived SB and CHAMPS (Lin’s r = .005; 95% CI, −0.010 to 0.020), and no linear trend across CHAMPS quartiles (P = .53). Conclusions: Neither survey should be used as the sole measure of SB in a study, although the YPAS can rank individuals, which gives it some merit for use in correlational SB research.


Reliability and Validity of Responses to Submaximal All-Extremity Semirecumbent Exercise in Older Adults

Journal of Aging and Physical Activity ◽

10.1123/japa.15.2.184 ◽

2007 ◽

Vol 15 (2) ◽

pp. 184-194 ◽

Cited By ~ 2

Author(s):

Marissa E. Mendelsohn ◽

Denise M. Connelly ◽

Tom J. Overend ◽

Robert J. Petrella

Keyword(s):

Older Adults ◽

Reliability And Validity ◽

Submaximal Exercise ◽

Community Dwelling ◽

Clinical Settings ◽

Retest Reliability ◽

Exercise Mode ◽

Three Stages ◽

Test Retest Reliability ◽

Community Dwelling Older Adults

Although all-extremity semirecumbent exercise machines are popular in clinical settings, little is known about their utility for research. Twenty-one community-dwelling older adults performed two exercise trials (three 4-min stages at increasing workloads) to evaluate the reliability and validity of exercise responses to submaximal all-extremity semirecumbent exercise (BioStep). Exercise responses were measured directly (Cosmed K4b2) and indirectly through software on the BioStep. Test–retest reliability (ICC(2,1)) was moderate to high across all three stages for directly measured METs (.92, .87, and .88) and HR (.91, .83, and .86). Concurrent criterion validity between the K4b2 and BioStep MET values was moderate to very good across the three stages on both Day 1 (r = .86, .71, and .83) and Day 2 (r = .73, .87, and .72). All-extremity semirecumbent submaximal exercise elicited reliable and valid responses in our sample of older adults and thus can be considered a viable exercise mode.
