Test–Retest Reliability of the Standing Heel-Rise Test

Michael D. Ross; Elizabeth G. Fontenot

doi:10.1123/jsr.9.2.117

Test–Retest Reliability of the Standing Heel-Rise Test

Journal of Sport Rehabilitation ◽

10.1123/jsr.9.2.117 ◽

2000 ◽

Vol 9 (2) ◽

pp. 117-123 ◽

Cited By ~ 22

Author(s):

Michael D. Ross ◽

Elizabeth G. Fontenot

Keyword(s):

Outcome Measures ◽

Repeated Measures ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Muscle Performance ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Reliable Assessment ◽

Repeated Measures Analysis ◽

Test Retest Reliability

Context:The standing heel-rise test has been recommended as a means of assessing calf-muscle performance. To the authors' knowledge, the reliability of the test using intraclass correlation coefficients (ICCs) has not been reported.Objective:To determine the test-retest reliability of the standing heel-rise test.Design:Single-group repeated measures.Participants:Seventeen healthy subjects.Settings and Infevention:Each subject was asked to perform as many standing heel raises as possible during 2 testing sessions separated by 7 days.Main Outcome Measures:Reliability data for the standing heel-rise test were studied through a repeated-measures analysis of variance, ICC2, 1 and SEMs.Results:The ICC2,1 and SEM values for the standing heel-rise test were .96 and 2.07 repetitions, respectively.Conclusions:The standing heel-rise test offers clinicians a reliable assessment of calfmuscle performance. Further study is necessary to determine the ability of the standing heel-rise test to detect functional deficiencies in patients recovering from lower leg injury or surgery

Download Full-text

Assessing test–retest reliability of patient-reported outcome measures using intraclass correlation coefficients: recommendations for selecting and documenting the analytical formula

Quality of Life Research ◽

10.1007/s11136-018-2076-0 ◽

2018 ◽

Vol 28 (4) ◽

pp. 1029-1033 ◽

Cited By ~ 18

Author(s):

Shanshan Qin ◽

Lauren Nelson ◽

Lori McLeod ◽

Sonya Eremenco ◽

Stephen Joel Coons

Keyword(s):

Outcome Measures ◽

Intraclass Correlation ◽

Analytical Formula ◽

Correlation Coefficients ◽

Patient Reported Outcome Measures ◽

Patient Reported Outcome ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Patient Reported ◽

Test Retest Reliability

Download Full-text

Psychometric Comparisons of 2 Versions of the Fugl-Meyer Motor Scale and 2 Versions of the Stroke Rehabilitation Assessment of Movement

Neurorehabilitation and Neural Repair ◽

10.1177/1545968308315999 ◽

2008 ◽

Vol 22 (6) ◽

pp. 737-744 ◽

Cited By ~ 64

Author(s):

I-Ping Hsueh ◽

Miao-Ju Hsu ◽

Ching-Fan Sheu ◽

Su Lee ◽

Ching-Lin Hsieh ◽

...

Keyword(s):

Stroke Rehabilitation ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Stroke Patients ◽

Chronic Patients ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Mild Stroke ◽

Test Retest Reliability ◽

Rehabilitation Assessment

Objective. To provide empirical justification for selecting motor scales for stroke patients, the authors compared the psychometric properties (validity, responsiveness, test-retest reliability, and smallest real difference [SRD]) of the Fugl-Meyer Motor Scale (FM), the simplified FM (S-FM), the Stroke Rehabilitation Assessment of Movement instrument (STREAM), and the simplified STREAM (S-STREAM). Methods. For the validity and responsiveness study, 50 inpatients were assessed with the FM and the STREAM at admission and discharge to a rehabilitation department. The scores of the S-FM and the S-STREAM were retrieved from their corresponding scales. For the test-retest reliability study, a therapist administered both scales on a different sample of 60 chronic patients on 2 occasions. Results. Only the S-STREAM had no notable floor or ceiling effects at admission and discharge. The 4 motor scales had good concurrent validity (rho ≥ .91) and satisfactory predictive validity (rho = .72-.77). The scales showed responsiveness (effect size d ≥ 0.34; standardized response mean ≥ 0.95; P < .0001), with the S-STREAM most responsive. The test-retest agreements of the scales were excellent (intraclass correlation coefficients ≥ .96). The SRD of the 4 scales was 10% of their corresponding highest score, indicating acceptable level of measurement error. The upper extremity and the lower extremity subscales of the 4 showed similar results. Conclusions. The 4 motor scales showed acceptable levels of reliability, validity, and responsiveness in stroke patients. The S-STREAM is recommended because it is short, responsive to change, and able to discriminate patients with severe or mild stroke.

Download Full-text

Reliability of Safe Maximum Lifting Determinations of a Functional Capacity Evaluation

Physical Therapy ◽

10.1093/ptj/82.4.364 ◽

2002 ◽

Vol 82 (4) ◽

pp. 364-371 ◽

Cited By ~ 73

Author(s):

Douglas P Gross ◽

Michele C Battié

Keyword(s):

Functional Capacity ◽

Repeated Measures ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Functional Capacity Evaluation ◽

Measurement Variability ◽

Retest Reliability ◽

Repeated Measures Design ◽

Test Retest Reliability

Abstract Background and Purpose. Functional capacity evaluations (FCEs) are measurement tools used in predicting readiness to return to work following injury. The interrater and test-retest reliability of determinations of maximal safe lifting during kinesiophysical FCEs were examined in a sample of people who were off work and receiving workers' compensation. Subjects. Twenty-eight subjects with low back pain who had plateaued with treatment were enrolled. Five occupational therapists, trained and experienced in kinesiophysical methods, conducted testing. Methods. A repeated-measures design was used, with raters testing subjects simultaneously, yet independently. Subjects were rated on 2 occasions, separated by 2 to 4 days. Analyses included intraclass correlation coefficients (ICCs) and 95% confidence intervals. Results. The ICC values for interrater reliability ranged from .95 to .98. Test-retest values ranged from .78 to .94. Discussion and Conclusion. Inconsistencies in subjects' performance across sessions were the greatest source of FCE measurement variability. Overall, however, test-retest reliability was good and interrater reliability was excellent.

Download Full-text

Test-retest reliability and minimal detectable change of the Balance Evaluation Systems Test and its two abbreviated versions in persons with mild to moderate spinocerebellar ataxia: A pilot study

Neurorehabilitation ◽

10.3233/nre-203154 ◽

2020 ◽

Vol 47 (4) ◽

pp. 479-486

Author(s):

Yuki Kondo ◽

Kyota Bando ◽

Yosuke Ariake ◽

Wakana Katsuta ◽

Kyoko Todoroki ◽

...

Keyword(s):

Spinocerebellar Ataxia ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Minimal Detectable Change ◽

Detectable Change ◽

Retest Reliability ◽

Evaluation Systems ◽

Intraclass Correlation Coefficients ◽

Interclass Correlation ◽

Test Retest Reliability

BACKGROUND: The reliability of the evaluation of the Balance Evaluation Systems Test (BESTest) and its two abbreviated versions are confirmed for balance characteristics and reliability. However, they are not utilized in cases of spinocerebellar ataxia (SCA). OBJECTIVE: We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BESTest and its abbreviated versions in persons with mild to moderate spinocerebellar ataxia. METHODS: The BESTest was performed in 20 persons with SCA at baseline and one month later. The scores of the abbreviated version of the BESTest were determined from the BESTest scores. The interclass correlation coefficient (1,1) was used as a measure of relative reliability. Furthermore, we calculated the MDC in the BESTest and its abbreviated versions. RESULTS: The intraclass correlation coefficients (1,1) and MDC at 95% confidence intervals were 0.92, 8.7(8.1%), 0.91, 4.1(14.5%), and 0.81, 5.2(21.6%) for the Balance, Mini-Balance, and Brief-Balance Evaluation Systems Tests, respectively. CONCLUSIONS: The BESTest and its abbreviated versions had high test-retest reliability. The MDC values of the BESTest could enable clinicians and researchers to interpret changes in the balance of patients with SCA more precisely.

Download Full-text

The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

International Journal of Otolaryngology ◽

10.1155/2016/2718482 ◽

2016 ◽

Vol 2016 ◽

pp. 1-8 ◽

Cited By ~ 18

Author(s):

Taher I. Omari ◽

Johanna Savilampi ◽

Karmen Kokkinn ◽

Mistyka Schar ◽

Kristin Lamvik ◽

...

Keyword(s):

High Resolution ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Interrater Agreement ◽

Swallowing Function ◽

Retest Reliability ◽

High Resolution Manometry ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability ◽

Swallow Function

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements.Methods. Five subjects swallowed10×10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing.Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showedsubstantialtoexcellentagreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged fromslighttoexcellentdepending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showedexcellenttest-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showedmoderatetosubstantialtest-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged fromslighttosubstantial(mean Interrater ICC 0.15–0.61).Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility.

Download Full-text

Reliability of the ForceFrame With and Without a Fixed Upper-Limb Mold in Shoulder Rotation Strength Assessments Compared With Traditional Hand-Held Dynamometry

Journal of Sport Rehabilitation ◽

10.1123/jsr.2020-0434 ◽

2021 ◽

pp. 1-4

Author(s):

Jamon Couch ◽

Marc Sayers ◽

Tania Pizzari

Keyword(s):

Upper Limb ◽

External Rotation ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Shoulder Abduction ◽

Minimal Detectable Change ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Shoulder Strength ◽

Test Retest Reliability

Context: An imbalance between shoulder internal rotation (IR) and external rotation (ER) strength in athletes is proposed to increase the risk of sustaining a shoulder injury. Hand-held (HHD) and externally fixed dynamometry are reliable forms of assessing shoulder IR and ER strength. A new externally fixed device with an attachable fixed upper-limb mold (The ForceFrame) exists; however, its reliability in measuring shoulder strength is yet to be investigated. Objective: To determine the test–retest reliability of the ForceFrame, with and without the fixed upper-limb mold, in the assessment of shoulder IR and ER strength, as compared with HHD. Design: Test–retest reliability study. Setting: Laboratory, clinical. Participants: Twenty-two healthy and active individuals were recruited from the university community and a private physiotherapy practice. Main Outcome Measures: Maximal isometric shoulder IR and ER strength was measured using the ForceFrame and traditional HHD in neutral and at 90° shoulder abduction. Mean (SD) strength measures were calculated. Test–retest reliability was analyzed using intraclass correlation coefficients (3, 1). The SEM and minimal detectable change were calculated. Results: Good to excellent test–retest reliability was found for all shoulder strength tests across Hand-held dynamometry (HHD) and externally fixed dynamometry (EFD) are reliable forms (intraclass correlation coefficients [3, 1] = .854–.916). The minimal detectable changes ranged between 25.61 and 41.84 N across tests. Test–retest reliability was not affected by the dynamometer or testing position. Conclusions: The results from this study indicate that both the ForceFrame and HHD are suitable for measuring shoulder strength in clinical practice. The use of the fixed upper-limb mold with the ForceFrame does not improve reliability.

Download Full-text

Test–Retest Reliability of a New Device Versus a Long-Arm Goniometer to Evaluate Knee Proprioception

Journal of Sport Rehabilitation ◽

10.1123/jsr.2021-0146 ◽

2021 ◽

pp. 1-6

Author(s):

Fei Tian ◽

Yaqi Zhao ◽

Jixin Li ◽

Wenjin Wang ◽

Danni Wu ◽

...

Keyword(s):

Standard Error ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Joint Position Sense ◽

Repeated Measurements ◽

Good Reliability ◽

Retest Reliability ◽

New Device ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability

Context: Many methods used to evaluate knee proprioception have shortcomings that limit their use in clinical settings. Based on an inexpensive 3D camera, a new portable device was recently used to evaluate the joint position sense (JPS) of the knee joint. However, the test–retest reliability of the new method remains unclear. This study aimed to evaluate the test–retest reliability of the new device and a long-arm goniometer for assessing knee JPS, and to compare the variability of the 2 methods. Design: Prospective observational study of the test–retest reliability of knee JPS measurements. Methods: Twenty-one healthy adults were tested in 2 sessions with a 1-week interval. Three target knee flexion angles (30°, 45°, and 60°) were reproduced in each session. Target and reproduced angles were measured with both methods. Intraclass correlation coefficients, standard error of the measurement, and Bland–Altman plots were used to quantify test–retest reliability. Paired t tests were used to compare knee JPS (absolute error of the target-reproduced angle) between the methods. Results: The new device (good to excellent intraclass correlation coefficients .74–.80; standard error of the measurement 0.52°–0.61°) demonstrated better test–retest reliability than the goniometer (poor to fair intraclass correlation coefficients .23–.43; standard error of the measurement 0.89°–2.07°) and better test–retest agreement (respective mean differences for the 30°, 45°, and 60° knee angles: 0.11°, 0.13°, and 0.41° for the new system; 0.84°, 1.52°, and 1.18° for the goniometer). The measurements (absolute errors of the target-reproduced angles) with the goniometer were significantly greater than those with the new device (P < .05); the SDs of repeated measurements with the goniometer (1.50°–2.41°) were greater than with the new device (1.08°–1.38°). Conclusions: Given that the new device has good reliability and sufficient precision, it is the better alternative for evaluating knee JPS. Goniometers should be used with caution to assess knee JPS.

Download Full-text

The 10/5 Repeated Jumps Test: Are 10 Repetitions and Three Trials Necessary?

Biomechanics ◽

10.3390/biomechanics1010001 ◽

2020 ◽

Vol 1 (1) ◽

pp. 1-14

Author(s):

Callum Stratford ◽

Thomas Dos’Santos ◽

John J. McMahon

Keyword(s):

Repeated Measures ◽

Intraclass Correlation ◽

Force Plate ◽

Correlation Coefficients ◽

Pairwise Comparisons ◽

Strength Index ◽

Intraclass Correlation Coefficients ◽

Coefficients Of Variation ◽

Repeated Measures Analysis ◽

Ground Contact Time

The purpose of this study was to identify whether ten repetitions and three trials were necessary to achieve stability in peak reactive strength index (RSI) during the 10/5 repeated jumps test (RJT). Twenty-five males, from multiple sports, performed three trials of the RJT on an in-ground force plate, with 90 seconds’ rest between trials. Intraclass correlation coefficients (ICC = 0.916–0.986) and coefficients of variation (CV ≤ 14.5%) were considered acceptable for all variables. Repeated-measures analysis of variance and Freidman’s tests revealed large and significant differences (p ≤ 0.006, η2 = 0.159–0.434, power ≥ 0.859) in ground contact time (GCT), jump height (JH), and subsequently RSI both between trials and repetitions. Pairwise comparisons revealed that repetitions 1–3 produced longer GCTs (p ≤ 0.05, d = 0.41–1.40), lower JHs (p ≤ 0.05, d = 0.31–0.56), and lower RSI values (p ≤ 0.05, d = 0.35–1.24). The shortest GCTs, greatest JHs and greater RSIs occurred between repetitions 7 and 10, with approximately 60% of peak RSIs occurring during these ranges. The sequential estimate technique revealed that seven repetitions were needed to attain stability in mean peak RSI. Non-significant (p = 0.554) and negligible differences (d ≤ 0.09) in the five best RSIs between trials were noted. One trial of the 10/5 RJT is sufficient to evaluate peak RSI in athletes and should reduce data collection time and fatigue.

Download Full-text

Agreement between the Open Barbell and Tendo Linear Position Transducers for Monitoring Barbell Velocity during Resistance Exercise

Sports ◽

10.3390/sports7050125 ◽

2019 ◽

Vol 7 (5) ◽

pp. 125

Author(s):

Adam M. Gonzalez ◽

Gerald T. Mangine ◽

Robert W. Spitz ◽

Jamie J. Ghigiarelli ◽

Katie M. Sell

Keyword(s):

Repeated Measures ◽

Peak Velocity ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Moderate Intensity ◽

Mean Velocity ◽

Intraclass Correlation Coefficients ◽

Coefficients Of Variation ◽

Repeated Measures Analysis ◽

Standard Error Of Measurement

To determine the agreement between the Open Barbell (OB) and Tendo weightlifting analyzer (TWA) for measuring barbell velocity, eleven men (19.4 ± 1.0 y) performed one set of 2–3 repetitions at four sub-maximal percentage loads, [i.e., 30, 50, 70, and 90% one-repetition maximum (1RM)] in the back (BS) and front squat (FS) exercises. During each repetition, peak and mean barbell velocity were recorded by OB and TWA devices, and the average of the 2–3 repetitions was used for analyses. Although the repeated measures analysis of variance revealed significantly (p ≤ 0.005) greater peak and mean velocity scores from OB across all intensities, high intraclass correlation coefficients (ICC2,K = 0.790–0.998), low standard error of measurement (SEM2,K = 0.040–0.119 m·s−1), and coefficients of variation (CV = 2–4%) suggested consistency between devices. Positive (r = 0.491–0.949) Pearson correlations between averages and differences (between devices) in peak velocity, as well as associated Bland-Altman plots, showed greater differences occurred as the velocity increased, particularly at low-moderate intensity loads. OB consistently provides greater barbell velocity scores compared to TWA, and the differences between devices were more apparent as the peak velocity increased with low-to-moderate loads. Strength coaches and athletes may find better agreement between devices if the mean velocity scores are only considered.

Download Full-text

Test-Retest Reliability of the King-Devick Test in an Adolescent Population

Journal of Athletic Training ◽

10.4085/1062-6050-52.2.12 ◽

2017 ◽

Vol 52 (5) ◽

pp. 439-445 ◽

Cited By ~ 30

Author(s):

Tyler J. Oberlander ◽

Bernadette L. Olson ◽

Lee Weidauer

Keyword(s):

Intraclass Correlation ◽

Correlation Coefficients ◽

Cross Sectional Study ◽

Testing Time ◽

Cross Sectional ◽

Healthy Adolescent ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Adolescent Population ◽

Test Retest Reliability

Context: The King-Devick (KD) test is a screening tool designed to assess cognitive visual impairments, namely saccadic rhythm, postconcussion. Test-retest reliability of the KD in a healthy adolescent population has not yet been established. Objective: To investigate the overall test-retest reliability of the KD among a sample of healthy adolescents. Additionally, we sought to determine if sex and age influenced reliability. Design: Cross-sectional study. Setting: Secondary school. Patients or Other Participants: Sixty-eight healthy adolescents, 41 boys (age = 15.4 ± 1.9 years) and 27 girls (age = 15.4 ± 1.9 years). Main Outcome Measure(s): Participants completed the KD (version 1) at 3 testing sessions (days 1, 30, and 45) following standard instructions. We recorded total time to complete the reading of 3 cards for each participant during each testing session. Two-way random-effects intraclass correlation coefficients (ICCs) using single measurements repeated over time and repeatability coefficients were calculated. Linear mixed models were used to determine whether differences existed at each testing time and to examine whether changes that took place among visits were different by sex or age. Results: Adolescents who completed the KD demonstrated acceptable reliability (ICC = 0.81; 95% confidence interval = 0.73, 0.87); however, the repeatability coefficient was large (±8.76 seconds). The sample demonstrated improvements between visits 1 and 2 (mean ± standard error = 4.3 ± 0.5 seconds, P < .001) and between visits 2 and 3 (2.4 ± 0.5 seconds, P < .001) for a total improvement of 6.9 seconds over 3 tests. No significant visit-by-sex or visit-by-age interactions were observed. Conclusions: Despite the ICC being clinically acceptable, providers using the KD test for serial assessment of concussion in adolescents should be cautious in interpreting the results due to a large learning effect. Incorporating multiple measures can ensure accurate detection of sport concussion.

Download Full-text