Reliability of Lower Extremity Strength Measurements Using the Belt-Resisted Method

Johanne Desrosiers; François Prince; Annie Rochette; Michel Raîche

doi:10.1123/japa.6.4.317

Journal of Aging and Physical Activity ◽

10.1123/japa.6.4.317 ◽

1998 ◽

Vol 6 (4) ◽

pp. 317-326 ◽

Cited By ~ 14

Author(s):

Johanne Desrosiers ◽

François Prince ◽

Annie Rochette ◽

Michel Raîche

Keyword(s):

Lower Extremity ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Muscle Group ◽

Reliability Test ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Muscle Groups ◽

Test Retest Reliability

The objectives of this study were to standardize measurement procedures and study the test-retest and interrater reliability of the belt-resisted method for measuring the lower extremity isometric strength of three muscle groups. The strength of 33 healthy, elderly, community-dwelling subjects was evaluated with a hand-held dynamometer using the belt-resisted method. Isometric strength testing of three muscle groups (hip flexors, knee extensors, and ankle dorsiflexors) was performed on two separate occasions, 1 week apart, by the same tester to determine test-retest reliability. The test results of two different examiners testing on different days were used to determine interrater reliability. Test-retest reliability was higher than interrater reliability. Test-retest reliability coefficients of the three muscle groups were high (J9-.95). For interrater reliability, intraclass correlation coefficients varied from .64 to .92, depending on the muscle group and side. For the two kinds of reliability, intraclass correlation coefficients increased from proximal to distal. The method for the hip muscle group should be modified to increase reliability of the measure.

Association Between Markerless Motion Capture Screenings and Musculoskeletal Injury Risk for Military Trainees: A Large Cohort and Reliability Study

Orthopaedic Journal of Sports Medicine ◽

10.1177/23259671211041656 ◽

2021 ◽

Vol 9 (10) ◽

pp. 232596712110416

Author(s):

Ben R. Hando ◽

W. Casan Scott ◽

Jacob F. Bryant ◽

Juste N. Tchandja ◽

Ryan M. Scott ◽

...

Keyword(s):

Lower Extremity ◽

Motion Capture ◽

Injury Risk ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Musculoskeletal Injury ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Markerless Motion Capture ◽

Test Retest Reliability

Background: Markerless motion capture (MMC) systems used to screen for musculoskeletal injury (MSKI) risk have become popular in military and collegiate athletic settings. However, little is known regarding the test-retest reliability or, more importantly, the ability of these systems to accurately identify individuals at risk for MSKI. Purpose: To determine the association between scores from a proprietary MMC movement screen test and the likelihood of suffering a subsequent MSKI and establish the test-retest reliability of the MMC system used. Study Design: Cohort study; Level of evidence, 3. Methods: Trainees for the Air Force Special Warfare program underwent MMC screenings immediately before entering the 8-week training course. MSKI data were extracted from a database for the surveillance period for each trainee. Logistic regression analyses were performed to identify associations between baseline MMC scores and the likelihood of suffering any MSKI or, specifically, a lower extremity MSKI. The test-retest portion of the study collected MMC scores from 10 separate participants performing 4 trials of the standard test procedures. Reliability was assessed using intraclass correlation coefficients by a single rater. Results: Overall, 1570 trainees, of whom 800 (51%) suffered an MSKI, were included in the analysis. MMC scores poorly predicted the likelihood of any or a lower extremity MSKI (odds ratio, 1.01-1.02). Further, receiver operating characteristic curve analyses demonstrated poor sensitivity and specificity for prediction of MSKI with MMC scores (area under the curve = 0.53). Finally, intraclass correlation coefficients from the test-retest analysis of MMC scores ranged from 0.157 to 0.602. Conclusion: This MMC system displayed poor to moderate test-retest reliability and did not demonstrate the ability to discriminate between individuals who were and were not likely to suffer an MSKI.
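
The area under the ROC curve reported above can be read as the probability that a randomly chosen injured trainee has a higher screening score than a randomly chosen uninjured one, so a value of 0.53 is close to chance. The following is a minimal sketch of that rank-based calculation, not the authors' analysis; the function name `auc` and the example scores are hypothetical.

```python
def auc(scores, injured):
    """Rank-based AUC: chance that a random injured case outscores
    a random uninjured case (ties count as one half)."""
    pos = [s for s, y in zip(scores, injured) if y]
    neg = [s for s, y in zip(scores, injured) if not y]
    if not pos or not neg:
        raise ValueError("need at least one case in each group")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical screening scores; 1 marks a trainee who was later injured.
example_scores = [62, 71, 55, 80, 66, 59]
example_injured = [0, 1, 0, 1, 1, 0]
print(round(auc(example_scores, example_injured), 2))
```

An AUC of 0.5 means the score carries no discriminating information, and 1.0 means the two groups are perfectly separated.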

Psychometric Comparisons of 2 Versions of the Fugl-Meyer Motor Scale and 2 Versions of the Stroke Rehabilitation Assessment of Movement

Neurorehabilitation and Neural Repair ◽

10.1177/1545968308315999 ◽

2008 ◽

Vol 22 (6) ◽

pp. 737-744 ◽

Cited By ~ 64

Author(s):

I-Ping Hsueh ◽

Miao-Ju Hsu ◽

Ching-Fan Sheu ◽

Su Lee ◽

Ching-Lin Hsieh ◽

...

Keyword(s):

Stroke Rehabilitation ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Stroke Patients ◽

Chronic Patients ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Mild Stroke ◽

Test Retest Reliability ◽

Rehabilitation Assessment

Objective. To provide empirical justification for selecting motor scales for stroke patients, the authors compared the psychometric properties (validity, responsiveness, test-retest reliability, and smallest real difference [SRD]) of the Fugl-Meyer Motor Scale (FM), the simplified FM (S-FM), the Stroke Rehabilitation Assessment of Movement instrument (STREAM), and the simplified STREAM (S-STREAM). Methods. For the validity and responsiveness study, 50 inpatients were assessed with the FM and the STREAM at admission and discharge to a rehabilitation department. The scores of the S-FM and the S-STREAM were retrieved from their corresponding scales. For the test-retest reliability study, a therapist administered both scales on a different sample of 60 chronic patients on 2 occasions. Results. Only the S-STREAM had no notable floor or ceiling effects at admission and discharge. The 4 motor scales had good concurrent validity (rho ≥ .91) and satisfactory predictive validity (rho = .72-.77). The scales showed responsiveness (effect size d ≥ 0.34; standardized response mean ≥ 0.95; P < .0001), with the S-STREAM most responsive. The test-retest agreements of the scales were excellent (intraclass correlation coefficients ≥ .96). The SRD of the 4 scales was 10% of their corresponding highest score, indicating an acceptable level of measurement error. The upper extremity and the lower extremity subscales of the 4 scales showed similar results. Conclusions. The 4 motor scales showed acceptable levels of reliability, validity, and responsiveness in stroke patients. The S-STREAM is recommended because it is short, responsive to change, and able to discriminate patients with severe or mild stroke.

Assessing test–retest reliability of patient-reported outcome measures using intraclass correlation coefficients: recommendations for selecting and documenting the analytical formula

Quality of Life Research ◽

10.1007/s11136-018-2076-0 ◽

2018 ◽

Vol 28 (4) ◽

pp. 1029-1033 ◽

Cited By ~ 18

Author(s):

Shanshan Qin ◽

Lauren Nelson ◽

Lori McLeod ◽

Sonya Eremenco ◽

Stephen Joel Coons

Keyword(s):

Outcome Measures ◽

Intraclass Correlation ◽

Analytical Formula ◽

Correlation Coefficients ◽

Patient Reported Outcome Measures ◽

Patient Reported Outcome ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Patient Reported ◽

Test Retest Reliability

Reliability of Safe Maximum Lifting Determinations of a Functional Capacity Evaluation

Physical Therapy ◽

10.1093/ptj/82.4.364 ◽

2002 ◽

Vol 82 (4) ◽

pp. 364-371 ◽

Cited By ~ 73

Author(s):

Douglas P Gross ◽

Michele C Battié

Keyword(s):

Functional Capacity ◽

Repeated Measures ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Functional Capacity Evaluation ◽

Measurement Variability ◽

Retest Reliability ◽

Repeated Measures Design ◽

Test Retest Reliability

Abstract Background and Purpose. Functional capacity evaluations (FCEs) are measurement tools used in predicting readiness to return to work following injury. The interrater and test-retest reliability of determinations of maximal safe lifting during kinesiophysical FCEs were examined in a sample of people who were off work and receiving workers' compensation. Subjects. Twenty-eight subjects with low back pain who had plateaued with treatment were enrolled. Five occupational therapists, trained and experienced in kinesiophysical methods, conducted testing. Methods. A repeated-measures design was used, with raters testing subjects simultaneously, yet independently. Subjects were rated on 2 occasions, separated by 2 to 4 days. Analyses included intraclass correlation coefficients (ICCs) and 95% confidence intervals. Results. The ICC values for interrater reliability ranged from .95 to .98. Test-retest values ranged from .78 to .94. Discussion and Conclusion. Inconsistencies in subjects' performance across sessions were the greatest source of FCE measurement variability. Overall, however, test-retest reliability was good and interrater reliability was excellent.

Test-retest reliability and minimal detectable change of the Balance Evaluation Systems Test and its two abbreviated versions in persons with mild to moderate spinocerebellar ataxia: A pilot study

Neurorehabilitation ◽

10.3233/nre-203154 ◽

2020 ◽

Vol 47 (4) ◽

pp. 479-486

Author(s):

Yuki Kondo ◽

Kyota Bando ◽

Yosuke Ariake ◽

Wakana Katsuta ◽

Kyoko Todoroki ◽

...

Keyword(s):

Spinocerebellar Ataxia ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Minimal Detectable Change ◽

Detectable Change ◽

Retest Reliability ◽

Evaluation Systems ◽

Intraclass Correlation Coefficients ◽

Interclass Correlation ◽

Test Retest Reliability

BACKGROUND: The Balance Evaluation Systems Test (BESTest) and its two abbreviated versions have been confirmed as reliable measures of balance characteristics. However, they are not utilized in cases of spinocerebellar ataxia (SCA). OBJECTIVE: We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BESTest and its abbreviated versions in persons with mild to moderate spinocerebellar ataxia. METHODS: The BESTest was performed in 20 persons with SCA at baseline and one month later. The scores of the abbreviated versions of the BESTest were determined from the BESTest scores. The intraclass correlation coefficient (1,1) was used as a measure of relative reliability. Furthermore, we calculated the MDC in the BESTest and its abbreviated versions. RESULTS: The intraclass correlation coefficients (1,1) and MDC at 95% confidence intervals were 0.92, 8.7 (8.1%); 0.91, 4.1 (14.5%); and 0.81, 5.2 (21.6%) for the Balance, Mini-Balance, and Brief-Balance Evaluation Systems Tests, respectively. CONCLUSIONS: The BESTest and its abbreviated versions had high test-retest reliability. The MDC values of the BESTest could enable clinicians and researchers to interpret changes in the balance of patients with SCA more precisely.
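
As an illustration of the two statistics named above, the sketch below computes a one-way random-effects ICC(1,1) from its ANOVA mean squares and then derives an MDC at the 95% level. It assumes the commonly used conventions SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM; the abstract does not state which formulas the authors used, and the function names and data are hypothetical.

```python
import math
from statistics import mean, stdev


def icc_1_1(scores):
    """One-way random-effects ICC(1,1).

    scores: one row per subject, one column per test occasion.
    """
    n, k = len(scores), len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    # Between-subject and within-subject mean squares.
    ms_between = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(scores, row_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


def mdc95(scores):
    """MDC95 under the assumed SEM = SD * sqrt(1 - ICC) convention."""
    icc = icc_1_1(scores)
    sd = stdev(x for row in scores for x in row)  # SD of all scores
    sem = sd * math.sqrt(max(0.0, 1.0 - icc))
    return 1.96 * math.sqrt(2) * sem


# Hypothetical baseline and one-month scores for four participants.
example = [[80, 83], [65, 61], [92, 90], [70, 74]]
print(round(icc_1_1(example), 3), round(mdc95(example), 2))
```

A change smaller than the MDC cannot be distinguished from measurement error at the chosen confidence level, which is why the abstract reports it alongside the ICC.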

The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

International Journal of Otolaryngology ◽

10.1155/2016/2718482 ◽

2016 ◽

Vol 2016 ◽

pp. 1-8 ◽

Cited By ~ 18

Author(s):

Taher I. Omari ◽

Johanna Savilampi ◽

Karmen Kokkinn ◽

Mistyka Schar ◽

Kristin Lamvik ◽

...

Keyword(s):

High Resolution ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Interrater Agreement ◽

Swallowing Function ◽

Retest Reliability ◽

High Resolution Manometry ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability ◽

Swallow Function

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility.

Reliability of the ForceFrame With and Without a Fixed Upper-Limb Mold in Shoulder Rotation Strength Assessments Compared With Traditional Hand-Held Dynamometry

Journal of Sport Rehabilitation ◽

10.1123/jsr.2020-0434 ◽

2021 ◽

pp. 1-4

Author(s):

Jamon Couch ◽

Marc Sayers ◽

Tania Pizzari

Keyword(s):

Upper Limb ◽

External Rotation ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Shoulder Abduction ◽

Minimal Detectable Change ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Shoulder Strength ◽

Test Retest Reliability

Context: An imbalance between shoulder internal rotation (IR) and external rotation (ER) strength in athletes is proposed to increase the risk of sustaining a shoulder injury. Hand-held (HHD) and externally fixed dynamometry are reliable forms of assessing shoulder IR and ER strength. A new externally fixed device with an attachable fixed upper-limb mold (The ForceFrame) exists; however, its reliability in measuring shoulder strength is yet to be investigated. Objective: To determine the test–retest reliability of the ForceFrame, with and without the fixed upper-limb mold, in the assessment of shoulder IR and ER strength, as compared with HHD. Design: Test–retest reliability study. Setting: Laboratory, clinical. Participants: Twenty-two healthy and active individuals were recruited from the university community and a private physiotherapy practice. Main Outcome Measures: Maximal isometric shoulder IR and ER strength was measured using the ForceFrame and traditional HHD in neutral and at 90° shoulder abduction. Mean (SD) strength measures were calculated. Test–retest reliability was analyzed using intraclass correlation coefficients (3, 1). The SEM and minimal detectable change were calculated. Results: Good to excellent test–retest reliability was found for all shoulder strength tests across hand-held dynamometry (HHD) and externally fixed dynamometry (EFD) (intraclass correlation coefficients [3, 1] = .854–.916). The minimal detectable changes ranged between 25.61 and 41.84 N across tests. Test–retest reliability was not affected by the dynamometer or testing position. Conclusions: The results from this study indicate that both the ForceFrame and HHD are suitable for measuring shoulder strength in clinical practice. The use of the fixed upper-limb mold with the ForceFrame does not improve reliability.
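
The ICC(3,1) named above is the two-way mixed-effects, consistency, single-measurement form. The sketch below shows one way to compute it from two-way ANOVA mean squares. It is an illustration under that standard definition, not the authors' code; the function name and the force values are hypothetical.

```python
from statistics import mean


def icc_3_1(scores):
    """Two-way mixed, consistency, single-measurement ICC(3,1).

    scores: one row per participant, one column per session.
    """
    n, k = len(scores), len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]
    # Mean square for participants (rows).
    ms_rows = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    # Residual mean square after removing row and column effects.
    ss_error = sum(
        (scores[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n)
        for j in range(k)
    )
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical isometric forces (N) for four participants, two sessions.
example = [[120.0, 126.0], [95.0, 99.0], [150.0, 149.0], [110.0, 118.0]]
print(round(icc_3_1(example), 3))
```

Because this form removes the session (column) effect before computing the error term, a uniform shift between sessions does not lower the coefficient. That makes it a measure of consistency rather than absolute agreement.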

Test–Retest Reliability of a New Device Versus a Long-Arm Goniometer to Evaluate Knee Proprioception

Journal of Sport Rehabilitation ◽

10.1123/jsr.2021-0146 ◽

2021 ◽

pp. 1-6

Author(s):

Fei Tian ◽

Yaqi Zhao ◽

Jixin Li ◽

Wenjin Wang ◽

Danni Wu ◽

...

Keyword(s):

Standard Error ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Joint Position Sense ◽

Repeated Measurements ◽

Good Reliability ◽

Retest Reliability ◽

New Device ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability

Context: Many methods used to evaluate knee proprioception have shortcomings that limit their use in clinical settings. Based on an inexpensive 3D camera, a new portable device was recently used to evaluate the joint position sense (JPS) of the knee joint. However, the test–retest reliability of the new method remains unclear. This study aimed to evaluate the test–retest reliability of the new device and a long-arm goniometer for assessing knee JPS, and to compare the variability of the 2 methods. Design: Prospective observational study of the test–retest reliability of knee JPS measurements. Methods: Twenty-one healthy adults were tested in 2 sessions with a 1-week interval. Three target knee flexion angles (30°, 45°, and 60°) were reproduced in each session. Target and reproduced angles were measured with both methods. Intraclass correlation coefficients, standard error of the measurement, and Bland–Altman plots were used to quantify test–retest reliability. Paired t tests were used to compare knee JPS (absolute error of the target-reproduced angle) between the methods. Results: The new device (good to excellent intraclass correlation coefficients .74–.80; standard error of the measurement 0.52°–0.61°) demonstrated better test–retest reliability than the goniometer (poor to fair intraclass correlation coefficients .23–.43; standard error of the measurement 0.89°–2.07°) and better test–retest agreement (respective mean differences for the 30°, 45°, and 60° knee angles: 0.11°, 0.13°, and 0.41° for the new system; 0.84°, 1.52°, and 1.18° for the goniometer). The measurements (absolute errors of the target-reproduced angles) with the goniometer were significantly greater than those with the new device (P < .05); the SDs of repeated measurements with the goniometer (1.50°–2.41°) were greater than with the new device (1.08°–1.38°). Conclusions: Given that the new device has good reliability and sufficient precision, it is the better alternative for evaluating knee JPS. Goniometers should be used with caution to assess knee JPS.
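
The Bland–Altman analysis mentioned above summarizes test–retest agreement as a mean difference (bias) between sessions plus 95% limits of agreement. A minimal sketch follows, assuming the usual limits of bias ± 1.96 × SD of the paired differences; the function name and the angle values are hypothetical.

```python
from statistics import mean, stdev


def bland_altman(session1, session2):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(session1) != len(session2):
        raise ValueError("sessions must have the same number of measurements")
    diffs = [a - b for a, b in zip(session1, session2)]
    bias = mean(diffs)        # systematic difference between sessions
    spread = 1.96 * stdev(diffs)  # half-width of the limits of agreement
    return bias, bias - spread, bias + spread


# Hypothetical absolute JPS errors (degrees) from two sessions.
week1 = [2.1, 3.4, 1.8, 2.9, 4.0]
week2 = [2.4, 3.0, 2.2, 2.5, 3.7]
bias, lower, upper = bland_altman(week1, week2)
print(round(bias, 2), round(lower, 2), round(upper, 2))
```

A bias near zero with narrow limits indicates good agreement. The plot itself charts each pair's difference against its mean, with the three values above drawn as horizontal lines.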

Reliability of the Function in Sitting Test (FIST)

Rehabilitation Research and Practice ◽

10.1155/2014/593280 ◽

2014 ◽

Vol 2014 ◽

pp. 1-6 ◽

Cited By ~ 4

Author(s):

Sharon L. Gorman ◽

Monica Rivera ◽

Lise McCarthy

Keyword(s):

Interrater Reliability ◽

Physical Therapists ◽

Physical Therapist ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Inpatient Rehabilitation ◽

Neurological Dysfunction ◽

Retest Reliability ◽

Reliability Characteristics ◽

Test Retest Reliability

The function in sitting test (FIST) is a newly developed, performance-based measure examining deficits in seated postural control. The FIST has been shown to be internally consistent and valid in persons with neurological dysfunction, but intra- and interrater reliability and test-retest reliability have not been previously described. Seven patients with chronic neurologic dysfunction were tested and videotaped performing the FIST on two consecutive days. Seventeen acute care and inpatient rehabilitation physical therapist raters scored six of the videotaped performances of the FIST on two occasions at least 2 weeks apart. Intraclass correlation coefficients were used to calculate the test-retest and intra- and interrater reliability of the FIST. An ICC of 0.97 (95% CI 0.847–0.995) indicated excellent test-retest reliability of the FIST. Intra- and interrater reliability was also excellent with ICCs of 0.99 (95% CI 0.994–0.997) and 0.99 (95% CI 0.988–0.994), respectively. Physical therapists and other rehabilitation professionals can confidently use the FIST in a variety of clinical practice and research settings due to its favorable reliability characteristics. More studies are needed to describe the responsiveness and minimal clinically important level of change in FIST scores to further enhance clinical usefulness of this measure.

Test-Retest Reliability of the King-Devick Test in an Adolescent Population

Journal of Athletic Training ◽

10.4085/1062-6050-52.2.12 ◽

2017 ◽

Vol 52 (5) ◽

pp. 439-445 ◽

Cited By ~ 30

Author(s):

Tyler J. Oberlander ◽

Bernadette L. Olson ◽

Lee Weidauer

Keyword(s):

Intraclass Correlation ◽

Correlation Coefficients ◽

Cross Sectional Study ◽

Testing Time ◽

Cross Sectional ◽

Healthy Adolescent ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Adolescent Population ◽

Test Retest Reliability

Context: The King-Devick (KD) test is a screening tool designed to assess cognitive visual impairments, namely saccadic rhythm, postconcussion. Test-retest reliability of the KD in a healthy adolescent population has not yet been established. Objective: To investigate the overall test-retest reliability of the KD among a sample of healthy adolescents. Additionally, we sought to determine if sex and age influenced reliability. Design: Cross-sectional study. Setting: Secondary school. Patients or Other Participants: Sixty-eight healthy adolescents, 41 boys (age = 15.4 ± 1.9 years) and 27 girls (age = 15.4 ± 1.9 years). Main Outcome Measure(s): Participants completed the KD (version 1) at 3 testing sessions (days 1, 30, and 45) following standard instructions. We recorded total time to complete the reading of 3 cards for each participant during each testing session. Two-way random-effects intraclass correlation coefficients (ICCs) using single measurements repeated over time and repeatability coefficients were calculated. Linear mixed models were used to determine whether differences existed at each testing time and to examine whether changes that took place among visits were different by sex or age. Results: Adolescents who completed the KD demonstrated acceptable reliability (ICC = 0.81; 95% confidence interval = 0.73, 0.87); however, the repeatability coefficient was large (±8.76 seconds). The sample demonstrated improvements between visits 1 and 2 (mean ± standard error = 4.3 ± 0.5 seconds, P < .001) and between visits 2 and 3 (2.4 ± 0.5 seconds, P < .001) for a total improvement of 6.9 seconds over 3 tests. No significant visit-by-sex or visit-by-age interactions were observed. Conclusions: Despite the ICC being clinically acceptable, providers using the KD test for serial assessment of concussion in adolescents should be cautious in interpreting the results due to a large learning effect. Incorporating multiple measures can ensure accurate detection of sport concussion.
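
A repeatability coefficient like the one reported above is often defined as 1.96 × sqrt(2) × the within-subject standard deviation, which is the assumption made here since the abstract does not give its formula. The sketch below computes it, along with the mean change between consecutive visits as a rough indicator of a learning effect. The function names and timing data are hypothetical.

```python
import math
from statistics import mean


def repeatability_coefficient(times):
    """1.96 * sqrt(2) * within-subject SD (assumed definition).

    times: one row per participant, one column per visit.
    """
    n, k = len(times), len(times[0])
    # Pooled within-subject variance: squared deviations from each
    # participant's own mean, divided by n * (k - 1) degrees of freedom.
    ss_within = sum((x - mean(row)) ** 2 for row in times for x in row)
    within_sd = math.sqrt(ss_within / (n * (k - 1)))
    return 1.96 * math.sqrt(2) * within_sd


def mean_change_between_visits(times):
    """Mean change from each visit to the next (negative = faster)."""
    k = len(times[0])
    visit_means = [mean(row[j] for row in times) for j in range(k)]
    return [visit_means[j + 1] - visit_means[j] for j in range(k - 1)]


# Hypothetical completion times (seconds) over three visits.
example = [[52.0, 48.5, 46.0], [60.1, 55.0, 53.2], [45.3, 42.0, 40.1]]
print(round(repeatability_coefficient(example), 2))
print([round(c, 2) for c in mean_change_between_visits(example)])
```

A systematic improvement across visits inflates the within-subject variance. This is one way a measure can show a high ICC and still have a wide repeatability coefficient.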
