Reliability of Lower Extremity Strength Measurements Using the Belt-Resisted Method

1998 ◽  
Vol 6 (4) ◽  
pp. 317-326 ◽  
Author(s):  
Johanne Desrosiers ◽  
François Prince ◽  
Annie Rochette ◽  
Michel Raîche

The objectives of this study were to standardize measurement procedures and study the test-retest and interrater reliability of the belt-resisted method for measuring the lower extremity isometric strength of three muscle groups. The strength of 33 healthy, elderly, community-dwelling subjects was evaluated with a hand-held dynamometer using the belt-resisted method. Isometric strength testing of three muscle groups (hip flexors, knee extensors, and ankle dorsiflexors) was performed on two separate occasions, 1 week apart, by the same tester to determine test-retest reliability. The test results of two different examiners testing on different days were used to determine interrater reliability. Test-retest reliability was higher than interrater reliability. Test-retest reliability coefficients of the three muscle groups were high (J9-.95). For interrater reliability, intraclass correlation coefficients varied from .64 to .92, depending on the muscle group and side. For the two kinds of reliability, intraclass correlation coefficients increased from proximal to distal. The method for the hip muscle group should be modified to increase reliability of the measure.

2021 ◽  
Vol 9 (10) ◽  
pp. 232596712110416
Author(s):  
Ben R. Hando ◽  
W. Casan Scott ◽  
Jacob F. Bryant ◽  
Juste N. Tchandja ◽  
Ryan M. Scott ◽  
...  

Background: Markerless motion capture (MMC) systems used to screen for musculoskeletal injury (MSKI) risk have become popular in military and collegiate athletic settings. However, little is known regarding the test-retest reliability or, more importantly, the ability of these systems to accurately identify individuals at risk for MSKI. Purpose: To determine the association between scores from a proprietary MMC movement screen test and the likelihood of suffering a subsequent MSKI and establish the test-retest reliability of the MMC system used. Study Design: Cohort study; Level of evidence, 3. Methods: Trainees for the Air Force Special Warfare program underwent MMC screenings immediately before entering the 8-week training course. MSKI data were extracted from a database for the surveillance period for each trainee. Logistic regression analyses were performed to identify associations between baseline MMC scores and the likelihood of suffering any MSKI or, specifically, a lower extremity MSKI. The test-retest portion of the study collected MMC scores from 10 separate participants performing 4 trials of the standard test procedures. Reliability was assessed using intraclass correlation coefficients by a single rater. Results: Overall, 1570 trainees, of whom 800 (51%) suffered an MSKI, were included in the analysis. MMC scores poorly predicted the likelihood of any or a lower extremity MSKI (odds ratio, 1.01-1.02). Further, receiver operating characteristic curve analyses demonstrated poor sensitivity and specificity for prediction of MSKI with MMC scores (area under the curve = 0.53). Finally, intraclass correlation coefficients from the test-retest analysis of MMC scores ranged from 0.157 to 0.602. Conclusion: This MMC system displayed poor to moderate test-retest reliability and did not demonstrate the ability to discriminate between individuals who were and were not likely to suffer an MSKI.
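An area under the ROC curve of 0.53 means the screen ranks an injured trainee above an uninjured one only slightly more often than chance would. The sketch below illustrates that reading using the rank-based (Mann-Whitney) formulation of the AUC. It is not the study's analysis code: the function name `auc` and all scores are hypothetical.

```python
def auc(scores_injured, scores_uninjured):
    """Probability that a randomly chosen injured case has a higher score
    than a randomly chosen uninjured case (ties count as half a win).
    This equals the area under the ROC curve."""
    wins = 0.0
    for p in scores_injured:
        for q in scores_uninjured:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(scores_injured) * len(scores_uninjured))


# Hypothetical screening scores, for illustration only.
perfect = auc([3, 4], [1, 2])   # every injured case outranks every uninjured case
chance = auc([1, 2], [1, 2])    # identical distributions, no discrimination
print(perfect, chance)
```

An AUC of 1.0 would indicate perfect discrimination and 0.5 none, so the reported 0.53 sits close to the no-discrimination end of the scale.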


2008 ◽  
Vol 22 (6) ◽  
pp. 737-744 ◽  
Author(s):  
I-Ping Hsueh ◽  
Miao-Ju Hsu ◽  
Ching-Fan Sheu ◽  
Su Lee ◽  
Ching-Lin Hsieh ◽  
...  

Objective. To provide empirical justification for selecting motor scales for stroke patients, the authors compared the psychometric properties (validity, responsiveness, test-retest reliability, and smallest real difference [SRD]) of the Fugl-Meyer Motor Scale (FM), the simplified FM (S-FM), the Stroke Rehabilitation Assessment of Movement instrument (STREAM), and the simplified STREAM (S-STREAM). Methods. For the validity and responsiveness study, 50 inpatients were assessed with the FM and the STREAM at admission and discharge to a rehabilitation department. The scores of the S-FM and the S-STREAM were retrieved from their corresponding scales. For the test-retest reliability study, a therapist administered both scales on a different sample of 60 chronic patients on 2 occasions. Results. Only the S-STREAM had no notable floor or ceiling effects at admission and discharge. The 4 motor scales had good concurrent validity (rho ≥ .91) and satisfactory predictive validity (rho = .72-.77). The scales showed responsiveness (effect size d ≥ 0.34; standardized response mean ≥ 0.95; P < .0001), with the S-STREAM most responsive. The test-retest agreements of the scales were excellent (intraclass correlation coefficients ≥ .96). The SRD of the 4 scales was 10% of their corresponding highest score, indicating acceptable level of measurement error. The upper extremity and the lower extremity subscales of the 4 showed similar results. Conclusions. The 4 motor scales showed acceptable levels of reliability, validity, and responsiveness in stroke patients. The S-STREAM is recommended because it is short, responsive to change, and able to discriminate patients with severe or mild stroke.


2002 ◽  
Vol 82 (4) ◽  
pp. 364-371 ◽  
Author(s):  
Douglas P Gross ◽  
Michele C Battié

Background and Purpose. Functional capacity evaluations (FCEs) are measurement tools used in predicting readiness to return to work following injury. The interrater and test-retest reliability of determinations of maximal safe lifting during kinesiophysical FCEs were examined in a sample of people who were off work and receiving workers' compensation. Subjects. Twenty-eight subjects with low back pain who had plateaued with treatment were enrolled. Five occupational therapists, trained and experienced in kinesiophysical methods, conducted testing. Methods. A repeated-measures design was used, with raters testing subjects simultaneously, yet independently. Subjects were rated on 2 occasions, separated by 2 to 4 days. Analyses included intraclass correlation coefficients (ICCs) and 95% confidence intervals. Results. The ICC values for interrater reliability ranged from .95 to .98. Test-retest values ranged from .78 to .94. Discussion and Conclusion. Inconsistencies in subjects' performance across sessions were the greatest source of FCE measurement variability. Overall, however, test-retest reliability was good and interrater reliability was excellent.


2020 ◽  
Vol 47 (4) ◽  
pp. 479-486
Author(s):  
Yuki Kondo ◽  
Kyota Bando ◽  
Yosuke Ariake ◽  
Wakana Katsuta ◽  
Kyoko Todoroki ◽  
...  

BACKGROUND: The Balance Evaluation Systems Test (BESTest) and its two abbreviated versions have been confirmed to be reliable for evaluating balance characteristics. However, they have not been utilized in cases of spinocerebellar ataxia (SCA). OBJECTIVE: We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BESTest and its abbreviated versions in persons with mild to moderate spinocerebellar ataxia. METHODS: The BESTest was performed in 20 persons with SCA at baseline and one month later. The scores of the abbreviated versions of the BESTest were determined from the BESTest scores. The intraclass correlation coefficient (1,1) was used as a measure of relative reliability. Furthermore, we calculated the MDC in the BESTest and its abbreviated versions. RESULTS: The intraclass correlation coefficients (1,1) and MDC at 95% confidence intervals were 0.92, 8.7 (8.1%), 0.91, 4.1 (14.5%), and 0.81, 5.2 (21.6%) for the Balance, Mini-Balance, and Brief-Balance Evaluation Systems Tests, respectively. CONCLUSIONS: The BESTest and its abbreviated versions had high test-retest reliability. The MDC values of the BESTest could enable clinicians and researchers to interpret changes in the balance of patients with SCA more precisely.
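To make the two statistics in this abstract concrete, the following minimal sketch computes a one-way random-effects, single-measure ICC(1,1) and an MDC at the 95% level. The abstract does not state how the MDC was derived, so the conventional formulas SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM are assumed here. The function names and all numbers are hypothetical.

```python
from math import sqrt
from statistics import mean


def icc_1_1(scores):
    """One-way random-effects, single-measure ICC(1,1).

    scores: one list per subject, each holding k repeated measurements.
    """
    n = len(scores)
    k = len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    # Between-subjects mean square.
    msb = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    # Within-subjects mean square.
    msw = sum(
        (x - m) ** 2 for row, m in zip(scores, row_means) for x in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


def mdc95(sd, icc):
    """Minimal detectable change at 95% confidence (assumed conventional formula)."""
    sem = sd * sqrt(1 - icc)  # standard error of measurement
    return 1.96 * sqrt(2) * sem


# Hypothetical test-retest scores for three subjects (baseline, retest).
example = [[1, 2], [3, 4], [5, 6]]
print(round(icc_1_1(example), 3))
print(round(mdc95(10.0, 0.75), 2))
```

Under these assumptions, a higher ICC shrinks the SEM and therefore the MDC, which is why the full BESTest (ICC 0.92) can have the smallest MDC as a percentage of its scale.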


2016 ◽  
Vol 2016 ◽  
pp. 1-8 ◽  
Author(s):  
Taher I. Omari ◽  
Johanna Savilampi ◽  
Karmen Kokkinn ◽  
Mistyka Schar ◽  
Kristin Lamvik ◽  
...  

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility.


2021 ◽  
pp. 1-4
Author(s):  
Jamon Couch ◽  
Marc Sayers ◽  
Tania Pizzari

Context: An imbalance between shoulder internal rotation (IR) and external rotation (ER) strength in athletes is proposed to increase the risk of sustaining a shoulder injury. Hand-held (HHD) and externally fixed dynamometry are reliable forms of assessing shoulder IR and ER strength. A new externally fixed device with an attachable fixed upper-limb mold (The ForceFrame) exists; however, its reliability in measuring shoulder strength is yet to be investigated. Objective: To determine the test–retest reliability of the ForceFrame, with and without the fixed upper-limb mold, in the assessment of shoulder IR and ER strength, as compared with HHD. Design: Test–retest reliability study. Setting: Laboratory, clinical. Participants: Twenty-two healthy and active individuals were recruited from the university community and a private physiotherapy practice. Main Outcome Measures: Maximal isometric shoulder IR and ER strength was measured using the ForceFrame and traditional HHD in neutral and at 90° shoulder abduction. Mean (SD) strength measures were calculated. Test–retest reliability was analyzed using intraclass correlation coefficients (3, 1). The SEM and minimal detectable change were calculated. Results: Good to excellent test–retest reliability was found for all shoulder strength tests across hand-held dynamometry (HHD) and externally fixed dynamometry (EFD) (intraclass correlation coefficients [3, 1] = .854–.916). The minimal detectable changes ranged between 25.61 and 41.84 N across tests. Test–retest reliability was not affected by the dynamometer or testing position. Conclusions: The results from this study indicate that both the ForceFrame and HHD are suitable for measuring shoulder strength in clinical practice. The use of the fixed upper-limb mold with the ForceFrame does not improve reliability.


2021 ◽  
pp. 1-6
Author(s):  
Fei Tian ◽  
Yaqi Zhao ◽  
Jixin Li ◽  
Wenjin Wang ◽  
Danni Wu ◽  
...  

Context: Many methods used to evaluate knee proprioception have shortcomings that limit their use in clinical settings. Based on an inexpensive 3D camera, a new portable device was recently used to evaluate the joint position sense (JPS) of the knee joint. However, the test–retest reliability of the new method remains unclear. This study aimed to evaluate the test–retest reliability of the new device and a long-arm goniometer for assessing knee JPS, and to compare the variability of the 2 methods. Design: Prospective observational study of the test–retest reliability of knee JPS measurements. Methods: Twenty-one healthy adults were tested in 2 sessions with a 1-week interval. Three target knee flexion angles (30°, 45°, and 60°) were reproduced in each session. Target and reproduced angles were measured with both methods. Intraclass correlation coefficients, standard error of the measurement, and Bland–Altman plots were used to quantify test–retest reliability. Paired t tests were used to compare knee JPS (absolute error of the target-reproduced angle) between the methods. Results: The new device (good to excellent intraclass correlation coefficients .74–.80; standard error of the measurement 0.52°–0.61°) demonstrated better test–retest reliability than the goniometer (poor to fair intraclass correlation coefficients .23–.43; standard error of the measurement 0.89°–2.07°) and better test–retest agreement (respective mean differences for the 30°, 45°, and 60° knee angles: 0.11°, 0.13°, and 0.41° for the new system; 0.84°, 1.52°, and 1.18° for the goniometer). The measurements (absolute errors of the target-reproduced angles) with the goniometer were significantly greater than those with the new device (P < .05); the SDs of repeated measurements with the goniometer (1.50°–2.41°) were greater than with the new device (1.08°–1.38°). Conclusions: Given that the new device has good reliability and sufficient precision, it is the better alternative for evaluating knee JPS. 
Goniometers should be used with caution to assess knee JPS.
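The "mean differences" reported for test–retest agreement are the bias term of a Bland–Altman analysis. A minimal sketch of that calculation follows, assuming the usual 95% limits of agreement (bias ± 1.96 × SD of the session differences). The function name and the angle values are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(session1, session2):
    """Return (bias, lower limit, upper limit) for paired repeated measurements.

    bias is the mean of the session2 - session1 differences; the limits of
    agreement are bias +/- 1.96 times the sample SD of those differences.
    """
    diffs = [b - a for a, b in zip(session1, session2)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical reproduced knee angles (degrees) for a 30-degree target.
week1 = [30.0, 31.0, 29.0, 30.5]
week2 = [30.5, 31.0, 30.0, 30.5]
bias, lower, upper = bland_altman(week1, week2)
print(bias, lower, upper)
```

A bias near zero with narrow limits indicates that the two sessions agree closely. A small bias alone is not sufficient, because large positive and negative differences can cancel out.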


2014 ◽  
Vol 2014 ◽  
pp. 1-6 ◽  
Author(s):  
Sharon L. Gorman ◽  
Monica Rivera ◽  
Lise McCarthy

The function in sitting test (FIST) is a newly developed, performance-based measure examining deficits in seated postural control. The FIST has been shown to be internally consistent and valid in persons with neurological dysfunction but intra- and interrater reliability and test-retest reliability have not been previously described. Seven patients with chronic neurologic dysfunction were tested and videotaped performing the FIST on two consecutive days. Seventeen acute care and inpatient rehabilitation physical therapist raters scored six of the videotaped performances of the FIST on two occasions at least 2 weeks apart. Intraclass correlation coefficients were used to calculate the test-retest and intra- and interrater reliability of the FIST. ICC of 0.97 (95% CI 0.847–0.995) indicated excellent test-retest reliability of the FIST. Intra- and interrater reliability was also excellent with ICCs of 0.99 (95% CI 0.994–0.997) and 0.99 (95% CI 0.988–0.994), respectively. Physical therapists and other rehabilitation professionals can confidently use the FIST in a variety of clinical practice and research settings due to its favorable reliability characteristics. More studies are needed to describe the responsiveness and minimal clinically important level of change in FIST scores to further enhance clinical usefulness of this measure.


2017 ◽  
Vol 52 (5) ◽  
pp. 439-445 ◽  
Author(s):  
Tyler J. Oberlander ◽  
Bernadette L. Olson ◽  
Lee Weidauer

Context: The King-Devick (KD) test is a screening tool designed to assess cognitive visual impairments, namely saccadic rhythm, postconcussion. Test-retest reliability of the KD in a healthy adolescent population has not yet been established. Objective: To investigate the overall test-retest reliability of the KD among a sample of healthy adolescents. Additionally, we sought to determine if sex and age influenced reliability. Design: Cross-sectional study. Setting: Secondary school. Patients or Other Participants: Sixty-eight healthy adolescents, 41 boys (age = 15.4 ± 1.9 years) and 27 girls (age = 15.4 ± 1.9 years). Main Outcome Measure(s): Participants completed the KD (version 1) at 3 testing sessions (days 1, 30, and 45) following standard instructions. We recorded total time to complete the reading of 3 cards for each participant during each testing session. Two-way random-effects intraclass correlation coefficients (ICCs) using single measurements repeated over time and repeatability coefficients were calculated. Linear mixed models were used to determine whether differences existed at each testing time and to examine whether changes that took place among visits were different by sex or age. Results: Adolescents who completed the KD demonstrated acceptable reliability (ICC = 0.81; 95% confidence interval = 0.73, 0.87); however, the repeatability coefficient was large (±8.76 seconds). The sample demonstrated improvements between visits 1 and 2 (mean ± standard error = 4.3 ± 0.5 seconds, P < .001) and between visits 2 and 3 (2.4 ± 0.5 seconds, P < .001) for a total improvement of 6.9 seconds over 3 tests. No significant visit-by-sex or visit-by-age interactions were observed. Conclusions: Despite the ICC being clinically acceptable, providers using the KD test for serial assessment of concussion in adolescents should be cautious in interpreting the results due to a large learning effect. Incorporating multiple measures can ensure accurate detection of sport concussion.
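The sketch below illustrates how a learning effect interacts with the choice of ICC model. A two-way random-effects, single-measure coefficient (commonly labelled ICC(2,1), assumed here to correspond to the model named in the abstract) penalizes a systematic shift between sessions, whereas the consistency form ICC(3,1) ignores it. The function name and the completion times are hypothetical.

```python
from statistics import mean


def two_way_iccs(scores):
    """Return (ICC(2,1), ICC(3,1)) from a subjects-by-sessions table.

    scores: one list per subject, each holding k session values.
    """
    n = len(scores)
    k = len(scores[0])
    grand = mean(x for row in scores for x in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # sessions
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))

    # Absolute agreement: session-to-session shifts count as error.
    icc_2_1 = (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
    # Consistency: session-to-session shifts are ignored.
    icc_3_1 = (msr - mse) / (msr + (k - 1) * mse)
    return icc_2_1, icc_3_1


# Hypothetical completion times (seconds): every subject is exactly
# 4 seconds faster at the second visit, i.e. a pure learning effect.
times = [[50, 46], [55, 51], [60, 56], [65, 61]]
agreement, consistency = two_way_iccs(times)
print(round(agreement, 3), round(consistency, 3))
```

In this constructed example, subjects keep exactly the same rank order, so the consistency coefficient is perfect while the absolute-agreement coefficient is lower. The abstract's caution follows the same logic: a reasonable ICC can coexist with a practice effect large enough to matter for serial testing.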

