The Reliability of a Smartphone Goniometer Application Compared With a Traditional Goniometer for Measuring Ankle Joint Range of Motion

2019 ◽  
Vol 109 (1) ◽  
pp. 22-29 ◽  
Author(s):  
Motaz Abdalla Alawna ◽  
Bayram H. Unver ◽  
Ertugrul O. Yuksel

Background: Evaluation of range of motion (ROM) is integral to assessment of the musculoskeletal system, is required in health fitness and pathologic conditions, and is used as an objective outcome measure. Several methods are described to check ROM, each with advantages and disadvantages. Hence, this study introduces a new device using a smartphone goniometer to measure ankle joint ROM. Objective: To test the reliability of smartphone goniometry in the ankle joint by comparing it with the universal goniometer (UG) and to assess interrater and intrarater reliability for the smartphone goniometer record (SGR) application. Methods: Fifty-eight healthy volunteers (29 men and 29 women aged 18–30 years) underwent SGR and UG measurement of ankle joint dorsiflexion and plantarflexion. Two examiners measured ankle joint ROM. Descriptive statistics were calculated for descriptive and anthropometric variables, as were intraclass correlation coefficients (ICCs). Results: There were 58 usable data sets. For measuring ankle dorsiflexion ROM, both instruments showed excellent interrater reliability: UG (ICC = 0.87) and SGR (ICC = 0.89). Intrarater reliability was excellent in both instruments in ankle dorsiflexion: UG and SGR (mean ICC = 0.91). For measuring ankle plantarflexion, both instruments showed excellent interrater reliability: UG (ICC = 0.76) and SGR (ICC = 0.82). Intrarater reliability was excellent in both instruments in ankle plantarflexion: UG (mean ICC = 0.85) and SGR (mean ICC = 0.82). Conclusions: Smartphone-based goniometers can be used to assess active ROM of the ankle joint because they can achieve a high degree of intrarater and interrater reliability.

2002 ◽  
Vol 96 (5) ◽  
pp. 1129-1139 ◽  
Author(s):  
Jason Slagle ◽  
Matthew B. Weinger ◽  
My-Than T. Dinh ◽  
Vanessa V. Brumer ◽  
Kevin Williams

Background Task analysis may be useful for assessing how anesthesiologists alter their behavior in response to different clinical situations. In this study, the authors examined the intraobserver and interobserver reliability of an established task analysis methodology. Methods During 20 routine anesthetic procedures, a trained observer sat in the operating room and categorized in real-time the anesthetist's activities into 38 task categories. Two weeks later, the same observer performed task analysis from videotapes obtained intraoperatively. A different observer performed task analysis from the videotapes on two separate occasions. Data were analyzed for percent of time spent on each task category, average task duration, and number of task occurrences. Rater reliability and agreement were assessed using intraclass correlation coefficients. Results Intrarater reliability was generally good for categorization of percent time on task and task occurrence (mean intraclass correlation coefficients of 0.84-0.97). There was a comparably high concordance between real-time and video analyses. Interrater reliability was generally good for percent time and task occurrence measurements. However, the interrater reliability of the task duration metric was unsatisfactory, primarily because of the technique used to capture multitasking. Conclusions A task analysis technique used in anesthesia research for several decades showed good intrarater reliability. Off-line analysis of videotapes is a viable alternative to real-time data collection. Acceptable interrater reliability requires the use of strict task definitions, sophisticated software, and rigorous observer training. New techniques must be developed to more accurately capture multitasking. Substantial effort is required to conduct task analyses that will have sufficient reliability for purposes of research or clinical evaluation.


Author(s):  
James C. Borders ◽  
Jordanna S. Sevitz ◽  
Jaime Bauer Malandraki ◽  
Georgia A. Malandraki ◽  
Michelle S. Troche

Purpose The COVID-19 pandemic has drastically increased the use of telehealth. Prior studies of telehealth clinical swallowing evaluations provide positive evidence for telemanagement of swallowing. However, the reliability of these measures in clinical practice, as opposed to well-controlled research conditions, remains unknown. This study aimed to investigate the reliability of outcome measures derived from clinical swallowing tele-evaluations in real-world clinical practice (e.g., variability in devices and Internet connectivity, lack of in-person clinician assistance, or remote patient/caregiver training). Method Seven raters asynchronously judged clinical swallowing tele-evaluations of 12 movement disorders patients. Outcomes included the Timed Water Swallow Test (TWST), Test of Masticating and Swallowing Solids (TOMASS), and common observations of oral intake. Statistical analyses were performed to examine inter- and intrarater reliability, as well as qualitative analyses exploring patient and clinician-specific factors impacting reliability. Results Forty-four trials were included for reliability analyses. All rater dyads demonstrated “good” to “excellent” interrater reliability for measures of the TWST (intraclass correlation coefficients [ICCs] ≥ .93) and observations of oral intake (≥ 77% agreement). The majority of TOMASS outcomes demonstrated “good” to “excellent” interrater reliability (ICCs ≥ .84), with the exception of the number of bites (ICCs = .43–.99) and swallows (ICCs = .21–.85). Immediate and delayed intrarater reliability were “excellent” for most raters across all tasks, ranging between ICCs of .63 and 1.00. Exploratory factors potentially impacting reliability included infrequent instances of suboptimal video quality, reduced camera stability, camera distance, and obstruction of the patient's mouth during tasks. Conclusions Subjective observations of oral intake and objective measures taken from the TWST and the TOMASS can be reliably measured via telehealth in clinical practice. Our results provide support for the feasibility and reliability of telehealth for outpatient clinical swallowing evaluations during COVID-19 and beyond. Supplemental Material https://doi.org/10.23641/asha.13661378


2020 ◽  
Vol 48 (1) ◽  
pp. 94-100 ◽  
Author(s):  
Floranne C. Ernste ◽  
Christopher Chong ◽  
Cynthia S. Crowson ◽  
Tanaz A. Kermani ◽  
Orla Ni Mhuircheartaigh ◽  
...  

Objective. Patients with dermatomyositis (DM) and polymyositis (PM) have reduced muscle endurance. The aim of this study was to streamline the Functional Index-2 (FI-2) by developing the Functional Index-3 (FI-3) and to evaluate its measurement properties, content and construct validity, and intra- and interrater reliability. Methods. A dataset of the previously performed and validated FI-2 (n = 63) was analyzed for internal redundancy, floor, and ceiling effects. The content of the FI-2 was revised into the FI-3. Construct validity and intrarater reliability of FI-3 were tested on 43 DM and PM patients at 2 rheumatology centers. Interrater reliability was tested in 25 patients. The construct validity was compared with the Myositis Activities Profile (MAP), Health Assessment Questionnaire (HAQ), and Borg CR-10 using Spearman correlation coefficient. Results. Spearman correlation coefficients of 63 patients performing FI-3 revealed moderate to high correlations between shoulder flexion and hip flexion tasks and similar correlations with MAP and HAQ scores; there were lower correlations for neck flexion task. All FI-3 tasks had very low to moderate correlations with the Borg scale. Intraclass correlation coefficients (ICC) of FI-3 tasks for intrarater reliability (n = 25) were moderate to good (0.88–0.98). ICC of FI-3 tasks for interrater reliability (n = 17) were fair to good (range 0.83–0.96). Conclusion. The FI-3 is an efficient and valid method for clinically assessing muscle endurance in DM and PM patients. FI-3 construct validity is supported by the significant correlations between functional tasks and the MAP, HAQ, and Borg CR-10 scores.


2013 ◽  
Vol 2013 ◽  
pp. 1-5 ◽  
Author(s):  
Lisa A. Dudley ◽  
Craig A. Smith ◽  
Brandon K. Olson ◽  
Nicole J. Chimera ◽  
Brian Schmitz ◽  
...  

Objective. The Tuck Jump Assessment (TJA), a clinical plyometric assessment, identifies 10 jumping and landing technique flaws. The study objective was to investigate TJA interrater and intrarater reliability with raters of different educational and clinical backgrounds. Methods. 40 participants were video recorded performing the TJA using published protocol and instructions. Five raters of varied educational and clinical backgrounds scored the TJA. Each score of the 10 technique flaws was summed for the total TJA score. Approximately one month later, 3 raters scored the videos again. Intraclass correlation coefficients determined interrater (5 and 3 raters for first and second session, resp.) and intrarater (3 raters) reliability. Results. Interrater reliability with 5 raters was poor (ICC = 0.47; 95% confidence intervals (CI) 0.33–0.62). Interrater reliability between 3 raters who completed 2 scoring sessions improved from 0.52 (95% CI 0.35–0.68) for session one to 0.69 (95% CI 0.55–0.81) for session two. Intrarater reliability was poor to moderate, ranging from 0.44 (95% CI 0.22–0.68) to 0.72 (95% CI 0.55–0.84). Conclusion. Published protocol and training of raters were insufficient to allow consistent TJA scoring. There may be a learned effect with the TJA since interrater reliability improved with repetition. TJA instructions and training should be modified and enhanced before clinical implementation.


2012 ◽  
Vol 21 (4) ◽  
Author(s):  
Bram L. Newman ◽  
Courtney L. Pollock ◽  
Michael A. Hunt

Context: Lateral trunk-flexion strength is an important determinant of overall trunk stability and function, but the reliability in measuring this outcome clinically in athletic individuals is not known. Objective: To determine the interrater and intrarater reliability of lateral trunk-flexion strength measurement in athletic individuals using handheld dynamometry. Design: Reliability study. Setting: Research laboratory. Participants: 12 healthy, athletic individuals. Intervention: Lateral trunk-flexion strength was measured using handheld dynamometry across 2 different trunk placements (lateral aspect of the axilla and laterally at the level of the midtrunk) and 2 testing occasions by 2 therapists. Three maximum-effort trials during a "make test" at each placement were completed for each therapist on both occasions. Main Outcome Measures: Maximum force output was identified and converted to a torque. Intraclass correlation coefficients (ICC2,1) were calculated for each dynamometer placement, therapist, and test occasion to determine intrarater and interrater reliability. Results: Intrarater reliability was moderate to good (ICC2,1 = .53-.77), while interrater reliability was good to very good (ICC2,1 = .79-.81) at the axilla position. For the midtrunk position, intrarater reliability was good to very good (ICC2,1 = .80-.86), while interrater reliability was good on both days (ICC2,1 = .87-.88). Finally, the standard errors of measurement were low for the axilla position (0.20 Nm/kg; 95% CI .15, .28) and midtrunk position (0.09 Nm/kg; 95% CI .07, .12). Conclusions: Maximum lateral trunk-flexion strength can be reliably measured in athletic individuals with greater overall strength. Based on the 2 positions used in this study, measurement with a dynamometer placement at the midtrunk may be more reliable than that obtained at the axilla.
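For readers unfamiliar with the ICC2,1 notation used in the abstract above, the following is a minimal sketch of how that coefficient is commonly computed under the Shrout and Fleiss two-way random-effects, single-rater, absolute-agreement model. It is not the authors' analysis code, which the abstract does not describe; the function name `icc_2_1` and the ratings table are illustrative assumptions, with the table taken from the standard textbook example (6 subjects, 4 raters) rather than from any study listed here.

```python
def icc_2_1(scores):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `scores` is a list of rows, one per subject, each holding one rating
    per rater (no missing values). Hypothetical helper for illustration.
    """
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)

    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Sums of squares from a two-way ANOVA without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)               # between-subjects mean square
    msc = ss_cols / (k - 1)               # between-raters mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square

    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Illustrative ratings only: 6 subjects rated by 4 raters.
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(ratings), 2))  # 0.29
```

Because the between-raters mean square appears in the denominator, systematic offsets between raters lower ICC(2,1), which is why it is read as a measure of absolute agreement rather than mere consistency.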


2020 ◽  
Vol 29 (6) ◽  
pp. 855-858 ◽  
Author(s):  
Stef Feijen ◽  
Angela Tate ◽  
Kevin Kuppens ◽  
Thomas Struyf ◽  
Anke Claes ◽  
...  

Context: The latissimus dorsi plays a major role in generating the propulsive force during swimming. In addition, stiffness of this muscle may result in altered stroke biomechanics and predispose swimmers to shoulder pain. Measuring the flexibility of the latissimus dorsi can be of interest to reduce injury. However, the reliability of such measurement has not yet been investigated in competitive swimmers. Objective: To assess the within-session intrarater and interrater reliability of a passive shoulder flexion range of motion measurement for latissimus dorsi flexibility in competitive swimmers. Design: Within-session intrarater and interrater reliability. Setting: Competitive swimming clubs in Flanders, Belgium. Participants: Twenty-six competitive swimmers (15.46 [2.98] y; 16 men and 10 women). Intervention: Each rater performed 2 alternating (eg, left-right-left-right) measurements of passive shoulder flexion range of motion twice, with a 30-second rest period in between. Main Outcome Measures: The intraclass correlation coefficients were calculated to assess intrarater and interrater reliability. Results: Interrater intraclass correlation coefficient ranged from .54 (95% confidence interval [CI], −.16 to .81) to .57 (95% CI, −.24 to .85). Results for the intrarater reliability ranged from .91 (95% CI, .81 to .96) to .94 (95% CI, .87 to .97). Conclusion: Results of this study suggest that shoulder flexion range of motion in young competitive swimmers can be measured reliably by a single rater within the same session.


2006 ◽  
Vol 96 (5) ◽  
pp. 418-422 ◽  
Author(s):  
Angela M. Evans ◽  
Sheila D. Scutter

Measurement of ankle dorsiflexion is a routine part of the podiatric examination of children, yet the reliability of this measure is largely unknown in healthy individuals. This study assessed the intrarater and interrater reliability of the first and second resistance levels of sagittal ankle range of motion in 4- to 6-year-old children. The results show that measures of ankle dorsiflexion in children are highly variable among examiners, and, in general, gastrocnemius range of motion is more reliable than soleal range of motion. (J Am Podiatr Med Assoc 96(5): 418–422, 2006)


1991 ◽  
Vol 34 (5) ◽  
pp. 989-999 ◽  
Author(s):  
Stephanie Shaw ◽  
Truman E. Coggins

This study examines whether observers reliably categorize selected speech production behaviors in hearing-impaired children. A group of experienced speech-language pathologists was trained to score the elicited imitations of 5 profoundly and 5 severely hearing-impaired subjects using the Phonetic Level Evaluation (Ling, 1976). Interrater reliability was calculated using intraclass correlation coefficients. Overall, the magnitude of the coefficients was found to be considerably below what would be accepted in published behavioral research. Failure to obtain acceptably high levels of reliability suggests that the Phonetic Level Evaluation may not yet be an accurate and objective speech assessment measure for hearing-impaired children.


2014 ◽  
Vol 138 (6) ◽  
pp. 809-813
Author(s):  
Carolyn R. Vitek ◽  
Jane C. Dale ◽  
Henry A. Homburger ◽  
Sandra C. Bryant ◽  
Amy K. Saenger ◽  
...  

Context.— Systems-based practice (SBP) is 1 of 6 core competencies required in all resident training programs accredited by the Accreditation Council for Graduate Medical Education. Reliable methods of assessing resident competency in SBP have not been described in the medical literature. Objective.— To develop and validate an analytic grading rubric to assess pathology residents' analyses of SBP problems in clinical chemistry. Design.— Residents were assigned an SBP project based upon unmet clinical needs in the clinical chemistry laboratories. Using an iterative method, we created an analytic grading rubric based on critical thinking principles. Four faculty raters used the SBP project evaluation rubric to independently grade 11 residents' projects during their clinical chemistry rotations. Interrater reliability and Cronbach α were calculated to determine the reliability and validity of the rubric. Project mean scores and range were also assessed to determine whether the rubric differentiated resident critical thinking skills related to the SBP projects. Results.— Overall project scores ranged from 6.56 to 16.50 out of a possible 20 points. Cronbach α ranged from 0.91 to 0.96, indicating that the 4 rubric categories were internally consistent without significant overlap. Intraclass correlation coefficients ranged from 0.63 to 0.81, indicating moderate to strong interrater reliability. Conclusions.— We report development and statistical analysis of a novel SBP project evaluation rubric. The results indicate the rubric can be used to reliably assess pathology residents' critical thinking skills in SBP.
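As a rough illustration of the internal-consistency statistic reported in the abstract above, the sketch below computes Cronbach α from a table of scores using the usual variance-ratio formula. It is an assumption-laden example rather than the authors' procedure: the function name `cronbach_alpha` and the small score table are hypothetical, with rows standing in for graded projects and columns for rubric categories.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of scores.

    `scores` is a list of rows (one per graded project), each holding one
    score per item (here, one per rubric category). Hypothetical helper.
    """
    k = len(scores[0])  # number of items
    # Sample variance of each item across projects.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the per-project total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Made-up scores: 3 projects, 2 rubric categories.
example = [
    [2, 3],
    [4, 4],
    [6, 5],
]
print(round(cronbach_alpha(example), 3))  # 0.889
```

Values near 1 arise when the items rise and fall together across projects, so the variance of the totals is much larger than the sum of the individual item variances.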


2018 ◽  
Vol 25 (3) ◽  
pp. 286-290 ◽  
Author(s):  
Elif Bilgic ◽  
Madoka Takao ◽  
Pepa Kaneva ◽  
Satoshi Endo ◽  
Toshitatsu Takao ◽  
...  

Background. Needs assessment identified a gap regarding laparoscopic suturing skills targeted in simulation. This study collected validity evidence for an advanced laparoscopic suturing task using an Endo StitchTM device. Methods. Experienced (ES) and novice surgeons (NS) performed continuous suturing after watching an instructional video. Scores were based on time and accuracy, and Global Operative Assessment of Laparoscopic Surgery. Data are shown as medians [25th-75th percentiles] (ES vs NS). Interrater reliability was calculated using intraclass correlation coefficients (confidence interval). Results. Seventeen participants were enrolled. Experienced surgeons had significantly greater task (980 [964-999] vs 666 [391-711], P = .0035) and Global Operative Assessment of Laparoscopic Surgery scores (25 [24-25] vs 14 [12-17], P = .0029). Interrater reliability for time and accuracy were 1.0 and 0.9 (0.74-0.96), respectively. All experienced surgeons agreed that the task was relevant to practice. Conclusion. This study provides validity evidence for the task as a measure of laparoscopic suturing skill using an automated suturing device. It could help trainees acquire the skills they need to better prepare for clinical learning.

