The Reliability of a Smartphone Goniometer Application Compared With a Traditional Goniometer for Measuring Ankle Joint Range of Motion

Background: Evaluation of range of motion (ROM) is integral to assessment of the musculoskeletal system, is required in health fitness and pathologic conditions, and is used as an objective outcome measure. Several methods are described to check ROM, each with advantages and disadvantages. Hence, this study introduces a new device using a smartphone goniometer to measure ankle joint ROM. Objective: To test the reliability of smartphone goniometry in the ankle joint by comparing it with the universal goniometer (UG) and to assess interrater and intrarater reliability for the smartphone goniometer record (SGR) application. Methods: Fifty-eight healthy volunteers (29 men and 29 women aged 18–30 years) underwent SGR and UG measurement of ankle joint dorsiflexion and plantarflexion. Two examiners measured ankle joint ROM. Descriptive statistics were calculated for descriptive and anthropometric variables, as were intraclass correlation coefficients (ICCs). Results: There were 58 usable data sets. For measuring ankle dorsiflexion ROM, both instruments showed excellent interrater reliability: UG (ICC = 0.87) and SGR (ICC = 0.89). Intrarater reliability was excellent in both instruments in ankle dorsiflexion: UG and SGR (mean ICC = 0.91). For measuring ankle plantarflexion, both instruments showed excellent interrater reliability: UG (ICC = 0.76) and SGR (ICC = 0.82). Intrarater reliability was excellent in both instruments in ankle plantarflexion: UG (mean ICC = 0.85) and SGR (mean ICC = 0.82). Conclusions: Smartphone-based goniometers can be used to assess active ROM of the ankle joint because they can achieve a high degree of intrarater and interrater reliability.

Download Full-text

Assessment of the Intrarater and Interrater Reliability of an Established Clinical Task Analysis Methodology

Anesthesiology ◽

10.1097/00000542-200205000-00016 ◽

2002 ◽

Vol 96 (5) ◽

pp. 1129-1139 ◽

Cited By ~ 46

Author(s):

Jason Slagle ◽

Matthew B. Weinger ◽

My-Than T. Dinh ◽

Vanessa V. Brumer ◽

Kevin Williams

Keyword(s):

Real Time ◽

Task Analysis ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Intrarater Reliability ◽

Intraclass Correlation Coefficients ◽

Percent Time ◽

Analysis Methodology ◽

And Task

Background Task analysis may be useful for assessing how anesthesiologists alter their behavior in response to different clinical situations. In this study, the authors examined the intraobserver and interobserver reliability of an established task analysis methodology. Methods During 20 routine anesthetic procedures, a trained observer sat in the operating room and categorized in real-time the anesthetist's activities into 38 task categories. Two weeks later, the same observer performed task analysis from videotapes obtained intraoperatively. A different observer performed task analysis from the videotapes on two separate occasions. Data were analyzed for percent of time spent on each task category, average task duration, and number of task occurrences. Rater reliability and agreement were assessed using intraclass correlation coefficients. Results Intrarater reliability was generally good for categorization of percent time on task and task occurrence (mean intraclass correlation coefficients of 0.84-0.97). There was a comparably high concordance between real-time and video analyses. Interrater reliability was generally good for percent time and task occurrence measurements. However, the interrater reliability of the task duration metric was unsatisfactory, primarily because of the technique used to capture multitasking. Conclusions A task analysis technique used in anesthesia research for several decades showed good intrarater reliability. Off-line analysis of videotapes is a viable alternative to real-time data collection. Acceptable interrater reliability requires the use of strict task definitions, sophisticated software, and rigorous observer training. New techniques must be developed to more accurately capture multitasking. Substantial effort is required to conduct task analyses that will have sufficient reliability for purposes of research or clinical evaluation.

Download Full-text

Objective and Subjective Clinical Swallowing Outcomes via Telehealth: Reliability in Outpatient Clinical Practice

American Journal of Speech-Language Pathology ◽

10.1044/2020_ajslp-20-00234 ◽

2021 ◽

pp. 1-11

Author(s):

James C. Borders ◽

Jordanna S. Sevitz ◽

Jaime Bauer Malandraki ◽

Georgia A. Malandraki ◽

Michelle S. Troche

Keyword(s):

Clinical Practice ◽

Interrater Reliability ◽

Video Quality ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Oral Intake ◽

Caregiver Training ◽

Intrarater Reliability ◽

Intraclass Correlation Coefficients ◽

Remote Patient

Purpose The COVID-19 pandemic has drastically increased the use of telehealth. Prior studies of telehealth clinical swallowing evaluations provide positive evidence for telemanagement of swallowing. However, the reliability of these measures in clinical practice, as opposed to well-controlled research conditions, remains unknown. This study aimed to investigate the reliability of outcome measures derived from clinical swallowing tele-evaluations in real-world clinical practice (e.g., variability in devices and Internet connectivity, lack of in-person clinician assistance, or remote patient/caregiver training). Method Seven raters asynchronously judged clinical swallowing tele-evaluations of 12 movement disorders patients. Outcomes included the Timed Water Swallow Test (TWST), Test of Masticating and Swallowing Solids (TOMASS), and common observations of oral intake. Statistical analyses were performed to examine inter- and intrarater reliability, as well as qualitative analyses exploring patient and clinician-specific factors impacting reliability. Results Forty-four trials were included for reliability analyses. All rater dyads demonstrated “good” to “excellent” interrater reliability for measures of the TWST (intraclass correlation coefficients [ICCs] ≥ .93) and observations of oral intake (≥ 77% agreement). The majority of TOMASS outcomes demonstrated “good” to “excellent” interrater reliability (ICCs ≥ .84), with the exception of the number of bites (ICCs = .43–.99) and swallows (ICCs = .21–.85). Immediate and delayed intrarater reliability were “excellent” for most raters across all tasks, ranging between ICCs of .63 and 1.00. Exploratory factors potentially impacting reliability included infrequent instances of suboptimal video quality, reduced camera stability, camera distance, and obstruction of the patient's mouth during tasks. Conclusions Subjective observations of oral intake and objective measures taken from the TWST and the TOMASS can be reliably measured via telehealth in clinical practice. Our results provide support for the feasibility and reliability of telehealth for outpatient clinical swallowing evaluations during COVID-19 and beyond. Supplemental Material https://doi.org/10.23641/asha.13661378

Download Full-text

Functional Index-3: A Valid and Reliable Functional Outcome Assessment Measure in Patients With Dermatomyositis and Polymyositis

The Journal of Rheumatology ◽

10.3899/jrheum.191374 ◽

2020 ◽

Vol 48 (1) ◽

pp. 94-100 ◽

Cited By ~ 2

Author(s):

Floranne C. Ernste ◽

Christopher Chong ◽

Cynthia S. Crowson ◽

Tanaz A. Kermani ◽

Orla Ni Mhuircheartaigh ◽

...

Keyword(s):

Construct Validity ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Health Assessment ◽

Correlation Coefficients ◽

Measurement Properties ◽

Muscle Endurance ◽

Spearman Correlation ◽

Intrarater Reliability ◽

Functional Index

Objective.Patients with dermatomyositis (DM) and polymyositis (PM) have reduced muscle endurance.The aim of this study was to streamline the Functional Index-2 (FI-2) by developing the Functional Index-3 (FI-3) and to evaluate its measurement properties, content and construct validity, and intra- and interrater reliability.Methods.A dataset of the previously performed and validated FI-2 (n = 63) was analyzed for internal redundancy, floor, and ceiling effects. The content of the FI-2 was revised into the FI-3. Construct validity and intrarater reliability of FI-3 were tested on 43 DM and PM patients at 2 rheumatology centers. Interrater reliability was tested in 25 patients. The construct validity was compared with the Myositis Activities Profile (MAP), Health Assessment Questionnaire (HAQ), and Borg CR-10 using Spearman correlation coefficient.Results.Spearman correlation coefficients of 63 patients performing FI-3 revealed moderate to high correlations between shoulder flexion and hip flexion tasks and similar correlations with MAP and HAQ scores; there were lower correlations for neck flexion task. All FI-3 tasks had very low to moderate correlations with the Borg scale. Intraclass correlation coefficients (ICC) of FI-3 tasks for intrarater reliability (n = 25) were moderate to good (0.88–0.98). ICC of FI-3 tasks for interrater reliability (n = 17) were fair to good (range 0.83–0.96).Conclusion.The FI-3 is an efficient and valid method for clinically assessing muscle endurance in DM and PM patients. FI-3 construct validity is supported by the significant correlations between functional tasks and the MAP, HAQ, and Borg CR-10 scores.

Download Full-text

Interrater and Intrarater Reliability of the Tuck Jump Assessment by Health Professionals of Varied Educational Backgrounds

Journal of Sports Medicine ◽

10.1155/2013/483503 ◽

2013 ◽

Vol 2013 ◽

pp. 1-5 ◽

Cited By ~ 9

Author(s):

Lisa A. Dudley ◽

Craig A. Smith ◽

Brandon K. Olson ◽

Nicole J. Chimera ◽

Brian Schmitz ◽

...

Keyword(s):

Health Professionals ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Clinical Implementation ◽

Intrarater Reliability ◽

Study Objective ◽

Intraclass Correlation Coefficients ◽

Educational Backgrounds ◽

And Training

Objective. The Tuck Jump Assessment (TJA), a clinical plyometric assessment, identifies 10 jumping and landing technique flaws. The study objective was to investigate TJA interrater and intrarater reliability with raters of different educational and clinical backgrounds.Methods. 40 participants were video recorded performing the TJA using published protocol and instructions. Five raters of varied educational and clinical backgrounds scored the TJA. Each score of the 10 technique flaws was summed for the total TJA score. Approximately one month later, 3 raters scored the videos again. Intraclass correlation coefficients determined interrater (5 and 3 raters for first and second session, resp.) and intrarater (3 raters) reliability.Results. Interrater reliability with 5 raters was poor (ICC = 0.47; 95% confidence intervals (CI) 0.33–0.62). Interrater reliability between 3 raters who completed 2 scoring sessions improved from 0.52 (95% CI 0.35–0.68) for session one to 0.69 (95% CI 0.55–0.81) for session two. Intrarater reliability was poor to moderate, ranging from 0.44 (95% CI 0.22–0.68) to 0.72 (95% CI 0.55–0.84).Conclusion. Published protocol and training of raters were insufficient to allow consistent TJA scoring. There may be a learned effect with the TJA since interrater reliability improved with repetition. TJA instructions and training should be modified and enhanced before clinical implementation.

Download Full-text

Reliability of Measurement of Maximal Isometric Lateral Trunk-Flexion Strength in Athletes Using Handheld Dynamometry

Journal of Sport Rehabilitation ◽

10.1123/jsr.2012.tr6 ◽

2012 ◽

Vol 21 (4) ◽

Cited By ~ 2

Author(s):

Bram L. Newman ◽

Courtney L. Pollock ◽

Michael A. Hunt

Keyword(s):

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Trunk Flexion ◽

Intrarater Reliability ◽

Force Output ◽

Test Occasion ◽

Maximum Effort ◽

And Function ◽

Flexion Strength

Context: Lateral trunk-flexion strength is an important determinant of overall trunk stability and function, but the reliability in measuring this outcome clinically in athletic individuals is not known. Objective: To determine the interrater and intrarater reliability of lateral trunk-flexion strength measurement in athletic individuals using handheld dynamometry. Design: Reliability study. Setting: Research laboratory. Participants: 12 healthy, athletic individuals. Intervention: Lateral trunk-flexion strength was measured using handheld dynamometry across 2 different trunk placements (lateral aspect of the axilla and laterally at the level of the midtrunk) and 2 testing occasions by 2 therapists. Three maximum-effort trials during a "make test" at each placement were completed for each therapist on both occasions. Main Outcome Measures: Maximum force output was identified and converted to a torque. Intraclass correlation coefficients (ICC2,1) were calculated for each dynamometer placement, therapist, and test occasion to determine intrarater and interrater reliability. Results: Intrarater reliability was moderate to good (ICC2,1 = .53-.77), while interrater reliability was good to very good (ICC2,1 = .79-81) at the axilla position. For the midtrunk position, intrarater reliability was good to very good (ICC2,1 = .80-.86), while interrater reliability was good on both days (ICC2,1 = .87-.88). Finally, the standard errors of measurement were low for the axilla position (0.20 Nm/kg; 95% CI .15, .28) and midtrunk position (0.09 Nm/kg; 95% CI .07, .12). Conclusions: Maximum lateral trunk-flexion strength can be reliably measured in athletic individuals with greater overall strength. Based on the 2 positions used in this study, measurement with a dynamometer placement at the midtrunk may be more reliable than that obtained at the axilla.

Download Full-text

Intrarater and Interrater Reliability of a Passive Shoulder Flexion Range of Motion Measurement for Latissimus Dorsi Flexibility in Young Competitive Swimmers

Journal of Sport Rehabilitation ◽

10.1123/jsr.2019-0294 ◽

2020 ◽

Vol 29 (6) ◽

pp. 855-858 ◽

Cited By ~ 1

Author(s):

Stef Feijen ◽

Angela Tate ◽

Kevin Kuppens ◽

Thomas Struyf ◽

Anke Claes ◽

...

Keyword(s):

Range Of Motion ◽

Latissimus Dorsi ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Rest Period ◽

Correlation Coefficients ◽

Competitive Swimming ◽

Motion Measurement ◽

Shoulder Flexion ◽

Competitive Swimmers

Context: The latissimus dorsi plays a major role in generating the propulsive force during swimming. In addition, stiffness of this muscle may result in altered stroke biomechanics and predispose swimmers to shoulder pain. Measuring the flexibility of the latissimus dorsi can be of interest to reduce injury. However, the reliability of such measurement has not yet been investigated in competitive swimmers. Objective: To assess the within-session intrarater and interrater reliability of a passive shoulder flexion range of motion measurement for latissimus dorsi flexibility in competitive swimmers. Design: Within-session intrarater and interrater reliability. Setting: Competitive swimming clubs in Flanders, Belgium. Participants: Twenty-six competitive swimmers (15.46 [2.98] y; 16 men and 10 women). Intervention: Each rater performed 2 alternating (eg, left-right-left-right) measurements of passive shoulder flexion range of motion twice, with a 30-second rest period in between. Main Outcome Measures: The intraclass correlation coefficients were calculated to assess intrarater and interrater reliability. Results: Interrater intraclass correlation coefficient ranged from .54 (95% confidence interval [CI], −.16 to .81) to .57 (95% CI, −.24 to .85). Results for the intrarater reliability ranged from .91 (95% CI, .81 to .96) to .94 (95% CI, .87 to .97). Conclusion: Results of this study suggest that shoulder flexion range of motion in young competitive swimmers can be measured reliably by a single rater within the same session.

Download Full-text

Sagittal Plane Range of Motion of the Pediatric Ankle Joint

Journal of the American Podiatric Medical Association ◽

10.7547/0960418 ◽

2006 ◽

Vol 96 (5) ◽

pp. 418-422 ◽

Cited By ~ 11

Author(s):

Angela M. Evans ◽

Sheila D. Scutter

Keyword(s):

Range Of Motion ◽

Ankle Joint ◽

Interrater Reliability ◽

Sagittal Plane ◽

Ankle Dorsiflexion ◽

Healthy Individuals

Measurement of ankle dorsiflexion is a routine part of the podiatric examination of children, yet the reliability of this measure is largely unknown in healthy individuals. This study assessed the intrarater and interrater reliability of the first and second resistance levels of sagittal ankle range of motion in 4- to 6-year-old children. The results show that measures of ankle dorsiflexion in children are highly variable among examiners, and, in general, gastrocnemius range of motion is more reliable than soleal range of motion. (J Am Podiatr Med Assoc 96(5): 418–422, 2006)

Download Full-text

Interobserver Reliability Using the Phonetic Level Evaluation With Severely and Profoundly Hearing-Impaired Children

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3405.989 ◽

1991 ◽

Vol 34 (5) ◽

pp. 989-999 ◽

Cited By ~ 6

Author(s):

Stephanie Shaw ◽

Truman E. Coggins

Keyword(s):

Interrater Reliability ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Hearing Impaired ◽

Intraclass Correlation Coefficients ◽

Assessment Measure ◽

Impaired Children ◽

Speech Assessment ◽

Hearing Impaired Children

This study examines whether observers reliably categorize selected speech production behaviors in hearing-impaired children. A group of experienced speech-language pathologists was trained to score the elicited imitations of 5 profoundly and 5 severely hearing-impaired subjects using the Phonetic Level Evaluation (Ling, 1976). Interrater reliability was calculated using intraclass correlation coefficients. Overall, the magnitude of the coefficients was found to be considerably below what would be accepted in published behavioral research. Failure to obtain acceptably high levels of reliability suggests that the Phonetic Level Evaluation may not yet be an accurate and objective speech assessment measure for hearing-impaired children.

Download Full-text

Development and Initial Validation of a Project-Based Rubric to Assess the Systems-Based Practice Competency of Residents in the Clinical Chemistry Rotation of a Pathology Residency

Archives of Pathology & Laboratory Medicine ◽

10.5858/arpa.2013-0046-oa ◽

2014 ◽

Vol 138 (6) ◽

pp. 809-813

Author(s):

Carolyn R. Vitek ◽

Jane C. Dale ◽

Henry A. Homburger ◽

Sandra C. Bryant ◽

Amy K. Saenger ◽

...

Keyword(s):

Critical Thinking ◽

Interrater Reliability ◽

Clinical Chemistry ◽

Core Competencies ◽

Intraclass Correlation ◽

Reliability And Validity ◽

Correlation Coefficients ◽

Thinking Skills ◽

Project Evaluation ◽

Critical Thinking Skills

Context.— Systems-based practice (SBP) is 1 of 6 core competencies required in all resident training programs accredited by the Accreditation Council for Graduate Medical Education. Reliable methods of assessing resident competency in SBP have not been described in the medical literature. Objective.— To develop and validate an analytic grading rubric to assess pathology residents' analyses of SBP problems in clinical chemistry. Design.— Residents were assigned an SBP project based upon unmet clinical needs in the clinical chemistry laboratories. Using an iterative method, we created an analytic grading rubric based on critical thinking principles. Four faculty raters used the SBP project evaluation rubric to independently grade 11 residents' projects during their clinical chemistry rotations. Interrater reliability and Cronbach α were calculated to determine the reliability and validity of the rubric. Project mean scores and range were also assessed to determine whether the rubric differentiated resident critical thinking skills related to the SBP projects. Results.— Overall project scores ranged from 6.56 to 16.50 out of a possible 20 points. Cronbach α ranged from 0.91 to 0.96, indicating that the 4 rubric categories were internally consistent without significant overlap. Intraclass correlation coefficients ranged from 0.63 to 0.81, indicating moderate to strong interrater reliability. Conclusions.— We report development and statistical analysis of a novel SBP project evaluation rubric. The results indicate the rubric can be used to reliably assess pathology residents' critical thinking skills in SBP.

Download Full-text

Development of a Model for the Acquisition and Assessment of Advanced Laparoscopic Suturing Skills Using an Automated Device

Surgical Innovation ◽

10.1177/1553350618764221 ◽

2018 ◽

Vol 25 (3) ◽

pp. 286-290 ◽

Cited By ~ 2

Author(s):

Elif Bilgic ◽

Madoka Takao ◽

Pepa Kaneva ◽

Satoshi Endo ◽

Toshitatsu Takao ◽

...

Keyword(s):

Laparoscopic Surgery ◽

Interrater Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Instructional Video ◽

Validity Evidence ◽

Laparoscopic Suturing ◽

Intraclass Correlation Coefficients ◽

Operative Assessment ◽

Suturing Skills

Background. Needs assessment identified a gap regarding laparoscopic suturing skills targeted in simulation. This study collected validity evidence for an advanced laparoscopic suturing task using an Endo StitchTM device. Methods. Experienced (ES) and novice surgeons (NS) performed continuous suturing after watching an instructional video. Scores were based on time and accuracy, and Global Operative Assessment of Laparoscopic Surgery. Data are shown as medians [25th-75th percentiles] (ES vs NS). Interrater reliability was calculated using intraclass correlation coefficients (confidence interval). Results. Seventeen participants were enrolled. Experienced surgeons had significantly greater task (980 [964-999] vs 666 [391-711], P = .0035) and Global Operative Assessment of Laparoscopic Surgery scores (25 [24-25] vs 14 [12-17], P = .0029). Interrater reliability for time and accuracy were 1.0 and 0.9 (0.74-0.96), respectively. All experienced surgeons agreed that the task was relevant to practice. Conclusion. This study provides validity evidence for the task as a measure of laparoscopic suturing skill using an automated suturing device. It could help trainees acquire the skills they need to better prepare for clinical learning.

Download Full-text