scholarly journals Assessment of Operator Reliability in Ultrasound of the Median and Ulnar Nerve Using Bland-Altman Analysis

Diagnostics ◽  
2021 ◽  
Vol 11 (11) ◽  
pp. 2134
Author(s):  
Jörg Philipps ◽  
Hannah Mork ◽  
Maria Katz ◽  
Mark Knaup ◽  
Kira Beyer ◽  
...  

Currently, there is no standardized method to evaluate operator reliability in nerve ultrasound. A short prospective protocol using Bland–Altman analysis was developed to assess the level of agreement between operators with different expertise levels. A control rater without experience in nerve ultrasound, three novices after two months of training, an experienced rater with two years of experience, and a reference rater performed blinded ultrasound examinations of the left median and ulnar nerve in 42 nerve sites in healthy volunteers. The precision of Bland–Altman agreement analysis was tested using the Preiss–Fisher procedure. Intraclass correlation coefficients (ICC), coefficients of variation, and Bland–Altman limits of agreement were calculated. The sample size calculation and Preiss–Fisher procedure showed a sufficient precision of Bland–Altman agreement analysis. Limits of agreement of all trained novices ranged from 2.0 to 2.9 mm2 and were within the test’s maximum tolerated difference. Ninety-five percent confidence intervals of limits of agreement revealed a higher precision in the experienced rater’s measurements. Operator reliability in nerve ultrasound of the median and ulnar nerve arm nerves can be evaluated with a short prospective controlled protocol using Bland–Altman statistics, allowing a clear distinction between an untrained rater, trained novices after two months of training, and an experienced rater.

2006 ◽  
Vol 86 (1) ◽  
pp. 66-76 ◽  
Author(s):  
Geert Verheyden ◽  
Godelieve Nuyens ◽  
Alice Nieuwboer ◽  
Pol Van Asch ◽  
Piet Ketelaer ◽  
...  

Abstract Background and Purpose. Standardized scales are a prerequisite for rehabilitation and research. This study was designed to determine the reliability and validity of scores on items of the trunk assessment of the Melsbroek Disability Scoring Test (MDST) and Trunk Impairment Scale (TIS) in people with multiple sclerosis (MS). Subjects. Thirty people with MS participated in the study. Methods. Interrater and test-retest reliability and construct validity were assessed. Results. Kappa and weighted kappa values for the items of the trunk assessment of the MDST ranged from .74 to .95, and the kappa and weighted kappa values for the TIS items ranged from .46 to 1.00. Intraclass correlation coefficients for interrater and test-retest agreement were .93 and .92, respectively, for the trunk assessment of the MDST and .97 and .95, respectively, for the TIS. Bland-Altman analysis showed consistency of scores without observer bias. Construct validity was established. Discussion and Conclusion. The MDST and TIS provide reliable assessments of the trunk and are valid scales for measuring trunk performance in people with MS. [Verheyden G, Nuyens G, Nieuwboer A, et al. Reliability and validity of trunk assessment for people with multiple sclerosis.


Author(s):  
Munenori Murata ◽  
Hiroo Takahashi

TrackMan, a device used to measure the kinematics of a ball in flight, has recently been used for research and coaching purposes in tennis, and the number of studies on this device is expected to increase. This study is focused on verifying the accuracy of TrackMan; three participants performed services with various velocities and angular velocity combinations. TrackMan and a proven method based on computer vision were used to measure the speed, spin rate, and spin axis of the ball. The results obtained using each method were compared using regression analysis, intraclass correlation coefficients, and Bland–Altman analysis. The speed and spin rate of the two methods agreed well; the degree of error would not be problematic in practical applications. However, TrackMan underestimated the speed if it was unable to detect the hitting position. The spin axis measured by TrackMan was unstable under low spin rate conditions (<3000 rpm). Furthermore, the standard deviation of the spin axis measurement error exceeded the difference between the average slice and kick service spin axes reported in the previous study, so the spin axis measurement accuracy was very low for research purposes, even when the spin rate was above 3000 rpm. For those who are aware of the existence of outliers, TrackMan can be useful for coaching because it provides instant feedback. Similarly, the measured rotational speed and spin rate could be used in the study.


2019 ◽  
Vol 90 (12) ◽  
pp. 1000-1008
Author(s):  
Caleb D. Johnson ◽  
Alice D. LaGoy ◽  
Gert-Jan Pepping ◽  
Shawn R. Eagle ◽  
Anne Z. Beethe ◽  
...  

INTRODUCTION: Designed as a more ecological measure of reaction times, the Perception-Action Coupling Task (PACT) has shown good reliability and within-subject stability. However, a lengthy testing period was required. Perceptual-motor judgments are known to be affected by proximity of the stimulus to the action boundary. The current study sought to determine the effects of action boundary proximity on PACT performance, and whether redundant levels of stimuli, eliciting similar responses, can be eliminated to shorten the PACT.METHODS: There were 9 men and 7 women who completed 4 testing sessions, consisting of 3 familiarization cycles and 6 testing cycles of the PACT. For the PACT, subjects made judgments on whether a series of balls presented on a tablet afford “posting” (can fit) through a series of apertures. There were 8 ratios of ball to aperture size (B-AR) presented, ranging from 0.2 to 1.8, with each ratio appearing 12 times (12 trials) per cycle. Reaction times and judgment accuracy were calculated, and averaged across all B-ARs. Ratios and individual trials within each B-AR were systematically eliminated. Variables were re-averaged, and intraclass correlation coefficients (ICC) and coefficients of variation (CVTE) were calculated in an iterative manner.RESULTS: With elimination of the 0.2 and 1.8 B-ARs, the PACT showed good reliability (ICC = 0.81–0.99) and consistent within-subject stability (CVTE = 2.2–14.7%). Reliability (ICC = 0.81–0.97) and stability (CVTE = 2.6–15.6%) were unaffected with elimination of up to 8 trials from each B-AR.DISCUSSION: The shortened PACT resulted in an almost 50% reduction in total familiarization/testing time required, significantly increasing usability.Johnson CD, LaGoy AD, Pepping G-J, Eagle SR, Beethe AZ, Bower JL, Alfano CA, Simpson RJ, Connaboy C. Action boundary proximity effects on perceptual-motor judgments. Aerosp Med Hum Perform. 2019; 90(12):1000–1008.


1996 ◽  
Vol 80 (1) ◽  
pp. 133-139 ◽  
Author(s):  
M. D. White ◽  
G. Bouchard ◽  
B. Buemann ◽  
N. Almeras ◽  
J. P. Despres ◽  
...  

The maintenance of weight and adipose tissue mass in humans appears to be related to a balance between the rates of oxidation and consumption of macronutrients; yet, little evidence is available on the reproducibility of 24-h macronutrient oxidation rates and how this relates to diet in the days preceding a chamber session. This study examined the reproducibility of 24-h macronutrient oxidation rates, 24-h energy expenditure (EE), and respiratory quotient (RQ) in 30 adults who ate their habitual diets before two 24-h whole body indirect calorimeter sessions. Results showed that the within-subject coefficients of variation (CVws) for 24-h EE and RQ were 2.8 and 2.6%, respectively. CVws for macronutrient oxidations ranged from approximately 15 to 25%. Means comparisons of 24-h EE, RQ, and macronutrient oxidation rates between sessions showed no significant differences, and all variables had significant positive intraclass correlation coefficients (P < 0.05). In conclusion, macronutrient oxidations all showed significant reproducibility for the group and a significant but lower reproducibility for individuals when habitual diet and activity preceded the experimental sessions.


2016 ◽  
Vol 2016 ◽  
pp. 1-6 ◽  
Author(s):  
Kofi Asiedu ◽  
Samuel Kyei ◽  
Emmanuel Ekow Ampiah

The aim of the study was to determine the level of agreement between Javal’s rule, autorefraction, retinoscopy, and refractive astigmatism and to determine which technique is the most suitable substitute when subjective refraction is not applicable using a clinical sample. A total of 36 subjects, 14 males and 22 females, were involved in this study. The intraclass correlation coefficients between subjective refraction, autorefraction, and retinoscopy were 0.895 and 0.989, respectively, for the spherical equivalent. The Bland-Altman 95% limits of agreement between subjective refraction and autorefraction; subjective refraction and retinoscopy; and autorefraction and retinoscopy were −2.84 to 3.58, −0.88 to 1.12, and −3.01 to 3.53, respectively, for the spherical equivalent. The intraclass correlation coefficients between spectacle total astigmatism and the following techniques were as follows: retinoscopy (0.85); autorefraction (0.92); Javal’s rule (0.82); and Grosvenor et al. version (0.85). The Bland-Altman 95% limits of agreement between subjective refraction and autorefraction; subjective refraction and retinoscopy; subjective refraction and Javal’s rule; and subjective refraction and Grosvenor et al. version were −0.87 to 1.25, −1.49 to 1.99, −0.73 to 1.93, and −0.89 to 1.7, respectively, for the total astigmatism. The study showed that autorefraction and Javal’s rule may provide a starting point for subjective refraction cylinder power determination but only retinoscopy may satisfactorily replace subjective refraction total astigmatism when subjective refraction is not applicable.


2007 ◽  
Vol 21 (4) ◽  
pp. 474-477 ◽  
Author(s):  
Roberto Castano ◽  
Gilles Thériault ◽  
Denyse Gautrin ◽  
Heberto Ghezzo ◽  
Carole Trudeau ◽  
...  

Background To diagnose occupational rhinitis, it is mandatory to conduct an objective assessment of changes in nasal patency during specific inhalation challenge (SIC). The reproducibility of acoustic rhinometry measurements in the setting of occupational challenges has never been examined. This study assessed the reproducibility of acoustic rhinometry during SIC investigation of occupational rhinitis. Methods Twenty-four subjects underwent acoustic rhinometry measurements during SIC investigation of occupational rhinitis. Subjects attended 3–6 days of SIC within a week by means of a realistic or closed-circuit apparatus methodology Results All of the within-day intraclass correlation coefficients (ICCs) for nasal volume (2–5 cm) and minimum cross-sectional area (MCA) based on a different number of measurements (2–7) were above 0.85; all of the coefficients of variation (CVs) for the same parameters were low (below 10%). The between-day CVs based on different numbers of SIC sessions ranged from 8.0 to 8.8% and from 6.8 to 8.8% for nasal volume and MCA, respectively. The between-day ICCs ranged from 0.80 to 0.88 and from 0.83 to 0.94 for nasal volume and MCA, respectively. Conclusion Acoustic rhinometry showed good within- and between-day reproducibility and can be recommended for the objective monitoring of nasal patency during SIC investigating occupational rhinitis.


2016 ◽  
Vol 17 (4) ◽  
pp. 294-299 ◽  
Author(s):  
Mohammad Y Hajeer ◽  
Ahmad L Maroua ◽  
Mowaffak Ajaj

ABSTRACT Objective To evaluate the accuracy and reproducibility of linear measurements made on cone-beam computed tomography (CBCT)-derived digital models. Materials and methods A total of 25 patients (44% female, 18.7 ± 4 years) who had CBCT images for diagnostic purposes were included. Plaster models were obtained and digital models were extracted from CBCT scans. Seven linear measurements from predetermined landmarks were measured and analyzed on plaster models and the corresponding digital models. The measurements included arch length and width at different sites. Paired t test and Bland–Altman analysis were used to evaluate the accuracy of measurements on digital models compared to the plaster models. Also, intraclass correlation coefficients (ICCs) were used to evaluate the reproducibility of the measurements in order to assess the intraobserver reliability. Results The statistical analysis showed significant differences on 5 out of 14 variables, and the mean differences ranged from −0.48 to 0.51 mm. The Bland–Altman analysis revealed that the mean difference between variables was (0.14 ± 0.56) and (0.05 ± 0.96) mm and limits of agreement between the two methods ranged from −1.2 to 0.96 and from −1.8 to 1.9 mm in the maxilla and the mandible, respectively. The intraobserver reliability values were determined for all 14 variables of two types of models separately. The mean ICC value for the plaster models was 0.984 (0.924–0.999), while it was 0.946 for the CBCT models (range from 0.850 to 0.985). Conclusion Linear measurements obtained from the CBCTderived models appeared to have a high level of accuracy and reproducibility. How to cite this article Maroua AL, Ajaj M, Hajeer MY. The Accuracy and Reproducibility of Linear Measurements Made on CBCT-derived Digital Models. J Contemp Dent Pract 2016;17(4):294-299.


2003 ◽  
Vol 93 (5) ◽  
pp. 380-387 ◽  
Author(s):  
Hylton B. Menz ◽  
Anne Tiedemann ◽  
Marcella Mun-San Kwan ◽  
Mark Dominic Latt ◽  
Catherine Sherrington ◽  
...  

Lower-extremity problems are common in older people; however, the reliability of clinical tools used to assess foot and ankle characteristics has not been rigorously evaluated. This study evaluated the test–retest reliability of a battery of simple clinical tests of foot and ankle characteristics (tactile sensitivity of the first metatarsophalangeal joint, navicular height, foot length and width, hallux valgus severity, an overall foot problem score, ankle flexibility, ankle dorsiflexion strength, and foot pain) in 31 individuals (13 men and 18 women) aged 76 to 87 years recruited from the community. Three examiners performed the tests on two occasions approximately 2 weeks apart. Intraclass correlation coefficients and coefficients of variation were calculated for continuously scored tests, and the kappa statistic (κ) was used to determine the reliability of hallux valgus severity grading. All of the continuously scored tests had acceptable reliability (intraclass correlation coefficients of 0.64 to 0.98; coefficients of variation of 0.6% to 15.0%), as did hallux valgus severity grading (κ = 0.77; absolute percentage agreement, 84%). These simple clinical tests can now be used with confidence in clinical and research settings to provide reliable and functionally important information regarding foot and ankle characteristics in older people. (J Am Podiatr Med Assoc 93(5): 380-387, 2003)


Author(s):  
Megan Sax van der Weyden ◽  
Christopher D. Black ◽  
Daniel Larson ◽  
Brian Rollberg ◽  
Jason A. Campbell

This investigation examined relationships between a Special Weapons and Tactics-specific fitness test (SORT) and an obstacle course (OC) used for qualification in fourteen male SWAT members from three local, regional police departments. The SORT included: squat, pushup, and lunge in 60 s; pullup hold; sled drag; and Yo-Yo Intermittent Recovery Test L1. The obstacle course included: 25 m sprint (repeated); window ascent; scale under a wall; 25 m serpentine run (repeated), body drag (20 m, repeated). Pearson coefficients examined SORT and OC relationships (p ≤ 0.05); intraclass correlation coefficients (ICC2,1) assessed agreement of SORT trials. Repeated measures ANOVA evaluated differences in SORT metrics across time. Coefficients of variation (COV) examined SORT scoring consistency. The YoYo test was related to all SORT assessments (r = −0.803–0.894), except sled drag. The remaining SORT metrics were related to ≥two tests. SORT COVs ranged from 0.77–13.26% for trials 1–2 but decreased between trials 2–3 (0.95–8.97%). The OC was associated with YoYo, lunges, squats and sled drag (r = −0.790, −0.730, −0.766, and 0.802, respectively). No differences (p > 0.05) existed across SORT trials for event scores. The SORT battery appears to be a valid and reliable testing measure to assess SWAT occupational specific fitness.


2020 ◽  
Vol 33 (6) ◽  
pp. 783-791
Author(s):  
Yusuke Yokota ◽  
Tomohisa Okada ◽  
Yasutaka Fushimi ◽  
Akira Yamamoto ◽  
Satoshi Nakajima ◽  
...  

Abstract Objective To compare the absolute values and repeatability of magnetic resonance fingerprinting (MRF) with 3000 and 1500 echoes/slice acquired in 41 s and 20 s (MRF3k and MRF1.5k, respectively). Materials and methods MRF3k and MRF1.5k scans based on fast imaging with steady precession (FISP) were conducted using a 3 T scanner. Inter-scan agreement and intra-scan repeatability were investigated in 41 and 28 subjects, respectively. Region-of-interest (ROI) analysis was conducted on T1 values of MRF3k by two raters, and their agreement was evaluated using intraclass correlation coefficients (ICCs). Between MRF3k and MRF1.5k, differences in T1 and T2 values and inter-measurement correlation coefficients (CCs) were investigated. Intra-measurement repeatability was evaluated using coefficients of variation (CVs). A p value < 0.05 was considered statistically significant. Results The ICCs of ROI measurements were 0.77–0.96. Differences were observed between the two MRF scans, but the CCs of the overall ROIs were 0.99 and 0.97 for the T1 and T2 values, respectively. The mean and median CVs of repeatability were equal to or less than 1.58% and 3.13% in each of the ROIs for T1 and T2, respectively; there were some significant differences between MRF3k and MRF1.5k, but they were small, measuring less than 1%. Discussion Both MRF3k and MRF1.5k had high repeatability, and a strong to very strong correlation was observed, with a trend toward slightly higher values in MRF1.5k.


Sign in / Sign up

Export Citation Format

Share Document