Quantification of fluoroscopic fundoplication anatomy: inter- and intraobserver reliability

Author(s):  
Siang Wei Gan ◽  
Natalie Lee ◽  
Siao En Tan ◽  
Suzanne M Edwards ◽  
George K Kiroff ◽  
...  

SUMMARY The etiology of postfundoplication dysphagia remains incompletely understood. Subtle changes of gastroesophageal junction (GEJ) anatomy may be contributory. Barium swallows have potential for standardization to evaluate postsurgical anatomical features. Using structured barium swallows, we aim to identify reproducible, objectively measured postfundoplication anatomical features that will permit future comparison between patients with/without dysphagia. At 6–12 months of postfundoplication, 31 patients underwent structured barium swallow with video–fluoroscopy recording: standing anteroposterior; standing oblique (×2); prone oblique (×2); and prone oblique with continuous free drinking. A primary observer recorded 11 variables of GEJ anatomy for each view, repeated 3 months later, forming two datasets to assess intraobserver consistency. Interobserver reliability was determined using a dataset each from the primary observer and two medical students (after training). Intraclass correlation coefficients (ICC) were based on two-way mixed-effects model (ICC agreement: 0.40–0.59 ‘fair’; 0.60–0.74 ‘good’; 0.75–1.00 ‘excellent’). Interobserver reliability was good–excellent for 47 of 66 measurements. Measures of maximal esophageal diameter cf. wrap opening diameter and posterior esophageal angle showed high interobserver reproducibility on all views (ICC range 0.84–0.91; 0.68–0.80, respectively). Interobserver agreement was good–excellent for 5/6 views when measuring anterior GEJ displacement and axis deviation (ICC range 0.56–0.79; 0.41–0.77, respectively). Measures of wrap length showed lower reproducibility. Prone oblique measurements showed highest reproducibility (good–excellent agreement in 19/22 measurements). Intraobserver consistency was excellent for 98% of measurements (ICC range 0.74–0.99). Objective measurements of postfundoplication GEJ anatomy using structured barium swallow are reproducible and may allow further interrogation of anatomical features contributing to postfundoplication dysphagia.

1991 ◽  
Vol 34 (5) ◽  
pp. 989-999 ◽  
Author(s):  
Stephanie Shaw ◽  
Truman E. Coggins

This study examines whether observers reliably categorize selected speech production behaviors in hearing-impaired children. A group of experienced speech-language pathologists was trained to score the elicited imitations of 5 profoundly and 5 severely hearing-impaired subjects using the Phonetic Level Evaluation (Ling, 1976). Interrater reliability was calculated using intraclass correlation coefficients. Overall, the magnitude of the coefficients was found to be considerably below what would be accepted in published behavioral research. Failure to obtain acceptably high levels of reliability suggests that the Phonetic Level Evaluation may not yet be an accurate and objective speech assessment measure for hearing-impaired children.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Minjeong Kim ◽  
Ja Young Oh ◽  
Seon Ha Bae ◽  
Seung Hyeun Lee ◽  
Won Jun Lee ◽  
...  

AbstractWe evaluated the reliability and validity of the 5-scale grading system to interpret the point-of-care immunoassay for tear matrix metalloproteinase (MMP)-9. Six observers graded red bands of photographs of the readout window in MMP-9 immunoassay kit (InflammaDry) two times with 2-week interval based on the 5-scale grading system (i.e. grade 0–4). Interobserver and intraobserver reliability were evaluated using intraclass correlation coefficients. The interobserver agreements were analyzed according to the severity of tear MMP-9 expression. To validate the system, a concentration calibration curve was made using MMP-9 solutions with reference concentrations, then the distribution of MMP-9 concentrations was analyzed according to the 5-scale grading system. Both intraobserver and interobserver reliability was excellent. The readout grades were significantly correlated with the quantified colorimetric densities. The interobserver variance of readout grades had no correlation with the severity of the measured densities. The band density continued to increase up to a maximal concentration (i.e. 5000 ng/mL) according to the calibration curve. The difference of grades reflected the change of MMP-9 concentrations sensitively, especially between grade 2 and 4. Together, our data indicate that the subjective 5-scale grading system in the point-of-care MMP-9 immunoassay is an easy and reliable method with acceptable accuracy.


Author(s):  
Ian S. MacLean ◽  
Taylor M. Southworth ◽  
Ian J. Dempsey ◽  
Neal B. Naveen ◽  
Hailey P. Huddleston ◽  
...  

AbstractThe tibial tubercle–trochlear groove (TT-TG) distance is currently utilized to evaluate knee alignment in patients with patellar instability. Sagittal plane pathology measured by the sagittal tibial tubercle–trochlear groove (sTT-TG) distance has been described in instability but may also be important to consider in patients with cartilage injury. This study aims to (1) describe interobserver reliability of the sTT-TG distance and (2) characterize the change in the sTT-TG distance with respect to changing knee flexion angles. In this cadaveric study, six nonpaired cadaveric knees underwent magnetic resonance imaging (MRI) studies at each of the following degrees of knee flexion: −5, 0, 5, 10, 15, and 20. The sTT-TG distance was measured on the axial T2 sequence. Four reviewers measured this distance for each cadaver at each flexion angle. Intraclass correlation coefficients were calculated to determine interobserver reliability and reproducibility of the sTT-TG measurement. Analysis of variance (ANOVA) tests and Friedman's tests with a Bonferroni's correction were performed for each cadaver to compare sTT-TG distances at each flexion angle. Significance was defined as p < 0.05. There was excellent interobserver reliability of the sTT-TG distance with all intraclass correlation coefficients >0.9. The tibial tubercle progressively becomes more posterior in relation to the trochlear groove (more negative sTT-TG distance) with increasing knee flexion. The sTT-TG distance is a measurement that is reliable between attending surgeons and across training levels. The sTT-TG distance is affected by small changes in knee flexion angle. Awareness of knee flexion angle on MRI is important when this measurement is utilized by surgeons.


2019 ◽  
Vol 40 (6) ◽  
pp. 720-726 ◽  
Author(s):  
Jian Zhong Zhang ◽  
François Lintz ◽  
Alessio Bernasconi ◽  
Shu Zhang ◽  

Background: Weightbearing computed tomography (WBCT) is a useful tool for the assessment of hindfoot alignment (HA). Foot ankle offset (FAO) is a recently introduced parameter, determined from WBCT images using semiautomatic software. The aim of this study was to determine the clinical relevance and reproducibility of FAO for the evaluation of HA. Methods: A prospective comparative study was performed on consecutive patients requiring bilateral WBCT between September 2017 and April 2018. Based on the clinical assessment of HA, patients were divided into 3 groups: (1) normal alignment group (G1), (2) valgus (G2), and (3) varus (G3). FAO and long axial view (HACT) were measured on WBCT images, and the groups were compared. The reproducibility of FAO and HACT was determined through intraclass correlation coefficients (ICCs). Regression analysis was performed to investigate the correlation between the 2 methods. Overall, 249 feet (126 patients) were included (G1 = 115, G2 = 78, and G3 = 56 feet). Results: The mean values for FAO and HACT were 1.2% ± 2.8% and 3.9 ± 3.1, respectively, in G1; 8.1% ± 3.7% and 9.7 ± 4.9 in G2; and −6.6% ± 4.8% and −8.2 ± 6.6 in G3. Intra- and interobserver reliability was 0.987 and 0.988 for FAO and 0.949 and 0.949 for HACT, respectively. There was a good linear correlation between HACT and FAO ( R2 = 0.744), with a regression slope of 1.064. Conclusions: WBCT was a useful method for the characterization of HA. FAO was reproducible and correlated well with physical examination. Level of Evidence: Level II, prospective comparative study.


Scientifica ◽  
2014 ◽  
Vol 2014 ◽  
pp. 1-5
Author(s):  
Zeki Dostbil ◽  
Yusuf Dag ◽  
Ozlem Cetinkaya ◽  
Mehmet Akdag ◽  
Bekir Tasdemir

Objectives.The measurement of mucociliary transport velocity by rhinoscintigraphy with Tc-99m-macroaggregated albumin (99mTc-MAA) is reliable measure of mucociliary clearance. The aim of this study is to assess the intratest, interobserver, and intraobserver reproducibility of nasal mucociliary transport rate (NMTR) measurement.Materials and Methods.Twenty-two subjects were evaluated to determine intratest reproducibility and a group of 35 subjects was examined to determine inter- and intraobserver reproducibility. Rhinoscintigraphy with99mTc-MAA was used to measure NMTR in all study subjects. Paired NMTR measurements were compared using a range of statistical methodologies. Intraclass correlation coefficients (ICC) and repeatability coefficients and Bland-Altman plots were applied to assess the degree of intratest, interobserver, and intraobserver variation.Results.Statistical analysis of test and retest experiments demonstrated the statistical equivalence of intratest NMTR measurements, interobserver NMTR measurements, and intraobserver NMTR measurements. The intratest ICC, interobserver ICC, and intraobserver ICC were 0.96, 0.83, and 0.91, respectively, indicating that intratest and intraobserver reproducibility are excellent and interobserver reproducibility is good.Conclusions.Rhinoscintigraphy using99mTc-MAA results in highly reproducible measurement of NMTR. The use of radionuclide imaging in measuring NMTR results in excellent intratest and intraobserver reproducibility and good interobserver reliability.


2018 ◽  
Vol 1 ◽  
pp. 16
Author(s):  
Fatima Musarrat Hasan ◽  
Musarrat Hasan

Objective The objective of this study was to investigate the interobserver reliability when measuring the carotid intima media thickness (IMT) using superb microvascular imaging (SMI) and B-mode ultrasonography. Methods Two sonologists were selected to scan the left common carotid artery and measure IMT first with B-mode and then with SMI on 20 patients. They were blinded to each other results. Intraclass correlation coefficients (ICCs) were calculated to estimate the inter-rater reliability using both the modes of scanning. Results Interobserver agreement when using SMI, for both near wall and far wall, was almost perfect (ICC, 0.870; 95% confidence interval [CI], 0.700–0.946). Interobserver agreement when using B-mode was poor for near wall (ICC, 0.396; 95% CI, −0.048–0.708) and moderate for far wall (ICC, 0.474, 95% CI, 0.070–0.749). Conclusions SMI proved to be a greatly reliable tool in the measurement of carotid IMT.


2020 ◽  
pp. 193229682097465
Author(s):  
Joel Willem Johan Lasschuit ◽  
Jill Featherston ◽  
Katherine Thuy Trang Tonks

Background: In an era of increasing technology and telehealth utilization, three-dimensional (3D) wound cameras promise reliable, rapid, and touch-free ulceration measurements. However, reliability data for commercially available devices in the diabetes foot service setting is lacking. We aimed to evaluate the reliability of diabetes-related foot ulceration measurement using a 3D wound camera in comparison to the routinely used ruler and probe. Method: Participants were prospectively recruited from a tertiary interdisciplinary diabetes foot service. Ulcerations were measured at each visit by two blinded observers, first by ruler and probe, and then using a 3D wound camera twice. Reliability was evaluated using intraclass correlation coefficients (ICC). Measurement methods were compared by Pearson correlation. Results: Sixty-three ulcerations affecting 38 participants were measured over 122 visits. Interobserver reliability of ruler measurement was excellent for estimated area (ICC 0.98, 95% CI 0.97-0.98) and depth (ICC 0.93, 95% CI 0.90-0.95). Intraobserver and interobserver reliability of the 3D wound camera area was excellent (ICC 0.96, 95%CI 0.95-0.97 and 0.97 95% CI 0.96-0.98, respectively). Depth was unrecordable in over half of 3D wound camera measurements, and reliability was inferior to probe measurement. Area correlation between methods was good ( R = 0.88 and 0.94 per observer); however, depth correlation was poor ( R = 0.49 and 0.65). Conclusions: 3D wound cameras offer practical advantages over ruler-based measurement. In diabetes-related foot ulceration, the reliability and comparability of area measurement was excellent across both methods, although depth was more reliably obtained by the probe. These limitations, together with cost, are important considerations if implementing this technology in diabetes foot care.


2000 ◽  
Vol 122 (2) ◽  
pp. 212-215 ◽  
Author(s):  
Fatma Tulin Kayhan ◽  
David Zurakowski ◽  
Steven D. Rauch

The Toronto Facial Grading System (TFGS) is an observer scale for rating facial nerve dysfunction. The TFGS scores aspects of resting symmetry, symmetry of voluntary movement, and synkinesis for each division of the face (subscores) and then provides calculated total scores and an overall composite score of facial function. The developers of the scale have validated its sensitivity for identifying small changes in facial dysfunction and the independence of the different components measured. Herein we report our results in a study of interob-server reliability using the TFGS. Twenty-five patients from the Massachusetts Eye and Ear Infirmary Facial Nerve Center with varying degrees of facial paresis, paralysis, and synkinesis were videotaped, and the video recordings were scored by 5 independent observers using the TFGS. Intraclass correlation coefficients (κ) and 95% confidence intervals were calculated for subscores and for each total and composite score. Intraclass correlation coefficients ranged from 0.59 to 0.85, all considered substantial to near-perfect agreement between observers. We believe the TFGS is superior to other scales by virtue of its sensitivity, comprehensiveness, ease of use, and interobserver reliability. The TFGS presently appears to be the best option in those situations in which accurate and precise documentation of facial function is required.


2020 ◽  
Vol 102-B (4) ◽  
pp. 478-484 ◽  
Author(s):  
Anne M. Daniels ◽  
Caroline E. Wyers ◽  
Heinrich M. J. Janzing ◽  
Sander Sassen ◽  
Daan Loeffen ◽  
...  

Aims Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remain unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Methods Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture. Results A total of nine (29%) scaphoid fractures and 12 (39%) other fractures were diagnosed in 20 patients (65%) using HR-pQCT across the four observers. The interobserver agreement was 91% for the identification of a scaphoid fracture (95% confidence interval (CI) 0.76 to 1.00) and 80% for other fractures (95% CI 0.72 to 0.87). The mean ICC for the classification of a scaphoid fracture in the seven patients diagnosed with scaphoid fracture by all four observers was 73% (95% CI 0.42 to 0.94). Conclusion We conclude that the diagnosis of scaphoid and other fractures is reliable when using HR-pQCT in patients with a clinically-suspected fracture. Cite this article: Bone Joint J 2020;102-B(4):478–484.


2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Annemiek Snoeckx ◽  
Jeroen Cant ◽  
Caro Franck ◽  
Elisa Luyckx ◽  
Ken Carpentier ◽  
...  

Abstract Purpose A newly developed image processing technique fuses conventional windows into a single ‘All-In-One’ (AIO) window. This study aims to evaluate variability of CT measurement of lesions in thoracic oncology patients on this novel AIO-window. Methods Six radiologists with different levels of expertise measured 368 lesions of various size, origin and sharpness. All lesions were measured twice on the AIO-window and twice on the conventional window settings. Intraclass correlation coefficients and Bland-Altman plots were used to assess intra- and interobserver variability. Results Overall intra-observer agreement for lesion diameters on the AIO-window and conventional window settings was 0.986 (95% Confidence interval (CI): 0.983–0.989) and 0.991 (95% CI 0.989–0.993) respectively. For interobserver agreement this was 0.982 (95% CI 0.979–0.985) (AIO) and 0.979 (95% CI 0.957–0.982) (conventional). For both the AIO and conventional windows, intra- and interobserver agreement were dependent on size, sharpness and reader experience. Measurement variability decreased with increasing lesion size. Regarding sharpness, inter- and intra-observer agreement ranged from 0.986–0.989 (AIO) and 0.985–0.992 (conventional) for well-defined lesions and from 0.978–0.983 (AIO) and 0.974–0.991 (conventional) for ill-defined lesions. Conclusions Lesion diameters were consistently smaller on the AIO-window compared to conventional window settings. Overall intra- and interobserver variability rates were similar for the AIO-window and conventional window settings. We conclude that the AIO-window offers a reliable and reproducible alternative for measurement of thoracic lesions.


Sign in / Sign up

Export Citation Format

Share Document