The interobserver reliability of the diagnosis and classification of scaphoid fractures using high-resolution peripheral quantitative CT

Aims Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remain unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Methods Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture. Results A total of nine (29%) scaphoid fractures and 12 (39%) other fractures were diagnosed in 20 patients (65%) using HR-pQCT across the four observers. The interobserver agreement was 91% for the identification of a scaphoid fracture (95% confidence interval (CI) 0.76 to 1.00) and 80% for other fractures (95% CI 0.72 to 0.87). The mean ICC for the classification of a scaphoid fracture in the seven patients diagnosed with scaphoid fracture by all four observers was 73% (95% CI 0.42 to 0.94). Conclusion We conclude that the diagnosis of scaphoid and other fractures is reliable when using HR-pQCT in patients with a clinically-suspected fracture. Cite this article: Bone Joint J 2020;102-B(4):478–484.

Download Full-text

The Novel Technique of using Superb Microvascular Imaging to Determine Carotid Intima-media Thickness

American Journal of Sonography ◽

10.25259/ajs-40-2018 ◽

2018 ◽

Vol 1 ◽

pp. 16

Author(s):

Fatima Musarrat Hasan ◽

Musarrat Hasan

Keyword(s):

Interobserver Agreement ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Intima Media Thickness ◽

Carotid Intima Media Thickness ◽

Media Thickness ◽

Superb Microvascular Imaging ◽

Microvascular Imaging ◽

Near Wall

Objective The objective of this study was to investigate the interobserver reliability when measuring the carotid intima media thickness (IMT) using superb microvascular imaging (SMI) and B-mode ultrasonography. Methods Two sonologists were selected to scan the left common carotid artery and measure IMT first with B-mode and then with SMI on 20 patients. They were blinded to each other results. Intraclass correlation coefficients (ICCs) were calculated to estimate the inter-rater reliability using both the modes of scanning. Results Interobserver agreement when using SMI, for both near wall and far wall, was almost perfect (ICC, 0.870; 95% confidence interval [CI], 0.700–0.946). Interobserver agreement when using B-mode was poor for near wall (ICC, 0.396; 95% CI, −0.048–0.708) and moderate for far wall (ICC, 0.474, 95% CI, 0.070–0.749). Conclusions SMI proved to be a greatly reliable tool in the measurement of carotid IMT.

Download Full-text

Quantification of fluoroscopic fundoplication anatomy: inter- and intraobserver reliability

Diseases of the Esophagus ◽

10.1093/dote/doab045 ◽

2021 ◽

Author(s):

Siang Wei Gan ◽

Natalie Lee ◽

Siao En Tan ◽

Suzanne M Edwards ◽

George K Kiroff ◽

...

Keyword(s):

Interobserver Agreement ◽

Interobserver Reliability ◽

Gastroesophageal Junction ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Barium Swallow ◽

Anatomical Features ◽

Intraclass Correlation Coefficients ◽

Objectively Measured ◽

Video Fluoroscopy

SUMMARY The etiology of postfundoplication dysphagia remains incompletely understood. Subtle changes of gastroesophageal junction (GEJ) anatomy may be contributory. Barium swallows have potential for standardization to evaluate postsurgical anatomical features. Using structured barium swallows, we aim to identify reproducible, objectively measured postfundoplication anatomical features that will permit future comparison between patients with/without dysphagia. At 6–12 months of postfundoplication, 31 patients underwent structured barium swallow with video–fluoroscopy recording: standing anteroposterior; standing oblique (×2); prone oblique (×2); and prone oblique with continuous free drinking. A primary observer recorded 11 variables of GEJ anatomy for each view, repeated 3 months later, forming two datasets to assess intraobserver consistency. Interobserver reliability was determined using a dataset each from the primary observer and two medical students (after training). Intraclass correlation coefficients (ICC) were based on two-way mixed-effects model (ICC agreement: 0.40–0.59 ‘fair’; 0.60–0.74 ‘good’; 0.75–1.00 ‘excellent’). Interobserver reliability was good–excellent for 47 of 66 measurements. Measures of maximal esophageal diameter cf. wrap opening diameter and posterior esophageal angle showed high interobserver reproducibility on all views (ICC range 0.84–0.91; 0.68–0.80, respectively). Interobserver agreement was good–excellent for 5/6 views when measuring anterior GEJ displacement and axis deviation (ICC range 0.56–0.79; 0.41–0.77, respectively). Measures of wrap length showed lower reproducibility. Prone oblique measurements showed highest reproducibility (good–excellent agreement in 19/22 measurements). Intraobserver consistency was excellent for 98% of measurements (ICC range 0.74–0.99). Objective measurements of postfundoplication GEJ anatomy using structured barium swallow are reproducible and may allow further interrogation of anatomical features contributing to postfundoplication dysphagia.

Download Full-text

Reliability of a rapid hematology stain for sputum cytology

Jornal Brasileiro de Pneumologia ◽

10.1590/s1806-37132014000300008 ◽

2014 ◽

Vol 40 (3) ◽

pp. 250-258 ◽

Cited By ~ 1

Author(s):

Jéssica Gonçalves ◽

Emilio Pizzichini ◽

Marcia Margaret Menezes Pizzichini ◽

Leila John Marques Steidle ◽

Cristiane Cinara Rocha ◽

...

Keyword(s):

Interobserver Agreement ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Standard Technique ◽

Cross Sectional Study ◽

Induced Sputum ◽

Cell Counting ◽

Kappa Statistics ◽

Cross Sectional ◽

Intraclass Correlation Coefficients

Objective: To determine the reliability of a rapid hematology stain for the cytological analysis of induced sputum samples. Methods: This was a cross-sectional study comparing the standard technique (May-Grünwald-Giemsa stain) with a rapid hematology stain (Diff-Quik). Of the 50 subjects included in the study, 21 had asthma, 19 had COPD, and 10 were healthy (controls). From the induced sputum samples collected, we prepared four slides: two were stained with May-Grünwald-Giemsa, and two were stained with Diff-Quik. The slides were read independently by two trained researchers blinded to the identification of the slides. The reliability for cell counting using the two techniques was evaluated by determining the intraclass correlation coefficients (ICCs) for intraobserver and interobserver agreement. Agreement in the identification of neutrophilic and eosinophilic sputum between the observers and between the stains was evaluated with kappa statistics. Results: In our comparison of the two staining techniques, the ICCs indicated almost perfect interobserver agreement for neutrophil, eosinophil, and macrophage counts (ICC: 0.98-1.00), as well as substantial agreement for lymphocyte counts (ICC: 0.76-0.83). Intraobserver agreement was almost perfect for neutrophil, eosinophil, and macrophage counts (ICC: 0.96-0.99), whereas it was moderate to substantial for lymphocyte counts (ICC = 0.65 and 0.75 for the two observers, respectively). Interobserver agreement for the identification of eosinophilic and neutrophilic sputum using the two techniques ranged from substantial to almost perfect (kappa range: 0.91-1.00). Conclusions: The use of Diff-Quik can be considered a reliable alternative for the processing of sputum samples.

Download Full-text

Interobserver Reliability Using the Phonetic Level Evaluation With Severely and Profoundly Hearing-Impaired Children

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3405.989 ◽

1991 ◽

Vol 34 (5) ◽

pp. 989-999 ◽

Cited By ~ 6

Author(s):

Stephanie Shaw ◽

Truman E. Coggins

Keyword(s):

Interrater Reliability ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Hearing Impaired ◽

Intraclass Correlation Coefficients ◽

Assessment Measure ◽

Impaired Children ◽

Speech Assessment ◽

Hearing Impaired Children

This study examines whether observers reliably categorize selected speech production behaviors in hearing-impaired children. A group of experienced speech-language pathologists was trained to score the elicited imitations of 5 profoundly and 5 severely hearing-impaired subjects using the Phonetic Level Evaluation (Ling, 1976). Interrater reliability was calculated using intraclass correlation coefficients. Overall, the magnitude of the coefficients was found to be considerably below what would be accepted in published behavioral research. Failure to obtain acceptably high levels of reliability suggests that the Phonetic Level Evaluation may not yet be an accurate and objective speech assessment measure for hearing-impaired children.

Download Full-text

Assessment of reliability and validity of the 5-scale grading system of the point-of-care immunoassay for tear matrix metalloproteinase-9

Scientific Reports ◽

10.1038/s41598-021-92020-6 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Minjeong Kim ◽

Ja Young Oh ◽

Seon Ha Bae ◽

Seung Hyeun Lee ◽

Won Jun Lee ◽

...

Keyword(s):

Matrix Metalloproteinase ◽

Calibration Curve ◽

Point Of Care ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Reliability And Validity ◽

Correlation Coefficients ◽

Grading System ◽

Intraclass Correlation Coefficients ◽

The Difference

AbstractWe evaluated the reliability and validity of the 5-scale grading system to interpret the point-of-care immunoassay for tear matrix metalloproteinase (MMP)-9. Six observers graded red bands of photographs of the readout window in MMP-9 immunoassay kit (InflammaDry) two times with 2-week interval based on the 5-scale grading system (i.e. grade 0–4). Interobserver and intraobserver reliability were evaluated using intraclass correlation coefficients. The interobserver agreements were analyzed according to the severity of tear MMP-9 expression. To validate the system, a concentration calibration curve was made using MMP-9 solutions with reference concentrations, then the distribution of MMP-9 concentrations was analyzed according to the 5-scale grading system. Both intraobserver and interobserver reliability was excellent. The readout grades were significantly correlated with the quantified colorimetric densities. The interobserver variance of readout grades had no correlation with the severity of the measured densities. The band density continued to increase up to a maximal concentration (i.e. 5000 ng/mL) according to the calibration curve. The difference of grades reflected the change of MMP-9 concentrations sensitively, especially between grade 2 and 4. Together, our data indicate that the subjective 5-scale grading system in the point-of-care MMP-9 immunoassay is an easy and reliable method with acceptable accuracy.

Download Full-text

Reliability and Change in Erosion Measurements by High-Resolution peripheral Quantitative Computed Tomography in a Longitudinal Dataset of Rheumatoid Arthritis Patients

The Journal of Rheumatology ◽

10.3899/jrheum.191391 ◽

2020 ◽

pp. jrheum.191391 ◽

Cited By ~ 1

Author(s):

Stephanie Finzel ◽

Sarah L. Manske ◽

Cheryl Barnabe ◽

Andrew J. Burghardt ◽

Hubert Marotte ◽

...

Keyword(s):

Rheumatoid Arthritis ◽

Computed Tomography ◽

High Resolution ◽

Intraclass Correlation ◽

Quantitative Computed Tomography ◽

Correlation Coefficients ◽

Peripheral Quantitative Computed Tomography ◽

Good Reliability ◽

Change Over Time ◽

Over Time

Objective The aim of this multi-reader exercise was to assess the reliability and change over time of erosion measurements in rheumatoid arthritis (RA) patients using high-resolution peripheral quantitative computed tomography (HR-pQCT). Methods HR-pQCT scans of 23 patients with RA were assessed at baseline and 12 months. Four experienced readers examined the dorsal, palmar, radial, and ulnar surfaces of the metacarpal head (MH) and phalangeal base (PB) of the 2nd and 3rd digits, blinded to time order. In total, 368 surfaces (23 patients x16 surfaces) were evaluated per time point to characterize cortical breaks as pathological (erosion) or physiological, and to quantify erosion width and depth. Reliability was evaluated by intraclass correlation coefficients (ICC), percentage agreement, and Light’s kappa; change over time was defined by means ± SD of erosion numbers and dimensions. Results ICCs for the mean measurements of width and depth of the pathological breaks ranged between 0.819 - 0.883, and 0.771 - 0.907 respectively. Most physiological cortical breaks were found at the palmar PB, whereas most pathological cortical breaks were located at the radial MH. There was a significant increase in both the numbers and the dimensions of erosions between baseline and follow-up (p=0.0001 for erosion numbers, width, and depth in axial plane, and p=0.001 for depth in perpendicular plane). Conclusion This exercise confirmed good reliability of HR-pQCT erosion measurements and their ability to detect change over time.

Download Full-text

Interobserver Reliability and Change in the Sagittal Tibial Tubercle–Trochlear Groove Distance with Increasing Knee Flexion Angles

The Journal of Knee Surgery ◽

10.1055/s-0041-1729547 ◽

2021 ◽

Author(s):

Ian S. MacLean ◽

Taylor M. Southworth ◽

Ian J. Dempsey ◽

Neal B. Naveen ◽

Hailey P. Huddleston ◽

...

Keyword(s):

Knee Flexion ◽

Sagittal Plane ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Flexion Angle ◽

Tibial Tubercle ◽

Trochlear Groove ◽

Knee Flexion Angle ◽

Intraclass Correlation Coefficients

AbstractThe tibial tubercle–trochlear groove (TT-TG) distance is currently utilized to evaluate knee alignment in patients with patellar instability. Sagittal plane pathology measured by the sagittal tibial tubercle–trochlear groove (sTT-TG) distance has been described in instability but may also be important to consider in patients with cartilage injury. This study aims to (1) describe interobserver reliability of the sTT-TG distance and (2) characterize the change in the sTT-TG distance with respect to changing knee flexion angles. In this cadaveric study, six nonpaired cadaveric knees underwent magnetic resonance imaging (MRI) studies at each of the following degrees of knee flexion: −5, 0, 5, 10, 15, and 20. The sTT-TG distance was measured on the axial T2 sequence. Four reviewers measured this distance for each cadaver at each flexion angle. Intraclass correlation coefficients were calculated to determine interobserver reliability and reproducibility of the sTT-TG measurement. Analysis of variance (ANOVA) tests and Friedman's tests with a Bonferroni's correction were performed for each cadaver to compare sTT-TG distances at each flexion angle. Significance was defined as p < 0.05. There was excellent interobserver reliability of the sTT-TG distance with all intraclass correlation coefficients >0.9. The tibial tubercle progressively becomes more posterior in relation to the trochlear groove (more negative sTT-TG distance) with increasing knee flexion. The sTT-TG distance is a measurement that is reliable between attending surgeons and across training levels. The sTT-TG distance is affected by small changes in knee flexion angle. Awareness of knee flexion angle on MRI is important when this measurement is utilized by surgeons.

Download Full-text

3D Biometrics for Hindfoot Alignment Using Weightbearing Computed Tomography

Foot & Ankle International ◽

10.1177/1071100719835492 ◽

2019 ◽

Vol 40 (6) ◽

pp. 720-726 ◽

Cited By ~ 24

Author(s):

Jian Zhong Zhang ◽

François Lintz ◽

Alessio Bernasconi ◽

Shu Zhang ◽

Keyword(s):

Computed Tomography ◽

Comparative Study ◽

Interobserver Reliability ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Level Of Evidence ◽

Hindfoot Alignment ◽

Mean Values ◽

Intraclass Correlation Coefficients ◽

Prospective Comparative Study

Background: Weightbearing computed tomography (WBCT) is a useful tool for the assessment of hindfoot alignment (HA). Foot ankle offset (FAO) is a recently introduced parameter, determined from WBCT images using semiautomatic software. The aim of this study was to determine the clinical relevance and reproducibility of FAO for the evaluation of HA. Methods: A prospective comparative study was performed on consecutive patients requiring bilateral WBCT between September 2017 and April 2018. Based on the clinical assessment of HA, patients were divided into 3 groups: (1) normal alignment group (G1), (2) valgus (G2), and (3) varus (G3). FAO and long axial view (HACT) were measured on WBCT images, and the groups were compared. The reproducibility of FAO and HACT was determined through intraclass correlation coefficients (ICCs). Regression analysis was performed to investigate the correlation between the 2 methods. Overall, 249 feet (126 patients) were included (G1 = 115, G2 = 78, and G3 = 56 feet). Results: The mean values for FAO and HACT were 1.2% ± 2.8% and 3.9 ± 3.1, respectively, in G1; 8.1% ± 3.7% and 9.7 ± 4.9 in G2; and −6.6% ± 4.8% and −8.2 ± 6.6 in G3. Intra- and interobserver reliability was 0.987 and 0.988 for FAO and 0.949 and 0.949 for HACT, respectively. There was a good linear correlation between HACT and FAO ( R2 = 0.744), with a regression slope of 1.064. Conclusions: WBCT was a useful method for the characterization of HA. FAO was reproducible and correlated well with physical examination. Level of Evidence: Level II, prospective comparative study.

Download Full-text

Interobserver reliability of the classification of capitellar osteochondritis dissecans using magnetic resonance imaging

Shoulder & Elbow ◽

10.1177/1758573218821151 ◽

2019 ◽

Vol 12 (4) ◽

pp. 284-293 ◽

Cited By ~ 1

Author(s):

Rens Bexkens ◽

F. Joseph Simeone ◽

Denise Eygendaal ◽

Michel PJ van den Bekerom ◽

Luke S Oh ◽

...

Keyword(s):

Magnetic Resonance Imaging ◽

Magnetic Resonance ◽

Osteochondritis Dissecans ◽

Interobserver Agreement ◽

Interobserver Reliability ◽

Lateral Wall ◽

Magnetic Resonance Images ◽

Resonance Imaging ◽

Instability Criteria

Aim (1) To determine the interobserver reliability of magnetic resonance classifications and lesion instability criteria for capitellar osteochondritis dissecans lesions and (2) to assess differences in reliability between subgroups. Methods Magnetic resonance images of 20 patients with capitellar osteochondritis dissecans were reviewed by 33 observers, 18 orthopaedic surgeons and 15 musculoskeletal radiologists. Observers were asked to classify the osteochondritis dissecans according to classifications developed by Hepple, Dipaola/Nelson, Itsubo, as well as to apply the lesion instability criteria of DeSmet/Kijowski and Satake. Interobserver agreement was calculated using the multirater kappa (k) coefficient. Results Interobserver agreement ranged from slight to fair: Hepple (k = 0.23); Dipaola/Nelson (k = 0.19); Itsubo (k = 0.18); DeSmet/Kijowksi (k = 0.16); Satake (k = 0.12). When classifications/instability criteria were dichotomized into either a stable or unstable osteochondritis dissecans, there was more agreement for Hepple (k = 0.52; p = .002), Dipaola/Nelson (k = 0.38; p = .015), DeSmet/Kijowski (k = 0.42; p = .001) and Satake (k = 0.41; p < .001). Overall, agreement was not associated with the number of years in practice or the number of osteochondritis dissecans cases encountered per year (p > .05). Conclusion One should be cautious when assigning grades using magnetic resonance classifications for capitellar osteochondritis dissecans. When making treatment decisions, one should rather use relatively simple distinctions (e.g. stable versus unstable osteochondritis dissecans; lateral wall intact versus not intact), as these are more reliable.

Download Full-text

The Validity of In Vivo Tooth Volume Determinations From Cone-Beam Computed Tomography

The Angle Orthodontist ◽

10.2319/121608-639.1 ◽

2010 ◽

Vol 80 (1) ◽

pp. 160-166 ◽

Cited By ~ 47

Author(s):

Yi Liu ◽

Raphael Olszewski ◽

Emanuel Stefan Alexandroni ◽

Reyes Enciso ◽

Tianmin Xu ◽

...

Keyword(s):

Computed Tomography ◽

Cone Beam Computed Tomography ◽

Interobserver Reliability ◽

Pearson Correlation ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Volumetric Analysis ◽

Cone Beam ◽

Water Displacement

Abstract Objective: To determine the accuracy of volumetric analysis of teeth in vivo using cone-beam computed tomography (CBCT). Materials and Methods: The physical volume (Vw) of 24 bicuspids extracted for orthodontic purposes (16 were imaged with the I-CAT and 8 with the CB MercuRay) were determined using the water displacement technique. Corresponding pretreatment CBCT image data were uploaded into Amira 4.0 for segmentation and radiographic volume (Va). All measurements were performed twice by two observers. The statistical difference between Vw and Va was assessed using a paired t-test. The intraobserver and interobserver reliability were determined by calculating Pearson correlation coefficients and intraclass correlation coefficients. Results: The overall mean Vw of teeth specimens was 0.553 ± 0.082 cm3, while the overall mean Va was 0.548 ± 0.079 cm3 (0.529 ± 0.078 cm3 for observer 1 and 0.567 ± 0.085 cm3 for observer 2). There were statistically significant differences between Va and Vw (P < .05). Between observer 1 and observer 2, Va measurements were statistically significantly different (P < .05). The interobserver and intraobserver correlation coefficient for Vw was high. Lastly, surface smoothing reduced the volume by 3% to 12%. Conclusions: In vivo determination of tooth volumes from CBCT data is feasible. The measurements slightly deviate from the physical volumes within −4% to 7%. Smoothing operations reduce volume measurements. Currently, no requirements for accuracy of volumetric determinations of tooth volume have been established.

Download Full-text