Reliability of Dermatologists in Acne Lesion Counts and Global Assessments

Background: There is a paucity of data on the reliability of dermatologists in acne lesion counting and global severity assessments. The effects of training and practice on reliability are also uncertain. The objective of this study was to determine the reliability of these outcome measurements when performed by trained dermatologists. Methods: Eleven dermatologists were divided into two groups that evaluated the same six acne subjects twice on the same day. A training session was provided either after (group A) or before (group B) the first patient evaluation sessions. Reliability of raters in lesion counting and global severity assessment was determined by calculation of intraclass correlation coefficients (ICCs). ICC values close to 1.0 indicate excellent reliability, whereas values less than 0.75 are considered unacceptable. Results: Intrarater ICCs ranged from 0.37 to 0.99 for noninflammatory lesions, 0.26 to 0.97 for inflammatory lesions, and 0.56 to 0.83 for global assessments for group A (trained after); corresponding values for group B (trained before) were 0.84 to 0.98, 0.61 to 0.95, and 0.43 to 0.91. ICC values ≥ 0.75 for all three outcome parameters were observed in one of six group A and three of five group B raters. Interrater ICCs for groups A and B after the first evaluation session were 0.17 versus 0.68 for noninflammatory counts, 0.84 versus 0.72 for inflammatory counts, and 0.71 versus 0.65 for global assessments, respectively. Corresponding values after session 2 were 0.79 and 0.77 for noninflammatory, 0.81 and 0.90 for inflammatory, and 0.61 and 0.77 for global assessments. Conclusion: Dermatologists tended to be reliable in acne lesion counting but somewhat less so in global assessments. Training tended to improve group reliability in noninflammatory lesion counts and increased the proportion of raters with good reliability in all three outcome measures. Practice enhanced reliability in all outcome measurements.

Download Full-text

Action Boundary Proximity Effects on Perceptual-Motor Judgments

Aerospace Medicine and Human Performance ◽

10.3357/amhp.5376.2019 ◽

2019 ◽

Vol 90 (12) ◽

pp. 1000-1008

Author(s):

Caleb D. Johnson ◽

Alice D. LaGoy ◽

Gert-Jan Pepping ◽

Shawn R. Eagle ◽

Anne Z. Beethe ◽

...

Keyword(s):

Intraclass Correlation ◽

Reaction Times ◽

Correlation Coefficients ◽

Proximity Effects ◽

Testing Time ◽

Good Reliability ◽

Intraclass Correlation Coefficients ◽

Coefficients Of Variation ◽

Time Required ◽

Action Coupling

INTRODUCTION: Designed as a more ecological measure of reaction times, the Perception-Action Coupling Task (PACT) has shown good reliability and within-subject stability. However, a lengthy testing period was required. Perceptual-motor judgments are known to be affected by proximity of the stimulus to the action boundary. The current study sought to determine the effects of action boundary proximity on PACT performance, and whether redundant levels of stimuli, eliciting similar responses, can be eliminated to shorten the PACT.METHODS: There were 9 men and 7 women who completed 4 testing sessions, consisting of 3 familiarization cycles and 6 testing cycles of the PACT. For the PACT, subjects made judgments on whether a series of balls presented on a tablet afford “posting” (can fit) through a series of apertures. There were 8 ratios of ball to aperture size (B-AR) presented, ranging from 0.2 to 1.8, with each ratio appearing 12 times (12 trials) per cycle. Reaction times and judgment accuracy were calculated, and averaged across all B-ARs. Ratios and individual trials within each B-AR were systematically eliminated. Variables were re-averaged, and intraclass correlation coefficients (ICC) and coefficients of variation (CVTE) were calculated in an iterative manner.RESULTS: With elimination of the 0.2 and 1.8 B-ARs, the PACT showed good reliability (ICC = 0.81–0.99) and consistent within-subject stability (CVTE = 2.2–14.7%). Reliability (ICC = 0.81–0.97) and stability (CVTE = 2.6–15.6%) were unaffected with elimination of up to 8 trials from each B-AR.DISCUSSION: The shortened PACT resulted in an almost 50% reduction in total familiarization/testing time required, significantly increasing usability.Johnson CD, LaGoy AD, Pepping G-J, Eagle SR, Beethe AZ, Bower JL, Alfano CA, Simpson RJ, Connaboy C. Action boundary proximity effects on perceptual-motor judgments. Aerosp Med Hum Perform. 2019; 90(12):1000–1008.

Download Full-text

Test–Retest Reliability of a New Device Versus a Long-Arm Goniometer to Evaluate Knee Proprioception

Journal of Sport Rehabilitation ◽

10.1123/jsr.2021-0146 ◽

2021 ◽

pp. 1-6

Author(s):

Fei Tian ◽

Yaqi Zhao ◽

Jixin Li ◽

Wenjin Wang ◽

Danni Wu ◽

...

Keyword(s):

Standard Error ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Joint Position Sense ◽

Repeated Measurements ◽

Good Reliability ◽

Retest Reliability ◽

New Device ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability

Context: Many methods used to evaluate knee proprioception have shortcomings that limit their use in clinical settings. Based on an inexpensive 3D camera, a new portable device was recently used to evaluate the joint position sense (JPS) of the knee joint. However, the test–retest reliability of the new method remains unclear. This study aimed to evaluate the test–retest reliability of the new device and a long-arm goniometer for assessing knee JPS, and to compare the variability of the 2 methods. Design: Prospective observational study of the test–retest reliability of knee JPS measurements. Methods: Twenty-one healthy adults were tested in 2 sessions with a 1-week interval. Three target knee flexion angles (30°, 45°, and 60°) were reproduced in each session. Target and reproduced angles were measured with both methods. Intraclass correlation coefficients, standard error of the measurement, and Bland–Altman plots were used to quantify test–retest reliability. Paired t tests were used to compare knee JPS (absolute error of the target-reproduced angle) between the methods. Results: The new device (good to excellent intraclass correlation coefficients .74–.80; standard error of the measurement 0.52°–0.61°) demonstrated better test–retest reliability than the goniometer (poor to fair intraclass correlation coefficients .23–.43; standard error of the measurement 0.89°–2.07°) and better test–retest agreement (respective mean differences for the 30°, 45°, and 60° knee angles: 0.11°, 0.13°, and 0.41° for the new system; 0.84°, 1.52°, and 1.18° for the goniometer). The measurements (absolute errors of the target-reproduced angles) with the goniometer were significantly greater than those with the new device (P < .05); the SDs of repeated measurements with the goniometer (1.50°–2.41°) were greater than with the new device (1.08°–1.38°). Conclusions: Given that the new device has good reliability and sufficient precision, it is the better alternative for evaluating knee JPS. Goniometers should be used with caution to assess knee JPS.

Download Full-text

Assessment of Physical Activity in Adults: A Review of Validated Questionnaires From a Nutritionist’s Point of View

Evaluation & the Health Professions ◽

10.1177/0163278720928500 ◽

2020 ◽

Vol 43 (4) ◽

pp. 235-254

Author(s):

Angela Patricia Bacelis-Rivero ◽

Anabel Vázquez-Rodríguez ◽

Claudia E. Carrasco-Legleu ◽

Lidia G. De León ◽

Sandra Alicia Reza-López

Keyword(s):

Physical Activity ◽

Energy Expenditure ◽

Intraclass Correlation ◽

Reliability And Validity ◽

Correlation Coefficients ◽

Epidemiological Studies ◽

Point Of View ◽

Eligibility Criteria ◽

Good Reliability ◽

Intraclass Correlation Coefficients

Physical activity (PA) is a component of total energy expenditure. PA and PA energy expenditure (PAEE) can be estimated by objective techniques (OTs). However, the use of questionnaires is frequent in clinical settings and epidemiological studies. We conducted a search on PubMed, Scopus, and Google Scholar databases to perform a review of studies reporting the reliability and validity of PA questionnaires validated against OTs—doubly labeled water (DLW) or accelerometers—in free-living adults. We selected original articles published between 2009 and 2019 that reported validation studies of PA questionnaires. We identified 53 studies that fulfilled the eligibility criteria. Four PA questionnaires were validated against DLW and the remaining against accelerometers. Three questionnaires were compared with both DLW and accelerometer results. The correlation between questionnaire-estimated PAEE and DLW results ranged from r = .22 to r = .46, while that between questionnaire-estimated total PA (TPA) and accelerometer results ranged from r = .11 to r = .54 The intraclass correlation coefficients were between .56 and .84. Despite having good reliability, most of the questionnaires included in this review have shown limited validity for estimating TPA in adults. OTs should be considered as a first option, when possible. Further research is warranted on techniques to obtain more accurate PA and PAEE estimates.

Download Full-text

The Reliability of the Manual Supination Resistance Test

Journal of the American Podiatric Medical Association ◽

10.7547/87507315-93-3-185 ◽

2003 ◽

Vol 93 (3) ◽

pp. 185-189 ◽

Cited By ~ 18

Author(s):

Hazel Noakes ◽

Craig Payne

Keyword(s):

Clinical Outcomes ◽

Correlation Coefficient ◽

Intraclass Correlation Coefficient ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Resistance Test ◽

Foot Orthoses ◽

Good Reliability ◽

Intraclass Correlation Coefficients ◽

Foot Pronation

Several decisions need to be made when prescribing foot orthoses for abnormal foot pronation. One of these decisions is how much force is needed from orthoses to supinate the foot. The supination resistance test has been described as one technique to help determine the amount of force needed. The aim of this project was to determine the reliability of the manual supination resistance test. Four clinicians of differing levels of experience performed the test on 44 subjects (88 feet) on 2 separate days. The test had good reliability overall, with an intertester intraclass correlation coefficient of 0.89. For the two more experienced clinicians, the intratester intraclass correlation coefficients were good (0.82 and 0.78), but for the two inexperienced clinicians they were poor (0.56 and 0.62). The supination resistance test may be clinically useful in the prescription of foot orthoses, but more work is needed to determine its validity and its relationship to clinical outcomes. (J Am Podiatr Med Assoc 93(3): 185-189, 2003)

Download Full-text

Cadence, Age, and Weight as Determinants of Forefoot Plantar Pressures Using the Biofoot In-shoe System

Journal of the American Podiatric Medical Association ◽

10.7547/0980302 ◽

2008 ◽

Vol 98 (4) ◽

pp. 302-310 ◽

Cited By ~ 13

Author(s):

Alfonso Martínez-Nova ◽

Javier Pascual Huerta ◽

Raquel Sánchez-Rodríguez

Keyword(s):

Healthy Subjects ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Metatarsal Head ◽

Multiple Regression Model ◽

Good Reliability ◽

Intraclass Correlation Coefficients ◽

Plantar Pressures ◽

Foot Biomechanics ◽

Peak Plantar Pressure

Background: We evaluated normal plantar pressures and studied the effect of weight, cadence, and age on forefoot plantar pressures in healthy subjects by using the Biofoot (Instituto de Biomecánica de Valencia, Valencia, Spain) in-shoe measurement system. Methods: The feet of 45 healthy subjects with no evident foot or lower-limb diseases were measured with the Biofoot in-shoe system. The forefoot was divided into seven areas: the first through fifth metatarsal heads, the hallux, and the second through fifth lesser toes. Three trials of 8 sec each were recorded twice in each subject, and the mean was used to analyze peak and mean plantar pressures. A multiple regression model including weight, age, and cadence was run for each metatarsal head, the hallux, and the lesser toes. Intraclass correlation coefficients and coefficients of variation were also calculated to assess reliability. Results: The second metatarsal head had the greatest peak (960 kPa) and mean (585.1 kPa) pressures, followed by the third metatarsal head. Weight and cadence combined explained 18% and 23% of peak plantar pressure at the second and third metatarsal heads, respectively (P < .001). The intraclass correlation coefficient varied from 0.76 to 0.96 for all variables. The coefficient of variation between sessions ranged from 5.8% to 9.0%. Conclusion: The highest peak and mean plantar pressures were found at the second and third metatarsal heads in healthy subjects. Weight, cadence, and age explained a low variability of this pressure pattern. The Biofoot in-shoe system has good reliability to measure plantar pressures. These data will have implications for the understanding of normal foot biomechanics and its determinants. (J Am Podiatr Med Assoc 98(4): 302–310, 2008)

Download Full-text

Test–Retest Reliability of Soccer Dribbling Tests in Children

Journal of Motor Learning and Development ◽

10.1123/jmld.2021-0003 ◽

2021 ◽

pp. 1-7

Author(s):

Michael J. Duncan ◽

Darren Richardson ◽

Rhys Morris ◽

Emma Eyre ◽

Neil D. Clarke

Keyword(s):

Coefficient Of Variation ◽

Pediatric Population ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Systematic Bias ◽

Limits Of Agreement ◽

Good Reliability ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Test Retest Reliability

The present study examined the test–retest reliability of the Ghent University dribbling test and short dribble test in a pediatric population. Fifty-four boys aged 9–14 years (mean ± SD = 11 ± 2 years) undertook the Ghent University and dribbling tests on two occasions separated by 2 weeks. Intraclass correlation coefficients, coefficient of variation, and 95% limits of agreement were conducted for each test. Intraclass correlation coefficients and coefficient of variation indicated good to excellent reliability, and relatively small variability for both dribbling tests. The 95% limits of agreement indicated relatively little bias Ghent University dribbling test for both running alone and running with the ball components, and the Bangsbo and Mohr short dribble test. The results of the present study suggest both dribbling tests examined in the present study demonstrate good reliability and low levels of systematic bias.

Download Full-text

Pellucid marginal degeneration versus keratoconus: distinction with wide-field SD-OCT corneal sublayer pachymetry

British Journal of Ophthalmology ◽

10.1136/bjophthalmol-2020-316496 ◽

2020 ◽

pp. bjophthalmol-2020-316496

Author(s):

Niklas Mohr ◽

Mehdi Shajari ◽

Daniel Krause ◽

Stefan Kassumeh ◽

Jakob Siedlecki ◽

...

Keyword(s):

Intraclass Correlation ◽

Area Under The Curve ◽

Correlation Coefficients ◽

Cross Sectional Study ◽

Wide Field ◽

Good Reliability ◽

Cross Sectional ◽

Intraclass Correlation Coefficients ◽

Calculated Ratio ◽

Sd Oct

PurposeTo evaluate the capability of wide-field spectral-domain optical coherence tomography (SD-OCT) corneal sublayer pachymetry to distinguish between keratoconus and pellucid marginal degeneration (PMD).MethodsThis prospective cross-sectional study included 69 eyes (59 with keratoconus and 10 with PMD) of 69 patients. All patients were examined three subsequent times with the RTVue XR system. Pachymetry maps of the total cornea (CT), the epithelium (ET) and the stroma were generated using the onboard software. For analysis of reliability, the coefficients of variation and intraclass correlation coefficients (ICC) were calculated. Receiver operating characteristic analyses were performed to elaborate the most accurate pachymetric parameters for distinguishing between PMD and keratoconus.ResultsOverall repeatability of (sublayer) pachymetry was comparably good in both keratoconus (ICC ranging between 0.827 and 0.986) and PMD (ICC ranging between 0.753 and 0.998). Measurement reliability in keratoconic eyes was a negative function of Kmax (p<0.05). As compared with keratoconus, PMD exhibited higher CT (526±31 µm vs 503±30 µm; p=0.02) and ET (56±7 µm vs 51±5 µm; p=0.02) in the inferotemporal 2–5 mm sector as well as lower ET in the inferior 7–9 mm sector (52±5 µm vs 57±5 µm; p<0.01). The calculated ratio between CT in the inferotemporal 2–5 mm and in the inferior 7–9 mm sector yielded the highest diagnostic accuracy for distinguishing between PMD and keratoconus with an area under the curve of 0.977 and an optimal cut-off value of 0.90.ConclusionWide-field SD-OCT corneal sublayer pachymetry showed good reliability in PMD and keratoconus and may be useful to differentiate between the two ectatic diseases.

Download Full-text

Assessment of nanoindentation in stiffness measurement of soft biomaterials: kidney, liver, spleen and uterus

Scientific Reports ◽

10.1038/s41598-020-75738-7 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Guanlin Wu ◽

Michael Gotthardt ◽

Maik Gollasch

Keyword(s):

Mechanical Properties ◽

Ex Vivo ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Biological Specimen ◽

Good Reliability ◽

Intraclass Correlation Coefficients ◽

Coefficients Of Variation ◽

Kidney Liver ◽

Soft Biomaterials

Abstract Nanoindentation technology with high spatial resolution and force sensitivity is widely used to measure the mechanical properties of hard biomaterials and tissues. However, its reliability to analyze soft biomaterials and organs has not been tested. Here, we evaluated the utility of nanoindentation to measure the passive mechanical properties of soft biological specimen. Kidney, liver, spleen and uterus samples were harvested from C57BL/6 N mice. We assessed test–retest repeatability in biological specimen and hydrogel controls using Bland–Altman diagrams, intraclass correlation coefficients (ICCs) and the within-subject coefficients of variation (COVs). The results were calculated using Hertzian, JKR and Oliver & Pharr models. Similar to hydrogels, Bland–Altman plots of all biological specimen showed good reliability in stiffness test and retest examinations. In gels, ICCs were larger than 0.8 and COVs were smaller than 15% in all three models. In kidney, liver, spleen and uterus, ICCs were consistently larger than 0.8 only in the Hertzian model but not in the JKR and Oliver & Pharr models. Similarly, COVs were consistently smaller than 15% in kidney, liver, spleen and uterus only in the Hertzian model but not in the other models. We conclude that nanoindentation technology is feasible in detecting the stiffness of kidney, liver, spleen and uterus. The Hertzian model is the preferred method to provide reliable results on ex vivo organ stiffness of the biological specimen under study.

Download Full-text

Three-Year Reliability of MEG Visual and Somatosensory Responses

Cerebral Cortex ◽

10.1093/cercor/bhaa372 ◽

2020 ◽

Author(s):

Marie C McCusker ◽

Brandon J Lew ◽

Tony W Wilson

Keyword(s):

Intraclass Correlation ◽

Correlation Coefficients ◽

Gamma Activity ◽

Evoked Responses ◽

Good Reliability ◽

Intraclass Correlation Coefficients ◽

Baseline Activity ◽

Somatosensory Responses ◽

High Degree

Abstract A major goal of many translational neuroimaging studies is the identification of biomarkers of disease. However, a prerequisite for any such biomarker is robust reliability, which for magnetoencephalography (MEG) and many other imaging modalities has not been established. In this study, we examined the reliability of visual (Experiment 1) and somatosensory gating (Experiment 2) responses in 19 healthy adults who repeated these experiments for three visits spaced 18 months apart. Visual oscillatory and somatosensory oscillatory and evoked responses were imaged, and intraclass correlation coefficients (ICC) were computed to examine the long-term reliability of these responses. In Experiment 1, ICCs showed good reliability for visual theta and alpha responses in occipital cortices, but poor reliability for gamma responses. In Experiment 2, the time series of somatosensory gamma and evoked responses in the contralateral somatosensory cortex showed good reliability. Finally, analyses of spontaneous baseline activity indicated excellent reliability for occipital alpha, moderate reliability for occipital theta, and poor reliability for visual/somatosensory gamma activity. Overall, MEG responses to visual and somatosensory stimuli show a high degree of reliability across 3 years and therefore may be stable indicators of sensory processing long term and thereby of potential interest as biomarkers of disease.

Download Full-text

Test–Retest Reliability of Common Measures of Eating Disorder Symptoms in Men Versus Women

Assessment ◽

10.1177/1073191117700267 ◽

2017 ◽

Vol 26 (3) ◽

pp. 419-431 ◽

Cited By ~ 10

Author(s):

Kelsie T. Forbush ◽

Lindsay A. Hilderbrand ◽

Brittany K. Bohrer ◽

Danielle A. N. Chapa

Keyword(s):

Binge Eating ◽

Intraclass Correlation ◽

Correlation Coefficients ◽

Self Report ◽

Good Reliability ◽

Retest Reliability ◽

Intraclass Correlation Coefficients ◽

Eating Disorder Symptoms ◽

Disinhibited Eating ◽

Test Retest Reliability

Approximately 10% to 30% of individuals with eating disorders (EDs) are male, yet because measures often have not been tested among male participants, it is unclear whether the psychometric properties of ED measures are equivalent between sexes. The purpose of this study was to compare the test–retest reliability of common ED measures in men versus women. Participants ( N = 227; 58.1% female) completed self-report measures of body dissatisfaction, restrained eating, disinhibited eating, bulimic symptoms, and desire-for-muscularity at baseline and 2-to-4 weeks later. Intraclass correlations were used to compute retest correlations. Spearman’s rho was used to compute retest correlations for skewed and kurtotic variables. We compared 95% confidence intervals for intraclass correlation coefficients to determine whether measures differed in reliability between sexes. Most ED measures had at least acceptable test–retest reliabilities. However, few measures of disinhibited and binge eating demonstrated good reliability in men. Results highlight the utility of several ED measures for assessing symptom change over time, and the need for additional research to identify and correct for sources of gender unreliability among ED self-report measures in men—particularly for assessing constructs that include binge-eating behavior.

Download Full-text