ascertainment bias
Recently Published Documents


TOTAL DOCUMENTS

141
(FIVE YEARS 27)

H-INDEX

27
(FIVE YEARS 5)

2021 ◽  
Vol 51 ◽  
pp. e81-e82
Author(s):  
Younga Heather Lee ◽  
Tanayott Thaweethai ◽  
Yi-han Sheu ◽  
Yen-Chen Anne Feng ◽  
Tian Ge ◽  
...  

2021 ◽  
Author(s):  
Sharon E. Johnatty ◽  
Tina Pesaran ◽  
Jill Dolinsky ◽  
Amal Yussuf ◽  
Holly La Duca ◽  
...  

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Johannes Geibel ◽  
Christian Reimer ◽  
Torsten Pook ◽  
Steffen Weigend ◽  
Annett Weigend ◽  
...  

Abstract Background Population genetic studies based on genotyped single nucleotide polymorphisms (SNPs) are influenced by a non-random selection of the SNPs included in the used genotyping arrays. The resulting bias in the estimation of allele frequency spectra and population genetics parameters like heterozygosity and genetic distances relative to whole genome sequencing (WGS) data is known as SNP ascertainment bias. Full correction for this bias requires detailed knowledge of the array design process, which is often not available in practice. This study suggests an alternative approach to mitigate ascertainment bias of a large set of genotyped individuals by using information of a small set of sequenced individuals via imputation without the need for prior knowledge on the array design. Results The strategy was first tested by simulating additional ascertainment bias with a set of 1566 chickens from 74 populations that were genotyped for the positions of the Affymetrix Axiom™ 580 k Genome-Wide Chicken Array. Imputation accuracy was shown to be consistently higher for populations used for SNP discovery during the simulated array design process. Reference sets of at least one individual per population in the study set led to a strong correction of ascertainment bias for estimates of expected and observed heterozygosity, Wright’s Fixation Index and Nei’s Standard Genetic Distance. In contrast, unbalanced reference sets (overrepresentation of populations compared to the study set) introduced a new bias towards the reference populations. Finally, the array genotypes were imputed to WGS by utilization of reference sets of 74 individuals (one per population) to 98 individuals (additional commercial chickens) and compared with a mixture of individually and pooled sequenced populations. The imputation reduced the slope between heterozygosity estimates of array data and WGS data from 1.94 to 1.26 when using the smaller balanced reference panel and to 1.44 when using the larger but unbalanced reference panel. This generally supported the results from simulation but was less favorable, advocating for a larger reference panel when imputing to WGS. Conclusions The results highlight the potential of using imputation for mitigation of SNP ascertainment bias but also underline the need for unbiased reference sets.


Author(s):  
Jonathan O’B. Hourihane ◽  
Aideen M. Byrne ◽  
Katharina Blümchen ◽  
Paul J. Turner ◽  
Matthew Greenhawt

Author(s):  
Kotaro Dokan ◽  
Sayu Kawamura ◽  
Kosuke M Teshima

Abstract Single nucleotide polymorphism (SNP) data are widely used in research on natural populations. Although they are useful, SNP genotyping data are known to contain bias, normally referred to as ascertainment bias, because they are conditioned by already confirmed variants. This bias is introduced during the genotyping process, including the selection of populations for novel SNP discovery and the number of individuals involved in the discovery panel and selection of SNP markers. It is widely recognized that ascertainment bias can cause inaccurate inferences in population genetics and several methods to address these bias issues have been proposed. However, especially in natural populations, it is not always possible to apply an ideal ascertainment scheme because natural populations tend to have complex structures and histories. In addition, it was not fully assessed if ascertainment bias has the same effect on different types of population structure. Here we examine the effects of bias produced during the selection of population for SNP discovery and consequent SNP marker selection processes under three demographic models: the island, stepping-stone, and population split models. Results show that site frequency spectra and summary statistics contain biases that depend on the joint effect of population structure and ascertainment schemes. Additionally, population structure inferences are also affected by ascertainment bias. Based on these results, it is recommended to evaluate the validity of the ascertainment strategy prior to the actual typing process because the direction and extent of ascertainment bias vary depending on several factors.


2021 ◽  
Vol 26 (2) ◽  
pp. 180-186
Author(s):  
Mazen S Albaghdadi ◽  
Michael N Young ◽  
Mohammed M. Chowdhury ◽  
Susan Assmann ◽  
Taye Hamza ◽  
...  

Ascertainment bias is a well-recognized source of bias in research, but few studies have systematically analyzed sources of ascertainment bias in randomized trials in which blinding is not possible and endpoint assessment is not protocolized. In the current study, we sought to evaluate differences in the clinical practice patterns of trial investigators with respect to bias in the ascertainment of pre-revascularization patient risk and the incidence of secondary endpoints post-revascularization. We conducted a cross-sectional survey of active investigators ( n = 936) from the Best Endovascular Versus Best Surgical Therapy for Patients with Critical Limb Ischemia (BEST-CLI) trial. The total survey response rate was 19.6% (183/936). Vascular surgeons were more likely than nonsurgical interventionalists to order tests for cardiac complications after both surgical bypass ( p < 0.001) and endovascular revascularization ( p = 0.038). Post-procedure, investigators were more likely to order additional testing for cardiac complications in open surgery versus endovascular cases (7% vs 16% never, 41% vs 65% rarely, 43% vs 17% sometimes, 9% vs 2% always, respectively; p < 0.0001). Significant variation in practice patterns exist in the pre- and post-procedure assessment of cardiac risk and events for patients with CLI undergoing revascularization. Variation in the ascertainment of risk and outcomes according to the type of revascularization procedure and physician specialty should be considered when interpreting the results of clinical studies, such as the BEST-CLI trial. ClinicalTrials.gov Identifier: NCT02060630


PLoS ONE ◽  
2021 ◽  
Vol 16 (3) ◽  
pp. e0245178
Author(s):  
Johannes Geibel ◽  
Christian Reimer ◽  
Steffen Weigend ◽  
Annett Weigend ◽  
Torsten Pook ◽  
...  

Single nucleotide polymorphisms (SNPs), genotyped with arrays, have become a widely used marker type in population genetic analyses over the last 10 years. However, compared to whole genome re-sequencing data, arrays are known to lack a substantial proportion of globally rare variants and tend to be biased towards variants present in populations involved in the development process of the respective array. This affects population genetic estimators and is known as SNP ascertainment bias. We investigated factors contributing to ascertainment bias in array development by redesigning the Axiom™ Genome-Wide Chicken Array in silico and evaluating changes in allele frequency spectra and heterozygosity estimates in a stepwise manner. A sequential reduction of rare alleles during the development process was shown. This was mainly caused by the identification of SNPs in a limited set of populations and a within-population selection of common SNPs when aiming for equidistant spacing. These effects were shown to be less severe with a larger discovery panel. Additionally, a generally massive overestimation of expected heterozygosity for the ascertained SNP sets was shown. This overestimation was 24% higher for populations involved in the discovery process than not involved populations in case of the original array. The same was observed after the SNP discovery step in the redesign. However, an unequal contribution of populations during the SNP selection can mask this effect but also adds uncertainty. Finally, we make suggestions for the design of specialized arrays for large scale projects where whole genome re-sequencing techniques are still too expensive.


2021 ◽  
pp. 174077452098007
Author(s):  
Denise A Esserman ◽  
Thomas M Gill ◽  
Michael E Miller ◽  
Erich J Greene ◽  
James D Dziura ◽  
...  

Background/Aim In clinical trials, there is potential for bias from unblinded observers that may influence ascertainment of outcomes. This issue arose in the Strategies to Reduce Injuries and Develop Confidence in Elders trial, a cluster randomized trial to test a multicomponent intervention versus enhanced usual care (control) to prevent serious fall injuries, originally defined as a fall injury leading to medical attention. An unblinded nurse falls care manager administered the intervention, while the usual care arm did not involve contact with a falls care manager. Thus, there was an opportunity for falls care managers to refer participants reporting falls to seek medical attention. Since this type of observer bias could not occur in the usual care arm, there was potential for additional falls to be reported in the intervention arm, leading to dilution of the intervention effect and a reduction in study power. We describe the clinical basis for ascertainment bias, the statistical approach used to assess it, and its effect on study power. Methods The prespecified interim monitoring plan included a decision algorithm for assessing ascertainment bias and adapting (revising) the primary outcome definition, if necessary. The original definition categorized serious fall injuries requiring medical attention into Type 1 (fracture other than thoracic/lumbar vertebral, joint dislocation, cut requiring closure) and Type 2 (head injury, sprain or strain, bruising or swelling, other). The revised definition, proposed by the monitoring plan, excluded Type 2 injuries that did not necessarily require an overnight hospitalization since these would be most subject to bias. These injuries were categorized into those with (Type 2b) and without (Type 2c) medical attention. The remaining Type 2a injuries required medical attention and an overnight hospitalization. We used the ratio of 2b/(2b + 2c) in intervention versus control as a measure of ascertainment bias; ratios > 1 indicated the likelihood of falls care manager bias. We determined the effect of ascertainment bias on study power for the revised (Types 1 and 2a) versus original definition (Types 1, 2a, and 2b). Results The estimate of ascertainment bias was 1.14 (95% confidence interval: 0.98, 1.30), providing evidence of the likelihood of falls care manager bias. We estimated that this bias diluted the hazard ratio from the hypothesized 0.80 to 0.86 and reduced power to under 80% for the original primary outcome definition. In contrast, adapting the revised definition maintained study power at nearly 90%. Conclusion There was evidence of ascertainment bias in the Strategies to Reduce Injuries and Develop Confidence in Elders trial. The decision to adapt the primary outcome definition reduced the likelihood of this bias while preserving the intervention effect and study power.


Author(s):  
Jeremy D. Wortzel ◽  
Douglas J. Wiebe ◽  
Shabnam Elahi ◽  
Atu Agawu ◽  
Frances K. Barg ◽  
...  

This paper describes follow-up for a cohort of 4530 residents living in the asbestos manufacturing community of Ambler, PA, U.S. in 1930. Using re-identified census data, cause and date of death data obtained from the genealogic website Ancestry.com, along with geospatial analysis, we explored relationships among demographic characteristics, occupational, paraoccupational and environmental asbestos exposures. We identified death data for 2430/4530 individuals. Exposure differed significantly according to race, gender, age, and recency of immigration to the U.S. Notably, there was a significant difference in the availability of year of death information for non-white vs. white individuals (odds ratio (OR) = 0.62 p-value < 0.001), females (OR = 0.53, p-value < 0.001), first-generation immigrants (OR = 0.67, p-value = 0.001), second-generation immigrants (OR = 0.31, p-value < 0.001) vs. non-immigrants, individuals aged less than 20 (OR = 0.31 p-value < 0.001) and individuals aged 20 to 59 (OR = 0.63, p-value < 0.001) vs. older individuals. Similarly, the cause of death was less often available for non-white individuals (OR = 0.42, p-value <0.001), first-generation immigrants and (OR = 0.71, p-value = 0.009), second-generation immigrants (OR = 0.49, p-value < 0.001), individuals aged less than 20 (OR = 0.028 p-value < 0.001), and individuals aged 20 to 59 (OR = 0.26, p-value < 0.001). These results identified ascertainment bias that is important to consider in analyses that investigate occupational, para-occupational and environmental asbestos exposure as risk factors for mortality in this historic cohort. While this study attempts to describe methods for assessing itemized asbestos exposure profiles for a community in 1930 using Ancestry.com and other publicly accessible databases, it also highlights how historic cohort studies likely underestimate the impact of asbestos exposure on vulnerable populations. Future work will aim to assess mortality patterns in this cohort.


Sign in / Sign up

Export Citation Format

Share Document