cryptic relatedness
Recently Published Documents


TOTAL DOCUMENTS

24
(FIVE YEARS 3)

H-INDEX

7
(FIVE YEARS 1)

2020 ◽  
Vol 10 (8) ◽  
pp. 2787-2799 ◽  
Author(s):  
Emily Humble ◽  
Anneke J. Paijmans ◽  
Jaume Forcada ◽  
Joseph I. Hoffman

High density single nucleotide polymorphism (SNP) arrays allow large numbers of individuals to be rapidly and cost-effectively genotyped at large numbers of genetic markers. However, despite being widely used in studies of humans and domesticated plants and animals, SNP arrays are lacking for most wild organisms. We developed a custom 85K Affymetrix Axiom array for an intensively studied pinniped, the Antarctic fur seal (Arctocephalus gazella). SNPs were discovered from a combination of genomic and transcriptomic resources and filtered according to strict criteria. Out of a total of 85,359 SNPs tiled on the array, 75,601 (88.6%) successfully converted and were polymorphic in 270 animals from a breeding colony at Bird Island in South Georgia. Evidence was found for inbreeding, with three genomic inbreeding coefficients being strongly intercorrelated and the proportion of the genome in runs of homozygosity being non-zero in all individuals. Furthermore, analysis of genomic relatedness coefficients identified previously unknown first-degree relatives and multiple second-degree relatives among a sample of ostensibly unrelated individuals. Such “cryptic relatedness” within fur seal breeding colonies may increase the likelihood of consanguineous matings and could therefore have implications for understanding fitness variation and mate choice. Finally, we demonstrate the cross-amplification potential of the array in three related pinniped species. Overall, our SNP array will facilitate future studies of Antarctic fur seals and has the potential to serve as a more general resource for the wider pinniped research community.


2020 ◽  
Author(s):  
Emily Humble ◽  
Anneke J. Paijmans ◽  
Jaume Forcada ◽  
Joseph I. Hoffman

ABSTRACTHigh density single nucleotide polymorphism (SNP) arrays allow large numbers of individuals to be rapidly and cost-effectively genotyped at large numbers of genetic markers. However, despite being widely used in studies of humans and domesticated plants and animals, SNP arrays are lacking for most wild organisms. We developed a custom 90K Affymetrix Axiom array for an intensively studied pinniped, the Antarctic fur seal (Arctocephalus gazella). SNPs were discovered from a combination of genomic and transcriptomic resources and filtered according to strict criteria. Out of a total of 85,359 SNPs tiled on the array, 75,601 (88.6%) successfully converted and were polymorphic in 274 animals from a breeding colony at Bird Island in South Georgia. Evidence was found for inbreeding, with three genomic inbreeding coefficients being strongly intercorrelated and the proportion of the genome in ROH being non-zero in all individuals. Furthermore, analysis of genomic relatedness coefficients identified multiple second and third order relatives among a sample of ostensibly unrelated individuals. Such “cryptic relatedness” within fur seal breeding colonies may increase the likelihood of consanguinous matings and could therefore have implications for understanding fitness variation and mate choice. Finally, we demonstrate the cross-amplification potential of the array in three related species. Overall, our SNP array will facilitate future studies of Antarctic fur seals and has the potential to serve as a more general resource for the wider pinniped research community.


2020 ◽  
Author(s):  
Nichol Schultz ◽  
Kent Weigel

AbstractLinear mixed models are effective tools to identify genetic loci contributing to phenotypic variation while handling confounding due to population structure and cryptic relatedness. Recent improvements of the linear mixed model for genome-wide association analysis have been directed at more accurately modeling loci of large effect. We describe FFselect (https://github.com/NicholSchultz/FFselect), a novel method that both builds upon recent advances and further extends the linear mixed model for genome-wide association analysis to allow modeling of shared environmental effects. FFselect improves power, controls false discovery rate, and simultaneously corrects for environmental confounding to improve the utility of GWAS.


2018 ◽  
Vol 35 (15) ◽  
pp. 2683-2685
Author(s):  
Ehsan Ullah ◽  
Michaël Aupetit ◽  
Arun Das ◽  
Abhishek Patil ◽  
Noora Al Muftah ◽  
...  

Abstract Motivation It is important to characterize individual relatedness in terms of familial relationships and underlying population structure in genome-wide association studies for correct downstream analysis. The characterization of individual relatedness becomes vital if the cohort is to be used as reference panel in other studies for association tests and for identifying ethnic diversities. In this paper, we propose a kinship visualization tool to detect cryptic relatedness between subjects. We utilize multi-dimensional scaling, bar charts, heat maps and node-link visualizations to enable analysis of relatedness information. Availability and implementation Available online as well as can be downloaded at http://shiny-vis.qcri.org/public/kinvis/. Supplementary information Supplementary data are available at Bioinformatics online.


2018 ◽  
Vol 18 (3) ◽  
pp. 435-447
Author(s):  
Pablo A. S. Fonseca ◽  
Thiago P. Leal ◽  
Fernanda C. Santos ◽  
Mateus H. Gouveia ◽  
Samir Id-Lahoucine ◽  
...  

2017 ◽  
Author(s):  
Jeffrey Staples ◽  
Evan K. Maxwell ◽  
Nehal Gosalia ◽  
Claudia Gonzaga-Jauregui ◽  
Christopher Snyder ◽  
...  

AbstractLarge-scale human genetics studies are ascertaining increasing proportions of populations as they continue growing in both number and scale. As a result, the amount of cryptic relatedness within these study cohorts is growing rapidly and has significant implications on downstream analyses. We demonstrate this growth empirically among the first 92,455 exomes from the DiscovEHR cohort and, via a custom simulation framework we developed called SimProgeny, show that these measures are in-line with expectations given the underlying population and ascertainment approach. For example, we identified ∼66,000 close (first- and second-degree) relationships within DiscovEHR involving 55.6% of study participants. Our simulation results project that >70% of the cohort will be involved in these close relationships as DiscovEHR scales to 250,000 recruited individuals. We reconstructed 12,574 pedigrees using these relationships (including 2,192 nuclear families) and leveraged them for multiple applications. The pedigrees substantially improved the phasing accuracy of 20,947 rare, deleterious compound heterozygous mutations. Reconstructed nuclear families were critical for identifying 3,415 de novo mutations in ∼1,783 genes. Finally, we demonstrate the segregation of known and suspected disease-causing mutations through reconstructed pedigrees, including a tandem duplication in LDLR causing familial hypercholesterolemia. In summary, this work highlights the prevalence of cryptic relatedness expected among large healthcare population genomic studies and demonstrates several analyses that are uniquely enabled by large amounts of cryptic relatedness.


2017 ◽  
Author(s):  
Dominic Holland ◽  
Chun-Chieh Fan ◽  
Oleksandr Frei ◽  
Alexey A. Shadrin ◽  
Olav B. Smeland ◽  
...  

AbstractCryptic relatedness is inherently a feature of large genome-wide association studies (GWAS), and can give rise to considerable inflation in summary statistics for single nucleotide polymorphism (SNP) associations with phenotypes. It has proven difficult to disentangle these inflationary effects from true polygenic effects. Here we present results of a model that enables estimation of polygenicity, mean strength of association, and residual inflation in GWAS summary statistics. We show that there is substantial residual inflation in recent large GWAS of height and schizophrenia; correcting for this reduces the number of independent genome-wide significant loci from the reported values of 697 for height and 108 for schizophrenia to 368 and 61, respectively. In contrast, a larger GWAS of educational attainment shows no residual inflation. Additionally, we find that height has a relatively low polygenicity, with approximately 8k SNPs having causal association, more than an order of magnitude less than has been reported. The residual inflation in GWAS summary statistics can be corrected using the standard genomic control procedure with the estimated residual inflation factor.


2014 ◽  
Author(s):  
Matthew P Conomos ◽  
Michael B Miller ◽  
Timothy A Thornton

Population structure inference with genetic data has been motivated by a variety of applications in population genetics and genetic association studies. Several approaches have been proposed for the identification of genetic ancestry differences in samples where study participants are assumed to be unrelated, including principal components analysis (PCA), multi-dimensional scaling (MDS), and model-based methods for proportional ancestry estimation. Many genetic studies, however, include individuals with some degree of relatedness, and existing methods for inferring genetic ancestry fail in related samples. We present a method, PC-AiR, for robust population structure inference in the presence of known or cryptic relatedness. PC-AiR utilizes genome-screen data and an efficient algorithm to identify a diverse subset of unrelated individuals that is representative of all ancestries in the sample. The PC-AiR method directly performs PCA on the identified ancestry representative subset and then predicts components of variation for all remaining individuals based on genetic similarities. In simulation studies and in applications to real data from Phase III of the HapMap Project, we demonstrate that PC-AiR provides a substantial improvement over existing approaches for population structure inference in related samples. We also demonstrate significant efficiency gains, where a single axis of variation from PC-AiR provides better prediction of ancestry in a variety of structure settings than using ten (or more) components of variation from widely used PCA and MDS approaches. Finally, we illustrate that PC-AiR can provide improved population stratification correction over existing methods in genetic association studies with population structure and relatedness.


Sign in / Sign up

Export Citation Format

Share Document