pooling design
Recently Published Documents


TOTAL DOCUMENTS

24
(FIVE YEARS 5)

H-INDEX

4
(FIVE YEARS 0)

2021 ◽  
Author(s):  
Camille Clouard ◽  
Kristiina Ausmees ◽  
Carl Nettelblad

Abstract Background: Despite continuing technological advances, the cost for large-scale genotyping of a high number of samples can be prohibitive. The purpose of this study is to design a cost-saving strategy for SNP genotyping. We suggest making use of pooling, a group testing technique, to drop the amount of SNP arrays needed. We believe that this will be of the greatest importance for non-model organisms with more limited resources in terms of cost-efficient large-scale chips and high-quality reference genomes, such as application in wildlife monitoring, plant and animal breeding, but it is in essence species-agnostic. The proposed approach consists in grouping and mixing individual DNA samples into pools before testing these pools on bead-chips, such that the number of pools is less than the number of individual samples. We present a statistical estimation algorithm, based on the pooling outcomes, for inferring marker-wise the most likely genotype of every sample in each pool. Finally, we input these estimated genotypes into existing imputation algorithms. We compare the imputation performance from pooled data with the Beagle algorithm, and a local likelihood-aware phasing algorithm closely modeled on MaCH that we implemented. Results: We conduct simulations based on human data from the 1000 Genomes Project, to aid comparison with other imputation studies. Based on the simulated data, we find that pooling impacts the genotype frequencies of the directly identifiable markers, without imputation. We also demonstrate how a combinatorial estimation of the genotype probabilities from the pooling design can improve the prediction performance of imputation models. Our algorithm achieves 93% concordance in predicting unassayed markers from pooled data, thus it outperforms the Beagle imputation model which reaches 80% concordance. We observe that the pooling design gives higher concordance for the rare variants than traditional low-density to high-density imputation commonly used for cost-effective genotyping of large cohorts. Conclusions: We present promising results for combining a pooling scheme for SNP genotyping with computational genotype imputation, as demonstrated in simulations on human data, while using half the number of assays needed for sample-wise genotyping. These results could find potential applications in any context where the genotyping costs form a limiting factor on the study size, such as in marker-assisted selection in plant breeding.


Author(s):  
Napoleón Vargas Jurado ◽  
Larry A Kuehn ◽  
John W Keele ◽  
Ronald M Lewis

Abstract Despite decreasing genotyping costs, in some cases individually genotyping animals is not economically feasible (e.g., in small ruminants). An alternative is to pool DNA, using the pooled allele frequency (PAF) to garner information on performance. Still, the use of PAF for prediction (estimation of genomic breeding values; GEBV) has been limited. Two potential sources of error on accuracy of GEBV of sires, obtained from PAF of their progeny themselves lacking pedigree information, were tested: i) pool construction error (unequal contribution of DNA from animals in pools), and ii) technical error (variability when reading the array). Pooling design (random, extremes, K-means), pool size (5, 10, 25, 50 and 100 individuals), and selection scenario (random, phenotypic) also were considered. These factors were tested by simulating a sheep population. Accuracy of GEBV—the correlation between true and estimated values—was not substantially affected by pool construction or technical error, or selection scenario. A significant interaction, however, between pool size and design was found. Still, regardless of design, mean accuracy was higher for pools of 10 or less individuals. Mean accuracy of GEBV was 0.174 (SE 0.001) for random pooling, and 0.704 (SE 0.004) and 0.696 (SE 0.004) for extreme and K-means pooling, respectively. Non-random pooling resulted in moderate accuracy of GEBV. Overall, pooled genotypes can be used in conjunction with individual genotypes of sires for moderately accurate predictions of their genetic merit with little effect of pool construction or technical error.


2020 ◽  
Author(s):  
Dimitrije Čvokić
Keyword(s):  

Увод/циљ: Сврха алгоритама групног тестирања јесте да обезбједе рационализацију ресурса. Стога, очекује се да се њиховим коришћењем могу, такође, остварити одређене уштеде при масовном тестирању RT(q)PCR методом ради идентификације заражених вирусом САРСКоВ2.Методе: Предочена су два приступа неадаптивног групног тестирања заснованог на удруживању узорака, у англојезичној литератури познатом као pooling design: Хвангов алгоритам уопштеног цијепања и матрична стратегија.Резултати: Уз дискусију добрих и лоших страна наведене су и оцјене максималног броја тестова. Матрична стратегија представљена је својеврсном модификацијом која смањује поменуту оцјену, а, с друге стране, не утиче много на комплексност процедуре, што је значајно за њену примјену у пракси.Закључак: Узимајући у обзир тренутну ситуацију, има смисла разматрати овакву врсту рационализације ресурса, ради повећања ефикасности епидемиолошкихмјера.


2020 ◽  
Vol 68 (4) ◽  
pp. 743-759
Author(s):  
Dimitrije Čvokić

Introduction/purpose: The purpose of group testing algorithms is to provide a more rational resource usage. Therefore, it is expected to improve the efficiency of large-scale COVID-19 screening as well. Methods: Two variants of non-adaptive group testing approaches are presented: Hwang's generalized binary-splitting algorithm and the matrix strategy. Results: The positive and negative sides of both approaches are discussed. Also, the estimations of the maximum number of tests are given. The matrix strategy is presented with a particular modification which reduces the corresponding estimation of the maximum number of tests and which does not affect the complexity of the procedure. This modification can be interesting from the applicability viewpoint. Conclusion: Taking into account the current situation, it makes sense to consider these methods in order to achieve some resource cuts in testing, thus making the epidemiological measures more efficient than they are now.


2015 ◽  
Author(s):  
Abhinav Ganesan ◽  
Sidharth Jaggi ◽  
Venkatesh Saligrama

This paper deals with an abstraction of a unified problem of drug discovery and pathogen identification. Here, the ``lead compounds'' are abstracted as inhibitors, pathogenic proteins as defectives, and the mixture of ``ineffective'' chemical compounds and non-pathogenic proteins as normal items. A defective could be immune to the presence of an inhibitor in a test. So, a test containing a defective is positive iff it does not contain its ``associated'' inhibitor. The goal of this paper is to identify the defectives, inhibitors, and their ``associations'' with high probability, or in other words, learn the Immune Defectives Graph (IDG). We propose a probabilistic non-adaptive pooling design, a probabilistic two-stage adaptive pooling design and decoding algorithms for learning the IDG. For the two-stage adaptive-pooling design, we show that the sample complexity of the number of tests required to guarantee recovery of the inhibitors, defectives and their associations with high probability, i.e., the upper bound, exceeds the proposed lower bound by a logarithmic multiplicative factor in the number of items. For the non-adaptive pooling design, in the large inhibitor regime, we show that the upper bound exceeds the proposed lower bound by a logarithmic multiplicative factor in the number of inhibitors.


2012 ◽  
Vol 6 (1) ◽  
pp. 220-238 ◽  
Author(s):  
Takafumi Kanamori ◽  
Hiroaki Uehara ◽  
Masakazu Jimbo

2011 ◽  
Vol 85 (1) ◽  
pp. 121-127
Author(s):  
FENGLIANG JIN ◽  
HOUCHUN ZHOU ◽  
JUAN XU

AbstractPooling designs are a very helpful tool for reducing the number of tests for DNA library screening. A disjunct matrix is usually used to represent the pooling design. In this paper, we construct a new family of disjunct matrices and prove that it has a good row to column ratio and error-tolerant property.


Sign in / Sign up

Export Citation Format

Share Document