population genetic data
Recently Published Documents


TOTAL DOCUMENTS

161
(FIVE YEARS 25)

H-INDEX

25
(FIVE YEARS 2)

2021 ◽  
pp. 1-14
Author(s):  
Anand Kumar ◽  
Rajesh Kumar ◽  
R. K. Kumawat ◽  
Pankaj Shrivastava ◽  
Rajesh Yadav ◽  
...  

Author(s):  
Winfield Chen ◽  
Lloyd T. Elliott

We improve the efficiency of population genetic file formats and GWAS computation by leveraging the distribution of samples in population-level genetic data. We identify conditional exchangeability of these data, recommending finite state entropy algorithms as an arithmetic code naturally suited for compression of population genetic data. We show between [Formula: see text] and [Formula: see text] speed and size improvements over modern dictionary compression methods that are often used for population genetic data such as Zstd and Zlib in computation and decompression tasks. We provide open source prototype software for multi-phenotype GWAS with finite state entropy compression demonstrating significant space saving and speed comparable to the state-of-the-art.


2021 ◽  
pp. 1-6
Author(s):  
Safia A. Messaoudi ◽  
Saranya R. Babu ◽  
Abrar B. Alsaleh ◽  
Mohammed Albujja ◽  
Noora R. Al-Snan ◽  
...  

Author(s):  
Dairis Morillo ◽  
Francheska Acosta ◽  
Santa Jiménez ◽  
Víctor Calderón ◽  
Patricia León ◽  
...  

eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Joseph Marcus ◽  
Wooseok Ha ◽  
Rina Foygel Barber ◽  
John Novembre

Spatial population genetic data often exhibits ‘isolation-by-distance,’ where genetic similarity tends to decrease as individuals become more geographically distant. The rate at which genetic similarity decays with distance is often spatially heterogeneous due to variable population processes like genetic drift, gene flow, and natural selection. Petkova et al., 2016 developed a statistical method called Estimating Effective Migration Surfaces (EEMS) for visualizing spatially heterogeneous isolation-by-distance on a geographic map. While EEMS is a powerful tool for depicting spatial population structure, it can suffer from slow runtimes. Here, we develop a related method called Fast Estimation of Effective Migration Surfaces (FEEMS). FEEMS uses a Gaussian Markov Random Field model in a penalized likelihood framework that allows for efficient optimization and output of effective migration surfaces. Further, the efficient optimization facilitates the inference of migration parameters per edge in the graph, rather than per node (as in EEMS). With simulations, we show conditions under which FEEMS can accurately recover effective migration surfaces with complex gene-flow histories, including those with anisotropy. We apply FEEMS to population genetic data from North American gray wolves and show it performs favorably in comparison to EEMS, with solutions obtained orders of magnitude faster. Overall, FEEMS expands the ability of users to quickly visualize and interpret spatial structure in their data.


Author(s):  
Andrei Semikhodskii ◽  
Yevgeniy Krassotkin ◽  
Tatiana Makarova ◽  
Vladislav Zavarin ◽  
Viktoria Ilina ◽  
...  

2021 ◽  
Author(s):  
Winfield Chen ◽  
Lloyd T. Elliott

AbstractWe improve the efficiency of population genetic file formats and GWAS computation by leveraging the distribution of sample ordering in population-level genetic data. We identify conditional exchangeability of these data, recommending finite state entropy algorithms as an arithmetic code naturally suited to population genetic data. We show between 10% and 40% speed and size improvements over dictionary compression methods for population genetic data such as Zstd and Zlib in computation and and decompression tasks. We provide a prototype for genome-wide association study with finite state entropy compression demonstrating significant space saving and speed comparable to the state-of-the-art.


2021 ◽  
Vol 48 ◽  
pp. 101825
Author(s):  
Huyen Linh Tran ◽  
Thi Anh May Ta ◽  
Ngoc Nam Nguyen ◽  
Thanh Tung Pham ◽  
Ha Hoang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document