entropy compression
Recently Published Documents


TOTAL DOCUMENTS

26
(FIVE YEARS 7)

H-INDEX

6
(FIVE YEARS 1)

Author(s):  
Winfield Chen ◽  
Lloyd T. Elliott

We improve the efficiency of population genetic file formats and GWAS computation by leveraging the distribution of samples in population-level genetic data. We identify conditional exchangeability of these data, recommending finite state entropy algorithms as an arithmetic code naturally suited for compression of population genetic data. We show between [Formula: see text] and [Formula: see text] speed and size improvements over modern dictionary compression methods that are often used for population genetic data such as Zstd and Zlib in computation and decompression tasks. We provide open source prototype software for multi-phenotype GWAS with finite state entropy compression demonstrating significant space saving and speed comparable to the state-of-the-art.


2021 ◽  
Vol 125 ◽  
pp. 102163
Author(s):  
Rogério G. Alves ◽  
Aldo Procacci ◽  
Remy Sanchis

2021 ◽  
Author(s):  
Winfield Chen ◽  
Lloyd T. Elliott

AbstractWe improve the efficiency of population genetic file formats and GWAS computation by leveraging the distribution of sample ordering in population-level genetic data. We identify conditional exchangeability of these data, recommending finite state entropy algorithms as an arithmetic code naturally suited to population genetic data. We show between 10% and 40% speed and size improvements over dictionary compression methods for population genetic data such as Zstd and Zlib in computation and and decompression tasks. We provide a prototype for genome-wide association study with finite state entropy compression demonstrating significant space saving and speed comparable to the state-of-the-art.


10.37236/9667 ◽  
2020 ◽  
Vol 27 (3) ◽  
Author(s):  
Matthieu Rosenfeld

We propose a new proof technique that applies to the same problems as the  Lovász Local Lemma or the entropy-compression method. We present this approach in the context of non-repetitive colorings and we use it to improve upper-bounds relating different non-repetitive chromatic numbers to the maximal degree of a graph. It seems that there should be other interesting applications of the presented approach. In terms of upper-bounds our approach seems to be as strong as entropy-compression, but the proofs are more elementary and shorter. The applications we provide in this paper are upper bounds for graphs of maximal degree at most $\Delta$: a minor improvement on the upper-bound of the non-repetitive chromatic number, a $4.25\Delta +o(\Delta)$ upper-bound on the weak total non-repetitive chromatic number, and a $ \Delta^2+\frac{3}{2^{1/3}}\Delta^{5/3}+ o(\Delta^{5/3})$ upper-bound on the total non-repetitive chromatic number of graphs. This last result implies the same upper-bound for the non-repetitive chromatic index of graphs, which improves the best known bound. 


2020 ◽  
Vol 343 (4) ◽  
pp. 111772
Author(s):  
Daniel Gonçalves ◽  
Mickael Montassier ◽  
Alexandre Pinlou

2020 ◽  
Vol 245 ◽  
pp. 01001
Author(s):  
Michael Lettrich

In LHC Run 3, the upgraded ALICE detector will record 50 kHz Pb-Pb collisions using continuous readout. The resulting stream of raw data to be inspected increases to ~ 1 TB/s a hundredfold increase over Run 2 must be processed with a set of lossy and lossless compression and data reduction techniques to decrease the data rate to storage to 90 GB/s without affecting the physics. This contribution focuses on lossless entropy coding for ALICE Run 3 data which is the final component in the compression stage. We analyze data from the ALICE TPC and point out the challenges imposed by the non-standard data with a patchy distribution and symbol sizes of up to 25 Bit. We then explain why rANS, a variant of Asymmetric Numeral System coders is suitable for compressing this data effectively. Finally we present first compression performance numbers and bandwidth measurements obtained from a prototype implementation and give an outlook for future developments.


2018 ◽  
Vol 35 (3) ◽  
pp. 906-920 ◽  
Author(s):  
Laihao Ding ◽  
Guanghui Wang ◽  
Jianliang Wu

2017 ◽  
Vol 226 (15) ◽  
pp. 3251-3272 ◽  
Author(s):  
Nithin Nagaraj ◽  
Karthi Balasubramanian
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document