combinatorial encoding Latest Research Papers

Storage needs represent a significant burden on the economy and the environment. Some of this can potentially be offset by improved density molecular storage. The potential of using DNA for storing data is immense. DNA can be harnessed as a high density, durable archiving medium for compressing and storing the exponentially growing quantities of digital data that mankind generates. Several studies have demonstrated the potential of DNA-based data storage systems. These include exploration of different encoding and error correction schemes and the use of different technologies for DNA synthesis and sequencing. Recently, the use of composite DNA letters has been demonstrated to leverage the inherent redundancy in DNA based storage systems to achieve higher logical density, offering a more cost-effective approach. However, the suggested composite DNA approach is still limited due to its sensitivity to the stochastic nature of the process. Combinatorial assembly methods were also suggested to encode information on DNA in high density, while avoding the challenges of the stochastic system. These are based on enzynatic assembly processes for producing the synthetic DNA. In this paper, we propose a novel method to encode information into DNA molecules using combinatorial encoding and shortmer DNA synthesis, in compatibility with current chemical DNA synthesis technologies. Our approach is based on a set of easily distinguishable DNA shortmers serving as building blocks and allowing for near-zero error rates. We construct an extended combinatorial alphabet in which every letter is a subset of the set of building blocks. We suggest different combinatorial encoding schemes and explore their theoretical properties and practical implications in terms of error probabilities and required sequencing depth. To demonstrate the feasibility of our approach, we implemented an end-to-end computer simulation of a DNA-based storage system, using our suggested combinatorial encodings. We use simulations to assess the performance of the system and the effect of different parameters. Our simulations suggest that our combinatorial approach can potentially achieve up to 6.5-fold increase in the logical density over standard DNA based storage systems, with near zero reconstruction error. Implementing our approach at scale to perform actual synthesis, requires minimal alterations to current technologies. Our work thus suggests that the combination of combinatorial encoding with standard DNA chemical synthesis technologies can potentially improve current solutions, achieving scalable, efficient and cost- effective DNA-based storage.

Download Full-text

Adaptation of Combinatorial Encoding Scheme to Index Compression

2020 28th Signal Processing and Communications Applications Conference (SIU) ◽

10.1109/siu49456.2020.9302061 ◽

2020 ◽

Author(s):

Can Ozbey

Keyword(s):

Encoding Scheme ◽

Index Compression ◽

Combinatorial Encoding

Download Full-text

The Problem of Combinatorial Encoding of a Continuous Dynamics and the Notion of Transfer of Paths in Graphs

Journal of Mathematical Sciences ◽

10.1007/s10958-020-04827-9 ◽

2020 ◽

Vol 247 (5) ◽

pp. 646-656

Author(s):

A. M. Vershik

Keyword(s):

Combinatorial Encoding

Download Full-text

Combinatorial Encoding of Bernoulli Schemes and the Asymptotic Behavior of Young Tableaux

Functional Analysis and Its Applications ◽

10.1134/s001626632002001x ◽

2020 ◽

Vol 54 (2) ◽

pp. 77-92

Author(s):

A. M. Vershik

Keyword(s):

Asymptotic Behavior ◽

Young Tableaux ◽

Combinatorial Encoding

Download Full-text

Shuffle-Seq: En masse combinatorial encoding for n-way genetic interaction screens

10.1101/861443 ◽

2019 ◽

Author(s):

Atray Dixit ◽

Olena Kuksenko ◽

David Feldman ◽

Aviv Regev

Keyword(s):

Large Scale ◽

Genetic Interaction ◽

Synthetic Lethality ◽

Genetic Interactions ◽

Pairwise Interactions ◽

Affect Cell Viability ◽

Melanoma Model ◽

Dividing Cells ◽

Combinatorial Encoding ◽

Massive Number

AbstractGenetic interactions, defined as the non-additive phenotypic impact of combinations of genes, are a hallmark of the mapping from genotype to phenotype. However, genetic interactions remain challenging to systematically test given the massive number of possible combinations. In particular, while large-scale screening efforts in yeast have quantified pairwise interactions that affect cell viability, or synthetic lethality, between all pairs of genes as well as for a limited number of three-way interactions, it has previously been intractable to perform the large screens needed to comprehensively assess interactions in a mammalian genome. Here, we develop Shuffle-Seq, a scalable method to assay genetic interactions. Shuffle-Seq leverages the co-inheritance of genetically encoded barcodes in dividing cells and can scale in proportion to sequencing throughput. We demonstrate the technical validity of Shuffle-Seq and apply it to screening for mechanisms underlying drug resistance in a melanoma model. Shuffle-Seq should allow screens of hundreds of millions of combinatorial perturbations and facilitate the understanding of genetic dependencies and drug sensitivities.

Download Full-text

GenEpi: Gene-based Epistasis Discovery Using Machine Learning

10.1101/421719 ◽

2018 ◽

Author(s):

Yu-Chuan Chang ◽

June-Tai Wu ◽

Ming-Yi Hong ◽

Yi-An Tung ◽

Ping-Han Hsieh ◽

...

Keyword(s):

Machine Learning ◽

Genetic Variants ◽

Prediction Models ◽

Association Studies ◽

Simulated Data ◽

Ground Truth ◽

Real Data ◽

Genome Wide Association Studies ◽

Combinatorial Encoding ◽

Python Package

AbstractGenome-wide association studies (GWAS) provide a powerful means to identify associations between genetic variants and phenotypes. However, GWAS techniques for detecting epistasis, the interactions between genetic variants associated with phenotypes, are still limited. We believe that developing an efficient and effective GWAS method to detect epistasis will be a key for discovering sophisticated pathogenesis, which is especially important for complex diseases such as Alzheimer’s disease (AD). In this regard, this study presents GenEpi, a computational package to uncover epistasis associated with phenotypes by the proposed machine learning approach. GenEpi identifies both within-gene and cross-gene epistasis through a two-stage modeling workflow. In both stages, GenEpi adopts two-element combinatorial encoding when producing features and constructs the prediction models by L1-regularized regression with stability selection. The simulated data showed that GenEpi outperforms other widely-used methods on detecting ground-truth epistasis. As real data is concerned, this study uses AD as an example to reveal the capability of GenEpi in finding disease-related variants and variant interactions that show both biological meanings and predictive power. Availability: GenEpi is an open-source python package and available free of charge only for non-commercial users. The package can be downloaded from https://github.com/Chester75321/GenEpi, and has also been published on The Python Package Index.

Download Full-text

A Molecular Odorant Transduction Model and the Complexity of Combinatorial Encoding in the Drosophila Antenna

10.1101/237669 ◽

2017 ◽

Author(s):

Aurel A. Lazar ◽

Chung-Heng Yeh

Keyword(s):

Theoretical Foundation ◽

Odorant Receptor ◽

Fruit Fly ◽

Dissociation Rate ◽

Odorant Receptors ◽

Molecular Models ◽

The Past ◽

Binding Rate ◽

Combinatorial Encoding ◽

Key Aspects

AbstractIn the past two decades, a substantial amount of work characterized the odorant receptors, neuroanatomy and odorant response properties of the early olfactory system of Drosophila melanogaster. Yet many odorant receptors remain only partially characterized and, the odorant transduction process and the axon hillock spiking mechanism of the olfactory sensory neurons (OSNs) have yet to be fully determined.Identity and concentration, two key aspects of olfactory coding, originate in the odorant transduction process. Detailed molecular models of the odorant transduction process are, however, scarce for fruit flies. To address these challenges we advance a comprehensive model of fruit fly OSNs as a cascade consisting of an odorant transduction process (OTP) and a biophysical spike generator (BSG). We model identity and concentration in OTP using an odorant-receptor binding rate tensor, modulated by the odorant concentration profile, and an odorant-receptor dissociation rate tensor, and quantitatively describe the ligand binding/dissociation process. We model the BSG as a Connor-Stevens point neuron.The resulting combinatorial encoding model of the Drosophila antenna provides a theoretical foundation for understanding the neural code of both odorant identity and odorant concentration and advances the state-of-the-art in a number of ways. First, it quantifies on the molecular level the combinatorial complexity of the transformation taking place in the antennae. The concentration-dependent combinatorial code determines the complexity of the input space driving olfactory processing in the downstream neuropils, such as odorant recognition and olfactory associative learning. Second, the model is biologically validated using multiple electrophysiology recordings. Third, the model demonstrates that the currently available data for odorant-receptor responses only enable the estimation of the affinity of the odorant-receptor pairs. The odorant-dissociation rate is only available for a few odorant-receptor pairs. Finally, our model calls for new experiments for massively identifying the odorant-receptor dissociation rates of relevance to flies.

Download Full-text

Olfactory receptors are sensitive to molecular volume of odorants

10.1101/013516 ◽

2015 ◽

Cited By ~ 2

Author(s):

Majid Saberi ◽

Hamed Seyed-allaei

Keyword(s):

Olfactory Receptor ◽

Experimental Studies ◽

Chemical Properties ◽

Olfactory Receptors ◽

Binding Pocket ◽

Olfactory Receptor Neuron ◽

Neural Response ◽

Molecular Volume ◽

Combinatorial Encoding ◽

Two Parameters

To study olfaction,  first we should know which physical or chemical properties of odorant molecules determine   the response of olfactory receptor neurons,   and then we should study the effect of those properties on the combinatorial encoding in olfactory system.    In this work we show that the response of an olfactory receptor neuron in Drosophila depends on molecular volume of an odorant;   The molecular volume determines the upper limits of the neural response,   while the actual neural response may depend on other properties of the molecules.  Each olfactory receptor prefers a particular volume,   with some degree of flexibility.  These two parameters predict the volume and flexibility of the binding-pocket of the olfactory receptors,   which are the targets of structural biology studies.     At the end we argue that the molecular volume can affects the quality of perceived smell of an odorant via the combinatorial encoding,  molecular volume may mask other underlying relations between properties of molecules and neural responses   and we suggest a way to improve the selection of odorants in further experimental studies.

Download Full-text

Extended Multiplexed Fluorescent In Situ Hybridization by Combinatorial Encoding of Individual Transcripts

In Situ Hybridization Methods - Neuromethods ◽

10.1007/978-1-4939-2303-8_27 ◽

2015 ◽

pp. 509-537

Author(s):

Lars Martin Jakt ◽

Satoko Moriwaki

Keyword(s):

In Situ Hybridization ◽

Fluorescent In Situ Hybridization ◽

Combinatorial Encoding

Download Full-text

Discovery of CD8 T cell epitopes in Merkel cell polyomavirus through combinatorial encoding with MHC multimers

Journal for ImmunoTherapy of Cancer ◽

10.1186/2051-1426-1-s1-p13 ◽

2013 ◽

Vol 1 (Suppl 1) ◽

pp. P13

Author(s):

Rikke Lyngaa ◽

Natasja W Pedersen ◽

David Schrama ◽

Charlotte A Thrue ◽

Özcan Met ◽

...

Keyword(s):

T Cell ◽

Cd8 T Cell ◽

Merkel Cell Polyomavirus ◽

T Cell Epitopes ◽

Merkel Cell ◽

Combinatorial Encoding ◽

Mhc Multimers

Download Full-text

combinatorial encoding
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Data Storage Based on Combinatorial Synthesis of DNA Shortmers

Adaptation of Combinatorial Encoding Scheme to Index Compression

The Problem of Combinatorial Encoding of a Continuous Dynamics and the Notion of Transfer of Paths in Graphs

Combinatorial Encoding of Bernoulli Schemes and the Asymptotic Behavior of Young Tableaux

Shuffle-Seq: En masse combinatorial encoding for n-way genetic interaction screens

GenEpi: Gene-based Epistasis Discovery Using Machine Learning

A Molecular Odorant Transduction Model and the Complexity of Combinatorial Encoding in the Drosophila Antenna

Olfactory receptors are sensitive to molecular volume of odorants

Extended Multiplexed Fluorescent In Situ Hybridization by Combinatorial Encoding of Individual Transcripts

Discovery of CD8 T cell epitopes in Merkel cell polyomavirus through combinatorial encoding with MHC multimers

Export Citation Format

combinatorial encodingRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Data Storage Based on Combinatorial Synthesis of DNA Shortmers

Adaptation of Combinatorial Encoding Scheme to Index Compression

The Problem of Combinatorial Encoding of a Continuous Dynamics and the Notion of Transfer of Paths in Graphs

Combinatorial Encoding of Bernoulli Schemes and the Asymptotic Behavior of Young Tableaux

Shuffle-Seq: En masse combinatorial encoding for n-way genetic interaction screens

GenEpi: Gene-based Epistasis Discovery Using Machine Learning

A Molecular Odorant Transduction Model and the Complexity of Combinatorial Encoding in the Drosophila Antenna

Olfactory receptors are sensitive to molecular volume of odorants

Extended Multiplexed Fluorescent In Situ Hybridization by Combinatorial Encoding of Individual Transcripts

Discovery of CD8 T cell epitopes in Merkel cell polyomavirus through combinatorial encoding with MHC multimers

combinatorial encoding
Recently Published Documents