scholarly journals Modelling, characterization of data-dependent and process-dependent errors in DNA data storage

2021 ◽  
Author(s):  
Wang Yixin ◽  
MD Noor Rahim ◽  
Erry Gunawan ◽  
Yong Liang Guan ◽  
Chueh Loo Poh

Using DNA as the medium to store information has recently been recognized as a promising solution for long-term data storage. While several system prototypes have been demonstrated, the error characteristics in DNA data storage are discussed with limited content. Due to the data and process variations from experiment to experiment, the error variation and its effect on data recovery remain to be uncovered. To close the gap, we systematically investigate the storage channel, i.e., error characteristics in the storage process. We first propose a new concept named sequence corruption to unify the error characteristics into the sequence level, easing the channel analysis. Then we derived the formulations of the data imperfection at the decoder including both sequence loss and sequence corruption, revealing the decoding demand and monitoring the data recovery. Furthermore, we extensively explored several data-dependent unevenness observed in the base error patterns and studied a few potential factors and their impacts on the data imperfection at the decoder both theoretically and experimentally. The results presented here introduce a more comprehensive channel model and offer a new angle towards the data recovery issue in DNA data storage by further elucidating the error characteristics of the storage process.

GigaScience ◽  
2020 ◽  
Vol 9 (10) ◽  
Author(s):  
Daniel Arend ◽  
Patrick König ◽  
Astrid Junker ◽  
Uwe Scholz ◽  
Matthias Lange

Abstract Background The FAIR data principle as a commitment to support long-term research data management is widely accepted in the scientific community. Although the ELIXIR Core Data Resources and other established infrastructures provide comprehensive and long-term stable services and platforms for FAIR data management, a large quantity of research data is still hidden or at risk of getting lost. Currently, high-throughput plant genomics and phenomics technologies are producing research data in abundance, the storage of which is not covered by established core databases. This concerns the data volume, e.g., time series of images or high-resolution hyper-spectral data; the quality of data formatting and annotation, e.g., with regard to structure and annotation specifications of core databases; uncovered data domains; or organizational constraints prohibiting primary data storage outside institional boundaries. Results To share these potentially dark data in a FAIR way and master these challenges the ELIXIR Germany/de.NBI service Plant Genomic and Phenomics Research Data Repository (PGP) implements a “bring the infrastructure to the data” approach, which allows research data to be kept in place and wrapped in a FAIR-aware software infrastructure. This article presents new features of the e!DAL infrastructure software and the PGP repository as a best practice on how to easily set up FAIR-compliant and intuitive research data services. Furthermore, the integration of the ELIXIR Authentication and Authorization Infrastructure (AAI) and data discovery services are introduced as means to lower technical barriers and to increase the visibility of research data. Conclusion The e!DAL software matured to a powerful and FAIR-compliant infrastructure, while keeping the focus on flexible setup and integration into existing infrastructures and into the daily research process.


2016 ◽  
Vol 52 (7) ◽  
pp. 1-4
Author(s):  
Kazutoshi Katayama ◽  
Yuka Chinda ◽  
Osamu Shimizu ◽  
Tatsuo Mikami ◽  
Mayumi Suzuki ◽  
...  

2021 ◽  
Vol 58 (2) ◽  
pp. 1706-1717
Author(s):  
Krisada Sungkhamanee, Piyadhida Sungkhamanee

Investment decisions have great importance in different sectors of various countries and these decisions are the basis on which the outcomes of the investments are based. However, there might be certain factors that might lead to the incorrect long term and short term investment decisions. In this regard, the current study has been conducted with the core motive to explore the impact casted by the environment and potential factors i.e. salience and overconfidence on the long term investment decisions for accommodation business along with the moderation of a variable i.e. financial literacy. To fulfill this objective, the researcher has collected data from the investors of accommodation businesses in Thailand. The collected data has been subjected to different statistical techniques and tools for analysis purpose and the results have been obtained. The results obtained by the analysis of the collected data indicate that salience and overconfidence have significant impact on the long term investment decision. In addition, the moderating role of financial literacy has also been found as significant in the study. The results suggest that the investors of the accommodation business must consider the aspects of salience and overconfidence before taking any long term investment decision to avoid failure of the investment decision.    


2021 ◽  
Author(s):  
Min Li ◽  
Junbiao Dai ◽  
Qingshan Jiang ◽  
Yang Wang

Abstract Current research on DNA storage usually focuses on the improvement of storage density with reduced gene synthesis cost by developing effective encoding and decoding schemes while lacking the consideration on the uncertainty in ultra long-term data storage and retention. Consequently, the current DNA storage systems are often not self-containment, implying that they have to resort to external tools for the restoration of the stored gene data. This may result in high risks in data loss since the required tools might not be available due to the high uncertainty in far future. To address this issue, we propose in this paper a self-contained DNA storage system that can make self-explanatory to its stored data without relying on any external tools. To this end, we design a specific DNA file format whereby a separate storage scheme is developed to reduce the data redundancy while an effective indexing is designed for random read operations to the stored data file. We verified through experimental data that the proposed self-contained and self-explanatory method can not only get rid of the reliance on external tools for data restoration but also minimize the data redundancy brought about when the amount of data to be stored reaches a certain scale.


2021 ◽  
Vol 1 ◽  
pp. 80
Author(s):  
Thijs Devriendt ◽  
Clemens Ammann ◽  
Folkert W. Asselbergs ◽  
Alexander Bernier ◽  
Rodrigo Costas ◽  
...  

Various data sharing platforms are being developed to enhance the sharing of cohort data by addressing the fragmented state of data storage and access systems. However, policy challenges in several domains remain unresolved. The euCanSHare workshop was organized to identify and discuss these challenges and to set the future research agenda. Concerns over the multiplicity and long-term sustainability of platforms, lack of resources, access of commercial parties to medical data, credit and recognition mechanisms in academia and the organization of data access committees are outlined. Within these areas, solutions need to be devised to ensure an optimal functioning of platforms.


2021 ◽  
Author(s):  
Yifei Yang ◽  
Mingkun Xu ◽  
Lujie Xu ◽  
Xinxin Wang ◽  
Huan Liu ◽  
...  

Abstract The electrochemical (EC) resistive switching (RS) cross-point arrays, composed of nonvolatile RS (NV-RS) memories and volatile RS (V-RS) selectors, hold promise for high-density data storage, in-memory computing and neuromorphic computing. However, the conventional EC-RS devices based on metallic filaments suffer from the notorious current-volatility dilemma that the low and high current requirements for NV-RS memories and V-RS selectors, respectively, cannot be satisfied simultaneously, due to the dominant EC nature of the RS. In this work, we demonstrate electrochemically active, low thermal-conductivity and low melting-temperature semiconducting tellurium filament-based RS devices that solve this dilemma, enabling NV-RS memories to operate under lower currents than do V-RS selectors. This novel phenomenon arises as the consequence of the adversarial EC and Joule heating (JH) effects. The devices also show unusual stimulus frequency dependent long-term plasticity (LTP)-to-short-term plasticity (STP) transition. Devices with this property can be generically utilized as spatial-temporal filters in spiking neural networks (SNNs) for high-performance event-based visual recognition tasks, as illustrated in our noise filtering simulations. By regulating the EC-JH relationship using dielectric materials with decreasing thermal conductivities, full functional-range tunable Te filament-based devices, from always-NV RS, to NV-to-V transitionable RS, and to always-V RS, are also demonstrated. The tellurium filament-based RS devices are promising enablers for functional cross-point arrays.


Sign in / Sign up

Export Citation Format

Share Document