A Hadoop-based Massive Molecular Data Storage Solution for Virtual Screening

Author(s):  
Yan Zhang ◽  
Ruisheng Zhang ◽  
Qiuqiang Chen ◽  
Xiaopan Gao ◽  
Rongjing Hu ◽  
...  
2020 ◽  
Vol 3 (1) ◽  
Author(s):  
Maximiliane Frölich ◽  
Dennis Hofheinz ◽  
Michael A. R. Meier

AbstractIn recent years, the field of molecular data storage has emerged from a niche to a vibrant research topic. Herein, we describe a simultaneous and automated read-out of data stored in mixtures of sequence-defined oligomers. Therefore, twelve different sequence-defined tetramers and three hexamers with different mass markers and side chains are successfully synthesised via iterative Passerini three-component reactions and subsequent deprotection steps. By programming a straightforward python script for ESI-MS/MS analysis, it is possible to automatically sequence and thus read-out the information stored in these oligomers within one second. Most importantly, we demonstrate that the use of mass-markers as starting compounds eases MS/MS data interpretation and furthermore allows the unambiguous reading of sequences of mixtures of sequence-defined oligomers. Thus, high data storage capacity considering the field of synthetic macromolecules (up to 64.5 bit in our examples) can be obtained without the need of synthesizing long sequences, but by mixing and simultaneously analysing shorter sequence-defined oligomers.


Sign in / Sign up

Export Citation Format

Share Document