Quantitative single molecule SERS sensing enabled by machine learning (Conference Presentation)

Author(s):  
Regina Ragan ◽  
William Thrift
2018 ◽  
Vol 35 (15) ◽  
pp. 2654-2656 ◽  
Author(s):  
Guoli Ji ◽  
Wenbin Ye ◽  
Yaru Su ◽  
Moliang Chen ◽  
Guangzao Huang ◽  
...  

Abstract Summary Alternative splicing (AS) is a well-established mechanism for increasing transcriptome and proteome diversity, however, detecting AS events and distinguishing among AS types in organisms without available reference genomes remains challenging. We developed a de novo approach called AStrap for AS analysis without using a reference genome. AStrap identifies AS events by extensive pair-wise alignments of transcript sequences and predicts AS types by a machine-learning model integrating more than 500 assembled features. We evaluated AStrap using collected AS events from reference genomes of rice and human as well as single-molecule real-time sequencing data from Amborella trichopoda. Results show that AStrap can identify much more AS events with comparable or higher accuracy than the competing method. AStrap also possesses a unique feature of predicting AS types, which achieves an overall accuracy of ∼0.87 for different species. Extensive evaluation of AStrap using different parameters, sample sizes and machine-learning models on different species also demonstrates the robustness and flexibility of AStrap. AStrap could be a valuable addition to the community for the study of AS in non-model organisms with limited genetic resources. Availability and implementation AStrap is available for download at https://github.com/BMILAB/AStrap. Supplementary information Supplementary data are available at Bioinformatics online.


2020 ◽  
Vol 124 (43) ◽  
pp. 24029-24031
Author(s):  
Nathan D. Bamberger ◽  
Jeffrey A. Ivie ◽  
Keshaba N. Parida ◽  
Dominic V. McGrath ◽  
Oliver L. A. Monti

2019 ◽  
pp. 1900415 ◽  
Author(s):  
Kundan Sivashanmugan ◽  
Kenneth Squire ◽  
Joseph A. Kraai ◽  
Ailing Tan ◽  
Yong Zhao ◽  
...  

2016 ◽  
Vol 6 (1) ◽  
Author(s):  
Daniela M. Borgmann ◽  
Sandra Mayr ◽  
Helene Polin ◽  
Susanne Schaller ◽  
Viktoria Dorfer ◽  
...  

2021 ◽  
Vol 118 (11) ◽  
pp. e2022806118
Author(s):  
Ke Xia ◽  
James T. Hagan ◽  
Li Fu ◽  
Brian S. Sheetz ◽  
Somdatta Bhattacharya ◽  
...  

The application of solid-state (SS) nanopore devices to single-molecule nucleic acid sequencing has been challenging. Thus, the early successes in applying SS nanopore devices to the more difficult class of biopolymer, glycosaminoglycans (GAGs), have been surprising, motivating us to examine the potential use of an SS nanopore to analyze synthetic heparan sulfate GAG chains of controlled composition and sequence prepared through a promising, recently developed chemoenzymatic route. A minimal representation of the nanopore data, using only signal magnitude and duration, revealed, by eye and image recognition algorithms, clear differences between the signals generated by four synthetic GAGs. By subsequent machine learning, it was possible to determine disaccharide and even monosaccharide composition of these four synthetic GAGs using as few as 500 events, corresponding to a zeptomole of sample. These data suggest that ultrasensitive GAG analysis may be possible using SS nanopore detection and well-characterized molecular training sets.


2021 ◽  
Author(s):  
Mathew Schneider ◽  
Alaa Al-Shaer ◽  
Nancy R. Forde

AbstractSingle-molecule imaging is widely used to determine statistical distributions of molecular properties. One such characteristic is the bending flexibility of biological filaments, which can be parameterized via the persistence length. Quantitative extraction of persistence length from images of individual filaments requires both the ability to trace the backbone of the chains in the images and sufficient chain statistics to accurately assess the persistence length. Chain tracing can be a tedious task, performed manually or using algorithms that require user input and/or supervision. Such interventions have the potential to introduce user-dependent bias into the chain selection and tracing. Here, we introduce a fully automated algorithm for chain tracing and determination of persistence lengths. Dubbed “AutoSmarTrace”, the algorithm is built off a neural network, trained via machine learning to identify filaments within images recorded using atomic force microscopy (AFM). We validate the performance of AutoSmarTrace on simulated images with widely varying levels of noise, demonstrating its ability to return persistence lengths in agreement with the ground truth. Persistence lengths returned from analysis of experimental images of collagen and DNA agree with previous values obtained from these images with different chain-tracing approaches. While trained on AFM-like images, the algorithm also shows promise to identify chains in other single-molecule imaging approaches, such as rotary shadowing electron microscopy and fluorescence imaging.Statement of SignificanceMachine learning presents powerful capabilities to the analysis of large data sets. Here, we apply this approach to the determination of bending flexibility – described through persistence length – from single-molecule images of biological filaments. We present AutoSmarTrace, a tool for automated tracing and analysis of chain flexibility. Built on a neural network trained via machine learning, we show that AutoSmarTrace can determine persistence lengths from AFM images of a variety of biological macromolecules including collagen and DNA. While trained on AFM-like images, the algorithm works well to identify filaments in other types of images. This technique can free researchers from tedious tracing of chains in images, removing user bias and standardizing determination of chain mechanical parameters from single-molecule conformational images.


2021 ◽  
Author(s):  
Yerdos A. Ordabayev ◽  
Larry J. Friedman ◽  
Jeff Gelles ◽  
Douglas L. Theobald

AbstractMulti-wavelength single-molecule fluorescence colocalization (CoSMoS) methods allow elucidation of complex biochemical reaction mechanisms. However, analysis of CoSMoS data is intrinsically challenging because of low image signal-to-noise ratios, non-specific surface binding of the fluorescent molecules, and analysis methods that require subjective inputs to achieve accurate results. Here, we use Bayesian probabilistic programming to implement Tapqir, an unsupervised machine learning method based on a holistic, physics-based causal model of CoSMoS data. This method accounts for uncertainties in image analysis due to photon and camera noise, optical non-uniformities, non-specific binding, and spot detection. Rather than merely producing a binary “spot/no spot” classification of unspecified reliability, Tapqir objectively assigns spot classification probabilities that allow accurate downstream analysis of molecular dynamics, thermodynamics, and kinetics. We both quantitatively validate Tapqir performance against simulated CoSMoS image data with known properties and also demonstrate that it implements fully objective, automated analysis of experiment-derived data sets with a wide range of signal, noise, and non-specific binding characteristics.


Sign in / Sign up

Export Citation Format

Share Document