scholarly journals A Universal Sequencing System for Unknown Oligomers

Author(s):  
David Doran ◽  
Emma Clarke ◽  
Graham Keenan ◽  
Emma Carrick ◽  
Cole Mathis ◽  
...  

<p><b>No synthetic chemical system can produce complex oligomers with fidelities comparable to biological systems. To bridge this gap, chemists must be able to </b><b>characterise</b><b> synthetic oligomers. Currently there are no tools for identifying synthetic oligomers with sequence resolution. Herein, we present a system that allows us to do omics-level sequencing for synthetic oligomers and use this to explore unconstrained complex mixtures. The system, Oligomer-Soup-Sequencing (OLIGOSS), can sequence individual oligomers in heterogeneous and polydisperse mixtures from tandem mass spectrometry (MS/MS) data. Unlike existing software, OLIGOSS can sequence oligomers with different backbone chemistries. Using an input file format, OLIG, that formalizes the set of abstract properties, any MS/MS fragmentation pathway can be defined. This has been demonstrated on four model systems of linear oligomers. OLIGOSS can screen large sequence spaces, enabling reliable sequencing of synthetic oligomeric mixtures, with false discovery rates (FDRs) of 0-1.1%, providing sequence resolution comparable to bioinformatic tools.</b></p>

2020 ◽  
Author(s):  
David Doran ◽  
Emma Clarke ◽  
Graham Keenan ◽  
Emma Carrick ◽  
Cole Mathis ◽  
...  

<p><b>No synthetic chemical system can produce complex oligomers with fidelities comparable to biological systems. To bridge this gap, chemists must be able to </b><b>characterise</b><b> synthetic oligomers. Currently there are no tools for identifying synthetic oligomers with sequence resolution. Herein, we present a system that allows us to do omics-level sequencing for synthetic oligomers and use this to explore unconstrained complex mixtures. The system, Oligomer-Soup-Sequencing (OLIGOSS), can sequence individual oligomers in heterogeneous and polydisperse mixtures from tandem mass spectrometry (MS/MS) data. Unlike existing software, OLIGOSS can sequence oligomers with different backbone chemistries. Using an input file format, OLIG, that formalizes the set of abstract properties, any MS/MS fragmentation pathway can be defined. This has been demonstrated on four model systems of linear oligomers. OLIGOSS can screen large sequence spaces, enabling reliable sequencing of synthetic oligomeric mixtures, with false discovery rates (FDRs) of 0-1.1%, providing sequence resolution comparable to bioinformatic tools.</b></p>


Biometrika ◽  
2011 ◽  
Vol 98 (2) ◽  
pp. 251-271 ◽  
Author(s):  
Bradley Efron ◽  
Nancy R. Zhang

Scientifica ◽  
2012 ◽  
Vol 2012 ◽  
pp. 1-9 ◽  
Author(s):  
Emily Hansen ◽  
Kathleen F. Kerr

The goal of many microarray studies is to identify genes that are differentially expressed between two classes or populations. Many data analysts choose to estimate the false discovery rate (FDR) associated with the list of genes declared differentially expressed. Estimating an FDR largely reduces to estimatingπ1, the proportion of differentially expressed genes among all analyzed genes. Estimatingπ1is usually done throughP-values, but computingP-values can be viewed as a nuisance and potentially problematic step. We evaluated methods for estimatingπ1directly from test statistics, circumventing the need to computeP-values. We adapted existing methodology for estimatingπ1fromt- andz-statistics so thatπ1could be estimated from other statistics. We compared the quality of these estimates to estimates generated by two established methods for estimatingπ1fromP-values. Overall, methods varied widely in bias and variability. The least biased and least variable estimates ofπ1, the proportion of differentially expressed genes, were produced by applying the “convest” mixture model method toP-values computed from a pooled permutation null distribution. Estimates computed directly from test statistics rather thanP-values did not reliably perform well.


Author(s):  
Balthasar Bickel

Large-scale areal patterns point to ancient population history and form a well-known confound for language universals. Despite their importance, demonstrating such patterns remains a challenge. This chapter argues that large-scale area hypotheses are better tested by modeling diachronic family biases than by controlling for genealogical relations in regression models. A case study of the Trans-Pacific area reveals that diachronic bias estimates do not depend much on the amount of phylogenetic information that is used when inferring them. After controlling for false discovery rates, about 39 variables in WALS and AUTOTYP show diachronic biases that differ significantly inside vs. outside the Trans-Pacific area. Nearly three times as many biases hold outside than inside the Trans-Pacific area, indicating that the Trans-Pacific area is not so much characterized by the spread of biases but rather by the retention of earlier diversity, in line with earlier suggestions in the literature.


PROTEOMICS ◽  
2009 ◽  
Vol 9 (5) ◽  
pp. 1220-1229 ◽  
Author(s):  
Andrew R. Jones ◽  
Jennifer A. Siepen ◽  
Simon J. Hubbard ◽  
Norman W. Paton

Sign in / Sign up

Export Citation Format

Share Document