counting approach
Recently Published Documents


TOTAL DOCUMENTS

124
(FIVE YEARS 35)

H-INDEX

14
(FIVE YEARS 2)

2021 ◽  
Author(s):  
Harriet Dashnow ◽  
Brent S. Pedersen ◽  
Laurel Hiatt ◽  
Joe Brown ◽  
Sarah J. Beecroft ◽  
...  

Expansions of short tandem repeats (STRs) cause dozens of rare Mendelian diseases. However, STR expansions, especially those arising from repeats not present in the reference genome, are challenging to detect from short-read sequencing data. Such "novel" STRs include new repeat units occurring at known STR loci, or entirely new STR loci where the sequence is absent from the reference genome. A primary cause of difficulty detecting STR expansions is that reads arising from STR expansions are frequently mismapped or unmapped. To address this challenge, we have developed STRling, a new STR detection algorithm that counts k-mers (short DNA sequences of length k) in DNA sequencing reads, to efficiently recover reads that inform the presence and size of STR expansions. As a result, STRling can call expansions at both known and novel STR loci. STRling has a sensitivity of 83% for 14 known STR disease loci, including the novel STRs that cause CANVAS and DBQD2. It is the first method to resolve the position of novel STR expansions to base pair accuracy. Such accuracy is essential to interpreting the consequence of each expansion. STRling has an estimated 0.078 false discovery rate for known pathogenic loci in unaffected individuals and a 0.20 false discovery rate for genome-wide loci in unaffected individuals when using variants called from long-read data as truth. STRling is fast, scalable on cloud computing, open-source, and freely available at https://github.com/quinlan-lab/STRling.


2021 ◽  
Author(s):  
Hongfei Wang ◽  
Yanzhou Zhang ◽  
Zhanyuan Ye ◽  
Hengfa Liu ◽  
Xin Wei ◽  
...  

2021 ◽  
Author(s):  
Johannes Stein ◽  
Florian Stehr ◽  
Ralf Jungmann ◽  
Petra Schwille

Single-Molecule Localization Microscopy (SMLM) has revolutionized light microscopy by enabling optical resolutions down to a few nanometer. Yet, localization precisions commonly not suffice to visually resolve single subunits in molecular assemblies or multimeric complexes. Since each targeted molecule contributes localizations during image acquisition, molecular counting approaches to reveal the target copy numbers within localization clusters have been continuously proposed since the early days of SMLM, most of which rely on preliminary knowledge of the dye photo-physics or on a calibration to a reference. Previously, we developed localization-based Fluorescence Correlation Spectroscopy (lbFCS) as an absolute ensemble counting approach for the SMLM-variant DNA-Points Accumulation for Imaging in Nanoscale Topography (PAINT), for the first time circumventing the necessity for reference calibrations. Here, we present a revised framework termed lbFCS+ which allows absolute counting of copy numbers for individual localization clusters in a single DNA-PAINT image. In lbFCS+, absolute counting in individual clusters is achieved via precise measurement of the local hybridization rates of the fluorescently-labeled oligonucleotides (imagers) employed in DNA-PAINT imaging. In proof-of-principle experiments on DNA origami nanostructures, we demonstrate the ability of lbFCS+ to truthfully determine molecular copy numbers and imager association and dissociation rates in well-separated localization clusters containing up to six docking strands. We show that lbFCS+ allows to resolve heterogeneous binding dynamics enabling the distinction of stochastically generated and a priori indistinguishable DNA assemblies. Beyond advancing quantitative DNA-PAINT imaging, we believe that lbFCS+ could find promising applications ranging from bio-sensing to DNA computing.


Author(s):  
Hui Lin ◽  
Xiaopeng Hong ◽  
Zhiheng Ma ◽  
Xing Wei ◽  
Yunfeng Qiu ◽  
...  

Traditional crowd counting approaches usually use Gaussian assumption to generate pseudo density ground truth, which suffers from problems like inaccurate estimation of the Gaussian kernel sizes. In this paper, we propose a new measure-based counting approach to regress the predicted density maps to the scattered point-annotated ground truth directly. First, crowd counting is formulated as a measure matching problem. Second, we derive a semi-balanced form of Sinkhorn divergence, based on which a Sinkhorn counting loss is designed for measure matching. Third, we propose a self-supervised mechanism by devising a Sinkhorn scale consistency loss to resist scale changes. Finally, an efficient optimization method is provided to minimize the overall loss function. Extensive experiments on four challenging crowd counting datasets namely ShanghaiTech, UCF-QNRF, JHU++ and NWPU have validated the proposed method.


Energies ◽  
2021 ◽  
Vol 14 (14) ◽  
pp. 4074
Author(s):  
Kiarash Movassagh ◽  
Arif Raihan ◽  
Balakumar Balasingam ◽  
Krishna Pattipati

In this paper, we consider the problem of state-of-charge estimation for rechargeable batteries. Coulomb counting is a well-known method for estimating the state of charge, and it is regarded as accurate as long as the battery capacity and the beginning state of charge are known. The Coulomb counting approach, on the other hand, is prone to inaccuracies from a variety of sources, and the magnitude of these errors has not been explored in the literature. We formally construct and quantify the state-of-charge estimate error during Coulomb counting due to four types of error sources: (1) current measurement error; (2) current integration approximation error; (3) battery capacity uncertainty; and (4) timing oscillator error/drift. It is demonstrated that the state-of-charge error produced can be either time-cumulative or state-of-charge-proportional. Time-cumulative errors accumulate over time and have the potential to render the state-of-charge estimation utterly invalid in the long term.The proportional errors of the state of charge rise with the accumulated state of charge and reach their worst value within one charge/discharge cycle. The study presents methods for reducing time-cumulative and state-of-charge-proportional mistakes through simulation analysis.


2021 ◽  
pp. 2150019
Author(s):  
Takashi Komatsu ◽  
Norio Konno ◽  
Hisashi Morioka ◽  
Etsuo Segawa

We consider the time-independent scattering theory for time evolution operators of one-dimensional two-state quantum walks. The scattering matrix associated with the position-dependent quantum walk naturally appears in the asymptotic behavior at the spatial infinity of generalized eigenfunctions. The asymptotic behavior of generalized eigenfunctions is a consequence of an explicit expression of the Green function associated with the free quantum walk. When the position-dependent quantum walk is a finite rank perturbation of the free quantum walk, we derive a kind of combinatorial construction of the scattering matrix by counting paths of quantum walkers. We also mention some remarks on the tunneling effect.


2021 ◽  
Vol 118 (8) ◽  
pp. e2016191118
Author(s):  
Timo Dimitriadis ◽  
Tilmann Gneiting ◽  
Alexander I. Jordan

A probability forecast or probabilistic classifier is reliable or calibrated if the predicted probabilities are matched by ex post observed frequencies, as examined visually in reliability diagrams. The classical binning and counting approach to plotting reliability diagrams has been hampered by a lack of stability under unavoidable, ad hoc implementation decisions. Here, we introduce the CORP approach, which generates provably statistically consistent, optimally binned, and reproducible reliability diagrams in an automated way. CORP is based on nonparametric isotonic regression and implemented via the pool-adjacent-violators (PAV) algorithm—essentially, the CORP reliability diagram shows the graph of the PAV-(re)calibrated forecast probabilities. The CORP approach allows for uncertainty quantification via either resampling techniques or asymptotic theory, furnishes a numerical measure of miscalibration, and provides a CORP-based Brier-score decomposition that generalizes to any proper scoring rule. We anticipate that judicious uses of the PAV algorithm yield improved tools for diagnostics and inference for a very wide range of statistical and machine learning methods.


2021 ◽  
Author(s):  
Laura H. Tung ◽  
Carl Kingsford

AbstractDespite numerous RNA-seq samples available at large databases, most RNA-seq analysis tools are evaluated on a limited number of RNA-seq samples. This drives a need for methods to select a representative subset from all available RNA-seq samples to facilitate comprehensive, unbiased evaluation of bioinformatics tools. In sequence-based approaches for representative set selection (e.g. a k-mer counting approach that selects a subset based on k-mer similarities between RNA-seq samples), because of the huge number of available RNA-seq samples and the large number of k-mers/sequences in each sample, computing the full similarity matrix between all samples using k-mers/sequences for the entire set of RNA-seq samples in a large database (e.g. the SRA) has memory and runtime challenges, making direct representative set selection infeasible with limited computing resources. Therefore, we developed a novel computational method called “hierarchical representative set selection” to handle this challenge. Hierarchical representative set selection is a divide-and-conquer-like algorithm that breaks the representative set selection into sub-selections and hierarchically selects representative samples through multiple levels. We demonstrate that hierarchical representative set selection can achieve performance close to that of direct representative set selection, while largely reducing the runtime and memory requirements of computing the full similarity matrix (up to 8.4X runtime reduction and 4.7X memory reduction for 10000 samples that could be practically run with direct subset selection). We show that hierarchical representative set selection substantially outperforms random sampling on the entire SRA set of RNA-seq samples, making it a practical solution to representative set selection on large databases such as the SRA.


The Analyst ◽  
2021 ◽  
Author(s):  
Xiaojun Liu ◽  
Zhangjian Wu ◽  
Xinyi Lin ◽  
Wei Bu ◽  
Lei Qin ◽  
...  

Monitoring ctDNA in blood is important to cancer management. Here, we develop a one-step single particle counting approach for directly quantifying ctDNA in plasma. Hairpin DNA containing a triple helix...


Sign in / Sign up

Export Citation Format

Share Document