sequence classification
Recently Published Documents


TOTAL DOCUMENTS

341
(FIVE YEARS 70)

H-INDEX

25
(FIVE YEARS 3)

Mathematics ◽  
2021 ◽  
Vol 9 (24) ◽  
pp. 3297
Author(s):  
Tat’y Mwata-Velu ◽  
Juan Gabriel Avina-Cervantes ◽  
Jorge Mario Cruz-Duarte ◽  
Horacio Rostro-Gonzalez ◽  
Jose Ruiz-Pinales

Motor Imagery Electroencephalogram (MI-EEG) signals are widely used in Brain-Computer Interfaces (BCI). MI-EEG signals of large limbs movements have been explored in recent researches because they deliver relevant classification rates for BCI systems. However, smaller and noisy signals corresponding to hand-finger imagined movements are less frequently used because they are difficult to classify. This study proposes a method for decoding finger imagined movements of the right hand. For this purpose, MI-EEG signals from C3, Cz, P3, and Pz sensors were carefully selected to be processed in the proposed framework. Therefore, a method based on Empirical Mode Decomposition (EMD) is used to tackle the problem of noisy signals. At the same time, the sequence classification is performed by a stacked Bidirectional Long Short-Term Memory (BiLSTM) network. The proposed method was evaluated using k-fold cross-validation on a public dataset, obtaining an accuracy of 82.26%.


Entropy ◽  
2021 ◽  
Vol 23 (10) ◽  
pp. 1357
Author(s):  
Katrin Sophie Bohnsack ◽  
Marika Kaden ◽  
Julia Abel ◽  
Sascha Saralajew ◽  
Thomas Villmann

In the present article we propose the application of variants of the mutual information function as characteristic fingerprints of biomolecular sequences for classification analysis. In particular, we consider the resolved mutual information functions based on Shannon-, Rényi-, and Tsallis-entropy. In combination with interpretable machine learning classifier models based on generalized learning vector quantization, a powerful methodology for sequence classification is achieved which allows substantial knowledge extraction in addition to the high classification ability due to the model-inherent robustness. Any potential (slightly) inferior performance of the used classifier is compensated by the additional knowledge provided by interpretable models. This knowledge may assist the user in the analysis and understanding of the used data and considered task. After theoretical justification of the concepts, we demonstrate the approach for various example data sets covering different areas in biomolecular sequence analysis.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Fatema Tuz Zohora ◽  
M. Ziaur Rahman ◽  
Ngoc Hieu Tran ◽  
Lei Xin ◽  
Baozhen Shan ◽  
...  

AbstractA promising technique of discovering disease biomarkers is to measure the relative protein abundance in multiple biofluid samples through liquid chromatography with tandem mass spectrometry (LC-MS/MS) based quantitative proteomics. The key step involves peptide feature detection in the LC-MS map, along with its charge and intensity. Existing heuristic algorithms suffer from inaccurate parameters and human errors. As a solution, we propose PointIso, the first point cloud based arbitrary-precision deep learning network to address this problem. It consists of attention based scanning step for segmenting the multi-isotopic pattern of 3D peptide features along with the charge, and a sequence classification step for grouping those isotopes into potential peptide features. PointIso achieves 98% detection of high-quality MS/MS identified peptide features in a benchmark dataset. Next, the model is adapted for handling the additional ‘ion mobility’ dimension and achieves 4% higher detection than existing algorithms on the human proteome dataset. Besides contributing to the proteomics study, our novel segmentation technique should serve the general object detection domain as well.


Author(s):  
David Sundell ◽  
Caroline Öhrman ◽  
Daniel Svensson ◽  
Edvin Karlsson ◽  
Björn Brindefalk ◽  
...  

Abstract Summary The Flexible Taxonomy Database (FlexTaxD) framework provides a method for modification and merging official and custom taxonomic databases to create improved databases. Using such databases will increase accuracy and precision of existing methods to classify sequence reads. Availability and implementation Source code is freely available at https://github.com/FOI-Bioinformatics/flextaxd and installable through Bioconda.


Author(s):  
Hemalatha Gunasekaran ◽  
K. Ramalakshmi ◽  
Shalini Ramanathan ◽  
R. Venkatesan

2021 ◽  
Author(s):  
Yuka Yoshimura ◽  
Akifumi Hamada ◽  
Yohann Augey ◽  
Manato Akiyama ◽  
Yasubumi Sakakibara

Motivation: Biological sequence classification is the most fundamental task in bioinformatics analysis. For example, in metagenome analysis, binning is a typical type of DNA sequence classification. In order to classify sequences, it is necessary to define sequence features. The k-mer frequency, base composition, and alignment-based metrics are commonly used. In contrast, in the field of image recognition using machine learning, image classification is broadly divided into those based on shape and those based on style. A style matrix was introduced as a method of expressing the style of an image (e.g., color usage and texture). Results: We propose a novel sequence feature, called genomic style, inspired by image classification approaches, for classifying and clustering DNA sequences. As with the style of images, the DNA sequence is considered to have a genomic style unique to the bacterial species, and the style matrix concept is applied to the DNA sequence. Our main aim is to introduce the genomics style as yet another basic sequence feature for metagenome binning problem in replace of the most commonly used sequence feature k-mer frequency. Performance evaluations show that our method using style matrix achieves the superior accuracy than state-of-the-art binning tools based on k-mer frequency.


Sign in / Sign up

Export Citation Format

Share Document