string kernel
Recently Published Documents

TOTAL DOCUMENTS

47

(FIVE YEARS 3)

H-INDEX

9

(FIVE YEARS 1)

Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords

Hardware Acceleration of the STRIKE String Kernel Algorithm for Estimating Protein to Protein Interactions

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2021.3066591 ◽

2021 ◽

pp. 1-1

Author(s):

Fadi Sibai ◽

Ali A. El-Moursy ◽

Abu Asaduzzaman ◽

Sohaib Majzoub

Keyword(s):

Protein Interactions ◽

Hardware Acceleration ◽

Download Full-text

Transfer String Kernel for Cross-Context DNA-Protein Binding Prediction

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2016.2609918 ◽

2019 ◽

Vol 16 (5) ◽

pp. 1524-1536 ◽

Author(s):

Ritambhara Singh ◽

Jack Lanchantin ◽

Gabriel Robins ◽

Yanjun Qi

Keyword(s):

Protein Binding ◽

String Kernel ◽

Binding Prediction

Download Full-text

Efficient Global String Kernel with Random Features

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining - KDD '19 ◽

10.1145/3292500.3330923 ◽

2019 ◽

Author(s):

Lingfei Wu ◽

Ian En-Hsu Yen ◽

Siyu Huo ◽

Liang Zhao ◽

Kun Xu ◽

...

Keyword(s):

Download Full-text

GaKCo: a Fast Gapped k-mer string Kernel using Counting

10.1101/329425 ◽

2018 ◽

Author(s):

Ritambhara Singh ◽

Arshdeep Sekhon ◽

Jack Lanchantin ◽

Kamran Kowsari ◽

Beilun Wang ◽

...

Keyword(s):

Asymptotic Analysis ◽

State Of The Art ◽

The State ◽

English Text ◽

Great Success ◽

String Kernel ◽

AbstractString Kernel (SK) techniques, especially those using gapped k-mers as features (gk), have obtained great success in classifying sequences like DNA, protein, and text. However, the state-of-the-art gk-SK runs extremely slow when we increase the dictionary size (Σ) or allow more mismatches (M). This is because current gk-SK uses a trie-based algorithm to calculate co-occurrence of mismatched substrings resulting in a time cost proportional to O(ΣM). We propose a fast algorithm for calculating Gapped k-mer Kernel using Counting (GaKCo). GaKCo uses associative arrays to calculate the co-occurrence of substrings using cumulative counting. This algorithm is fast, scalable to larger Σ and M, and naturally parallelizable. We provide a rigorous asymptotic analysis that compares GaKCo with the state-of-the-art gk-SK. Theoretically, the time cost of GaKCo is independent of the ΣM term that slows down the trie-based approach. Experimentally, we observe that GaKCo achieves the same accuracy as the state-of-the-art and outperforms its speed by factors of 2, 100, and 4, on classifying sequences of DNA (5 datasets), protein (12 datasets), and character-based English text (2 datasets). 1

Download Full-text

TISK 1.0: An easy-to-use Python implementation of the time-invariant string kernel model of spoken word recognition

Behavior Research Methods ◽

10.3758/s13428-017-1012-5 ◽

2018 ◽

Vol 50 (3) ◽

pp. 871-889 ◽

Author(s):

Heejo You ◽

James S. Magnuson

Keyword(s):

Word Recognition ◽

Spoken Word Recognition ◽

Spoken Word ◽

String Kernel ◽

Time Invariant ◽

Download Full-text

A weighted string kernel for protein fold recognition

BMC Bioinformatics ◽

10.1186/s12859-017-1795-5 ◽

2017 ◽

Vol 18 (1) ◽

Author(s):

Saghi Nojoomi ◽

Patrice Koehl

Keyword(s):

Fold Recognition ◽

Protein Fold ◽

String Kernel ◽

Protein Fold Recognition

Download Full-text

GaKCo: A Fast Gapped k-mer String Kernel Using Counting

Machine Learning and Knowledge Discovery in Databases - Lecture Notes in Computer Science ◽

10.1007/978-3-319-71249-9_22 ◽

2017 ◽

pp. 356-373 ◽

Author(s):

Ritambhara Singh ◽

Arshdeep Sekhon ◽

Kamran Kowsari ◽

Jack Lanchantin ◽

Beilun Wang ◽

...

Keyword(s):

Download Full-text

String Kernel

Encyclopedia of Machine Learning and Data Mining ◽

10.1007/978-1-4899-7687-1_790 ◽

2017 ◽

pp. 1200-1200

Keyword(s):

Download Full-text

Chemically Augmented String Kernel for Extraction and Classification of Chemical Compounds from Text

Proceedings of the Knowledge Capture Conference on ZZZ - K-CAP 2015 ◽

10.1145/2815833.2816954 ◽

2015 ◽

Author(s):

Venkata Joopudi ◽

Akansha Singh ◽

Keerthana Kumar ◽

Anirudh Murali ◽

Priya Gandhi ◽

...

Keyword(s):

Chemical Compounds ◽

Download Full-text

A multi-fold string kernel for sequence classification

2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) ◽

10.1109/embc.2015.7319874 ◽

2015 ◽

Author(s):

Aniruddha Maiti ◽

Santanu Ghorai ◽

Anirban Mukherjee

Keyword(s):

Sequence Classification ◽

Download Full-text