Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing

AbstractOpen modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results. Here we present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate, as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts. ANN-SoLo achieves state-of-the-art performance in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies more spectra than SpectraST or MSFragger and achieves a speedup of an order of magnitude compared to SpectraST.ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license athttps://github.com/bittremieux/ANN-SoLo.

Download Full-text

Adaptive bit allocation hashing for approximate nearest neighbor search

Neurocomputing ◽

10.1016/j.neucom.2014.10.042 ◽

2015 ◽

Vol 151 ◽

pp. 719-728 ◽

Cited By ~ 4

Author(s):

Qin-Zhen Guo ◽

Zhi Zeng ◽

Shuwu Zhang

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Bit Allocation ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination

Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data ◽

10.1145/3318464.3380600 ◽

2020 ◽

Author(s):

Conglong Li ◽

Minjia Zhang ◽

David G. Andersen ◽

Yuxiong He

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

Early Termination ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

Local partition subdivision algorithm for approximate nearest neighbor query

10.1109/icisce50968.2020.00168 ◽

2020 ◽

Author(s):

Penghui Dong ◽

Yan Yang

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Query ◽

Approximate Nearest Neighbor ◽

Subdivision Algorithm

Download Full-text

Scaling Up Ensemble of Adaptations for Classification by Approximate Nearest Neighbor Retrieval

Case-Based Reasoning Research and Development - Lecture Notes in Computer Science ◽

10.1007/978-3-319-61030-6_11 ◽

2017 ◽

pp. 154-169 ◽

Cited By ~ 1

Author(s):

Vahid Jalali ◽

David Leake

Keyword(s):

Nearest Neighbor ◽

Scaling Up ◽

Approximate Nearest Neighbor

Download Full-text

Pengenalan Motif Karawo Menggunakan Ekstraksi Fitur SIFT Dan Approximate Nearest Neighbor

10.31227/osf.io/zb857 ◽

2019 ◽

Author(s):

Syahrial

Keyword(s):

Nearest Neighbor ◽

Feature Matching ◽

Recognition Accuracy ◽

Sift Algorithm ◽

Approximate Nearest Neighbor ◽

Single Pattern ◽

Tree Data ◽

Threshold Testing ◽

Root Word ◽

Tree Data Structure

An art culture from Gorontalo became iconic handcraft is kerawang or karawo. The word “karawo” came from root word of “mokarawo” which means slicing or making holes. It’s created with precision, carefulness, and patience in work using handmade masterpiece. Pattern of karawo itself held four kinds which is flora, fauna, geometric, and nature. From those kinds born vary pattern which come difficult to identify both its names and its kind. Karawo patterns can be form as a single pattern or a pattern that it parts came from several or many pattern combined. Those patterns had its own characteristic from shape and scale perspective. Identifying single pattern on combined pattern are particularly a problem because it’s combined involve scaling and rotation. This research is recognizing single pattern on combined pattern using feature extraction SIFT algorithm which is capable extract feature that invariant from scale and rotation. Feature matching using approximate nearest neighbor (aNN) for similarity of features labor best bin first strategy on kd-tree data structure. Those methods can be a reference to recognize single pattern on combined pattern using from range 5 to 20 match features as a threshold. Testing result indicated recognition accuracy is good which range form 76.36% to 85.45% on recognize the kind of karawo pattern and 76.36% on its name.

Download Full-text

Feature matching algorithm based on KAZE and fast approximate nearest neighbor search

Proceedings of the 3rd International Conference on Computer Science and Service System ◽

10.2991/csss-14.2014.63 ◽

2014 ◽

Author(s):

Cai Ze-Ping ◽

Xiao De-Gui

Keyword(s):

Nearest Neighbor ◽

Feature Matching ◽

Nearest Neighbor Search ◽

Matching Algorithm ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

APPROXIMATE NEAREST NEIGHBOR SEARCH IN HIGH DIMENSIONS

Proceedings of the International Congress of Mathematicians (ICM 2018) ◽

10.1142/9789813272880_0182 ◽

2019 ◽

Cited By ~ 3

Author(s):

ALEXANDR ANDONI ◽

PIOTR INDYK ◽

ILYA RAZENSHTEYN

Keyword(s):

Nearest Neighbor ◽

Nearest Neighbor Search ◽

High Dimensions ◽

Approximate Nearest Neighbor Search ◽

Approximate Nearest Neighbor ◽

Neighbor Search

Download Full-text

Scale-Invariant Feature Transform Algorithm with Fast Approximate Nearest Neighbor

Baghdad Science Journal ◽

10.21123/bsj.14.3.651-661 ◽

2017 ◽

Vol 14 (3) ◽

pp. 651-661 ◽

Cited By ~ 1

Author(s):

Baghdad Science Journal

Keyword(s):

Nearest Neighbor ◽

Daily Basis ◽

Scale Invariant Feature Transform ◽

Scale Invariant ◽

Suggested Approach ◽

Approximate Nearest Neighbor ◽

Invariant Feature ◽

Key Points ◽

Feature Transform ◽

Scale Invariant Feature

There is a great deal of systems dealing with image processing that are being used and developed on a daily basis. Those systems need the deployment of some basic operations such as detecting the Regions of Interest and matching those regions, in addition to the description of their properties. Those operations play a significant role in decision making which is necessary for the next operations depending on the assigned task. In order to accomplish those tasks, various algorithms have been introduced throughout years. One of the most popular algorithms is the Scale Invariant Feature Transform (SIFT). The efficiency of this algorithm is its performance in the process of detection and property description, and that is due to the fact that it operates on a big number of key-points, the only drawback it has is that it is rather time consuming. In the suggested approach, the system deploys SIFT to perform its basic tasks of matching and description is focused on minimizing the number of key-points which is performed via applying Fast Approximate Nearest Neighbor algorithm, which will reduce the redundancy of matching leading to speeding up the process. The proposed application has been evaluated in terms of two criteria which are time and accuracy, and has accomplished a percentage of accuracy of up to 100%, in addition to speeding up the processes of matching and description.

Download Full-text

Fusion of multiple approximate nearest neighbor classifiers for fast and efficient classification

Information Fusion ◽

10.1016/j.inffus.2004.02.003 ◽

2004 ◽

Vol 5 (4) ◽

pp. 239-250 ◽

Cited By ~ 16

Author(s):

P. Viswanath ◽

M. Narasimha Murty ◽

Shalabh Bhatnagar

Keyword(s):

Nearest Neighbor ◽

Approximate Nearest Neighbor ◽

Nearest Neighbor Classifiers

Download Full-text