Indexing algorithm based on storing additional distances in metric space for multi-vantage-point tree

Introduction: The similarity search paradigm is used in various computational tasks, such as classification, data mining, pattern recognition, etc. Currently, the technology of tree-like metric access methods occupies a significant place among search algorithms. The classical problem of reducing the time of similarity search in metric space is relevant for modern systems when processing big complex data. Due to multidimensional nature of the search algorithm effectiveness problem, local research in this direction is in demand, constantly bringing useful results. Purpose: To reduce the computational complexity of tree search algorithms in problems involving metric proximity. Results: We developed a search algorithm for a multi-vantage-point tree, based on the priority node-processing queue. We mathematically formalized the problems of additional calculations and ways to solve them. To improve the performance of similarity search, we have proposed procedures for forming a priority queue of processing nodes and reducing the number of intersections of same level nodes. Structural changes in the multi-vantage-point tree and the use of minimum distances between vantage points and node subtrees provide better search efficiency. More accurate determination of the distance from the search object to the nodes and the fact that the search area intersects with a tree node allows you to reduce the amount of calculations. Practical relevance: The resulting search algorithms need less time to process information due to an insignificant increase in memory requirements. Reducing the information processing time expands the application boundaries of tree metric indexing methods in search problems involving large data sets.

Download Full-text

LC–MS/MS Software for Screening Unknown Erectile Dysfunction Drugs and Analogues: Artificial Neural Network Classification, Peak-Count Scoring, Simple Similarity Search, and Hybrid Similarity Search Algorithms

Analytical Chemistry ◽

10.1021/acs.analchem.9b01643 ◽

2019 ◽

Vol 91 (14) ◽

pp. 9119-9128 ◽

Cited By ~ 5

Author(s):

Inae Jang ◽

Jae-ung Lee ◽

Jung-min Lee ◽

Beom Hee Kim ◽

Bongjin Moon ◽

...

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Erectile Dysfunction ◽

Similarity Search ◽

Search Algorithms ◽

Neural Network Classification ◽

Artificial Neural ◽

Peak Count

Download Full-text

Visualizing the Bohr effect in hemoglobin: neutron structure of equine cyanomethemoglobin in the R state and comparison with human deoxyhemoglobin in the T state

Acta Crystallographica Section D Structural Biology ◽

10.1107/s2059798316009049 ◽

2016 ◽

Vol 72 (7) ◽

pp. 892-903 ◽

Cited By ~ 5

Author(s):

Steven Dajnowicz ◽

Sean Seaver ◽

B. Leif Hanson ◽

S. Zoë Fisher ◽

Paul Langan ◽

...

Keyword(s):

Structural Changes ◽

Accurate Determination ◽

Bohr Effect ◽

Salt Bridges ◽

Histidine Residues ◽

Regions Of Stability ◽

The Individual ◽

T State

Neutron crystallography provides direct visual evidence of the atomic positions of deuterium-exchanged H atoms, enabling the accurate determination of the protonation/deuteration state of hydrated biomolecules. Comparison of two neutron structures of hemoglobins, human deoxyhemoglobin (T state) and equine cyanomethemoglobin (R state), offers a direct observation of histidine residues that are likely to contribute to the Bohr effect. Previous studies have shown that the T-state N-terminal and C-terminal salt bridges appear to have a partial instead of a primary overall contribution. Four conserved histidine residues [αHis72(EF1), αHis103(G10), αHis89(FG1), αHis112(G19) and βHis97(FG4)] can become protonated/deuterated from the R to the T state, while two histidine residues [αHis20(B1) and βHis117(G19)] can lose a proton/deuteron. αHis103(G10), located in the α1:β1dimer interface, appears to be a Bohr group that undergoes structural changes: in the R state it is singly protonated/deuterated and hydrogen-bonded through a water network to βAsn108(G10) and in the T state it is doubly protonated/deuterated with the network uncoupled. The very long-term H/D exchange of the amide protons identifies regions that are accessible to exchange as well as regions that are impermeable to exchange. The liganded relaxed state (R state) has comparable levels of exchange (17.1% non-exchanged) compared with the deoxy tense state (T state; 11.8% non-exchanged). Interestingly, the regions of non-exchanged protons shift from the tetramer interfaces in the T-state interface (α1:β2and α2:β1) to the cores of the individual monomers and to the dimer interfaces (α1:β1and α2:β2) in the R state. The comparison of regions of stability in the two states allows a visualization of the conservation of fold energy necessary for ligand binding and release.

Download Full-text

Circumspect descent prevails in solving random constraint satisfaction problems

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.0712263105 ◽

2008 ◽

Vol 105 (40) ◽

pp. 15253-15257 ◽

Cited By ~ 30

Author(s):

Mikko Alava ◽

John Ardelius ◽

Erik Aurell ◽

Petteri Kaski ◽

Supriya Krishnamurthy ◽

...

Keyword(s):

Local Search ◽

Linear Time ◽

Search Algorithm ◽

Solution Space ◽

Search Algorithms ◽

Constraint Satisfaction Problems ◽

Problem Instance ◽

Stochastic Local Search ◽

Local Search Algorithms ◽

Local Energy Minimum

We study the performance of stochastic local search algorithms for random instances of the K-satisfiability (K-SAT) problem. We present a stochastic local search algorithm, ChainSAT, which moves in the energy landscape of a problem instance by never going upwards in energy. ChainSAT is a focused algorithm in the sense that it focuses on variables occurring in unsatisfied clauses. We show by extensive numerical investigations that ChainSAT and other focused algorithms solve large K-SAT instances almost surely in linear time, up to high clause-to-variable ratios α; for example, for K = 4 we observe linear-time performance well beyond the recently postulated clustering and condensation transitions in the solution space. The performance of ChainSAT is a surprise given that by design the algorithm gets trapped into the first local energy minimum it encounters, yet no such minima are encountered. We also study the geometry of the solution space as accessed by stochastic local search algorithms.

Download Full-text

Distributed similarity search algorithm in distributed heterogeneous multimedia databases

Information Processing Letters ◽

10.1016/s0020-0190(00)00068-5 ◽

2000 ◽

Vol 75 (1-2) ◽

pp. 35-42 ◽

Cited By ~ 7

Author(s):

Ju-Hong Lee ◽

Deok-Hwan Kim ◽

Seok-Lyong Lee ◽

Chin-Wan Chung ◽

Guang-Ho Cha

Keyword(s):

Similarity Search ◽

Search Algorithm ◽

Multimedia Databases

Download Full-text

Finding A Small Vertex Cover in Massive Sparse Graphs: Construct, Local Search, and Preprocess

Journal of Artificial Intelligence Research ◽

10.1613/jair.5443 ◽

2017 ◽

Vol 59 ◽

pp. 463-494 ◽

Cited By ~ 6

Author(s):

Shaowei Cai ◽

Jinkun Lin ◽

Chuan Luo

Keyword(s):

Local Search ◽

Real World ◽

Large Scale ◽

Heuristic Algorithms ◽

Search Algorithm ◽

Vertex Cover ◽

Search Algorithms ◽

Theory And Practice ◽

Sparse Graphs ◽

Massive Graphs

The problem of finding a minimum vertex cover (MinVC) in a graph is a well known NP-hard combinatorial optimization problem of great importance in theory and practice. Due to its NP-hardness, there has been much interest in developing heuristic algorithms for finding a small vertex cover in reasonable time. Previously, heuristic algorithms for MinVC have focused on solving graphs of relatively small size, and they are not suitable for solving massive graphs as they usually have high-complexity heuristics. This paper explores techniques for solving MinVC in very large scale real-world graphs, including a construction algorithm, a local search algorithm and a preprocessing algorithm. Both the construction and search algorithms are based on low-complexity heuristics, and we combine them to develop a heuristic algorithm for MinVC called FastVC. Experimental results on a broad range of real-world massive graphs show that, our algorithms are very fast and have better performance than previous heuristic algorithms for MinVC. We also develop a preprocessing algorithm to simplify graphs for MinVC algorithms. By applying the preprocessing algorithm to local search algorithms, we obtain two efficient MinVC solvers called NuMVC2+p and FastVC2+p, which show further improvement on the massive graphs.

Download Full-text

Similarity search without tears: the OMNI-family of all-purpose access methods

Proceedings 17th International Conference on Data Engineering ◽

10.1109/icde.2001.914877 ◽

2002 ◽

Cited By ~ 27

Author(s):

R.F.S. Filho ◽

A. Traina ◽

C. Traina ◽

C. Faloutsos

Keyword(s):

Similarity Search ◽

Access Methods

Download Full-text

Time-Aware Similarity Search: A Metric-Temporal Representation for Complex Data

Advances in Spatial and Temporal Databases - Lecture Notes in Computer Science ◽

10.1007/978-3-642-02982-0_20 ◽

2009 ◽

pp. 302-319 ◽

Cited By ~ 2

Author(s):

Renato Bueno ◽

Daniel S. Kaster ◽

Agma Juci Machado Traina ◽

Caetano Traina

Keyword(s):

Similarity Search ◽

Complex Data ◽

Temporal Representation ◽

Time Aware

Download Full-text

Recent Results on the Application of a Metric — Space Search Algorithm (AESA) to Multispeaker Data

Recent Advances in Speech Understanding and Dialog Systems ◽

10.1007/978-3-642-83476-9_27 ◽

1988 ◽

pp. 285-289

Author(s):

Enrique Vidal ◽

M. José Lloret

Keyword(s):

Metric Space ◽

Search Algorithm

Download Full-text

On the Similarity Search With Hamming Space Sketches

Advances in Data Mining and Database Management - Intelligent Analytics With Advanced Multi-Industry Applications ◽

10.4018/978-1-7998-4963-6.ch005 ◽

2021 ◽

pp. 97-127

Author(s):

Vladimir Mic ◽

Pavel Zezula

Keyword(s):

Computational Complexity ◽

Metric Space ◽

Similarity Search ◽

Space Model ◽

Similarity Query ◽

Speed Up ◽

Hamming Space ◽

Definition Of ◽

Metric Function ◽

Selection Of

This chapter focuses on data searching, which is nowadays mostly based on similarity. The similarity search is challenging due to its computational complexity, and also the fact that similarity is subjective and context dependent. The authors assume the metric space model of similarity, defined by the domain of objects and the metric function that measures the dissimilarity of object pairs. The volume of contemporary data is large, and the time efficiency of similarity query executions is essential. This chapter investigates transformations of metric space to Hamming space to decrease the memory and computational complexity of the search. Various challenges of the similarity search with sketches in the Hamming space are addressed, including the definition of sketching transformation and efficient search algorithms that exploit sketches to speed-up searching. The indexing of Hamming space and a heuristic to facilitate the selection of a suitable sketching technique for any given application are also considered.

Download Full-text

Quantum dialogue protocol based on Grover’s search algorithms

Modern Physics Letters A ◽

10.1142/s0217732319501694 ◽

2019 ◽

Vol 34 (21) ◽

pp. 1950169

Author(s):

Aihan Yin ◽

Kemeng He ◽

Ping Fan

Keyword(s):

Heuristic Search ◽

Search Algorithm ◽

Search Algorithms ◽

Quantum Dialogue ◽

Quantum Search ◽

Decoy State ◽

Quantum Search Algorithm ◽

Heuristic Search Algorithms ◽

Security Information

Among many classic heuristic search algorithms, the Grover quantum search algorithm (QSA) can play a role of secondary acceleration. Based on the properties of the two-qubit Grover QSA, a quantum dialogue (QD) protocol is proposed. In addition, our protocol also utilizes the unitary operations and single-particle measurements. The transmitted quantum state (except for the decoy state used for detection) can transmit two-bits of security information simultaneously. Theoretical analysis shows that the proposed protocol has high security.

Download Full-text