clique algorithm Latest Research Papers

Finding a maximum clique is important in research areas such as computational chemistry, social network analysis, and bioinformatics. It is possible to compare the maximum clique size between protein graphs to determine their similarity and function. In this paper, improvements based on machine learning (ML) are added to a dynamic algorithm for finding the maximum clique in a protein graph, Maximum Clique Dynamic (MaxCliqueDyn; short: MCQD). This algorithm was published in 2007 and has been widely used in bioinformatics since then. It uses an empirically determined parameter, Tlimit, that determines the algorithm’s flow. We have extended the MCQD algorithm with an initial phase of a machine learning-based prediction of the Tlimit parameter that is best suited for each input graph. Such adaptability to graph types based on state-of-the-art machine learning is a novel approach that has not been used in most graph-theoretic algorithms. We show empirically that the resulting new algorithm MCQD-ML improves search speed on certain types of graphs, in particular molecular docking graphs used in drug design where they determine energetically favorable conformations of small molecules in a protein binding site. In such cases, the speed-up is twofold.

Download Full-text

AN EMPIRICAL RESEARCH OF CRITICAL INCIDENT OF EARTHQUAKE DISASTER BASEDON CREDIBLE ASSOCIATION DATA MINING

International Journal of Global Operations Research ◽

10.47194/ijgor.v2i1.65 ◽

2021 ◽

Vol 2 (1) ◽

pp. 7-16

Author(s):

Mochamad Suyudi ◽

Sukono Sukono

Keyword(s):

Data Mining ◽

Association Rules ◽

Early Warning ◽

Critical Incident ◽

Maximum Clique ◽

Logistic Distribution ◽

Critical Events ◽

The Past ◽

Association Data ◽

Clique Algorithm

Earthquake disasters usually cause panic in the community affected areas, so it is necessary to be analyzed to deal with earthquake events in the future. This paper analyzes data from 9 major earthquakes in Indonesia over the past 4 years and determines 14 critical events. The analysis is based on credible association rules (CAR), data mining, and the maximum clique algorithm. To verify the accuracy of the association relationship and CAR effectiveness, it is performed using a maximum clique algorithm. Based on the results of data mining, that earthquakes have a credible association relationship and have a probability of critical events in various regions in Indonesia. Thus, these results can be used for prediction, early warning, and logistic distribution planning.

Download Full-text

Scalable mining of maximal quasi-cliques

Proceedings of the VLDB Endowment ◽

10.14778/3436905.3436916 ◽

2020 ◽

Vol 14 (4) ◽

pp. 573-585

Author(s):

Guimu Guo ◽

Da Yan ◽

M. Tamer Özsu ◽

Zhe Jiang ◽

Jalal Khalil

Keyword(s):

Graph Mining ◽

State Of The Art ◽

Minimum Degree ◽

The Other ◽

Dense Subgraph ◽

Load Imbalance ◽

Subgraph Mining ◽

Execution Engine ◽

Clique Algorithm ◽

Dense Subgraph Mining

Given a user-specified minimum degree threshold γ , a γ -quasiclique is a subgraph g = (V g , E g ) where each vertex ν ∈ V g connects to at least γ fraction of the other vertices (i.e., ⌈ γ · (| V g |- 1)⌉ vertices) in g. Quasi-clique is one of the most natural definitions for dense structures useful in finding communities in social networks and discovering significant biomolecule structures and pathways. However, mining maximal quasi-cliques is notoriously expensive. In this paper, we design parallel algorithms for mining maximal quasi-cliques on G-thinker, a distributed graph mining framework that decomposes mining into compute-intensive tasks to fully utilize CPU cores. We found that directly using G-thinker results in the straggler problem due to (i) the drastic load imbalance among different tasks and (ii) the difficulty of predicting the task running time. We address these challenges by redesigning G-thinker's execution engine to prioritize long-running tasks for execution, and by utilizing a novel timeout strategy to effectively decompose long-running tasks to improve load balancing. While this system redesign applies to many other expensive dense subgraph mining problems, this paper verifies the idea by adapting the state-of-the-art quasi-clique algorithm, Quick, to our redesigned G-thinker. Extensive experiments verify that our new solution scales well with the number of CPU cores, achieving 201× runtime speedup when mining a graph with 3.77M vertices and 16.5M edges in a 16-node cluster.

Download Full-text

The Use of an Exact Algorithm within a Tabu Search Maximum Clique Algorithm

Algorithms ◽

10.3390/a13100253 ◽

2020 ◽

Vol 13 (10) ◽

pp. 253 ◽

Cited By ~ 1

Author(s):

Derek H. Smith ◽

Roberto Montemanni ◽

Stephanie Perkins

Keyword(s):

Tabu Search ◽

Undirected Graph ◽

Search Algorithm ◽

Optimal Solution ◽

Maximum Clique ◽

Exact Algorithm ◽

Maximum Clique Problem ◽

Tabu Search Algorithm ◽

Vertex Set ◽

Clique Algorithm

Let G=(V,E) be an undirected graph with vertex set V and edge set E. A clique C of G is a subset of the vertices of V with every pair of vertices of C adjacent. A maximum clique is a clique with the maximum number of vertices. A tabu search algorithm for the maximum clique problem that uses an exact algorithm on subproblems is presented. The exact algorithm uses a graph coloring upper bound for pruning, and the best such algorithm to use in this context is considered. The final tabu search algorithm successfully finds the optimal or best known solution for all standard benchmarks considered. It is compared with a state-of-the-art algorithm that does not use exact search. It is slower to find the known optimal solution for most instances but is faster for five instances and finds a larger clique for two instances.

Download Full-text

Reversed Search Maximum Clique Algorithm Based on Recoloring

Advances in Intelligent Systems and Computing - Optimization of Complex Systems: Theory, Models, Algorithms and Applications ◽

10.1007/978-3-030-21803-4_46 ◽

2019 ◽

pp. 458-467

Author(s):

Deniss Kumlander ◽

Aleksandr Porošin

Keyword(s):

Maximum Clique ◽

Clique Algorithm

Download Full-text

Data Mining for Social Network Analysis Using a CLIQUE Algorithm

Advances in Social Networking and Online Communities - Cognitive Social Mining Applications in Data Analytics and Forensics ◽

10.4018/978-1-5225-7522-1.ch009 ◽

2019 ◽

pp. 160-187

Author(s):

Phu Ngoc Vo ◽

Tran Vo Thi Ngoc

Keyword(s):

Data Mining ◽

Social Networks ◽

Social Network ◽

Social Network Analysis ◽

Network Analysis ◽

The Social ◽

The World ◽

Commercial Applications ◽

Clique Algorithm

Many different areas of computer science have been developed for many years in the world. Data mining is one of the fields which many algorithms, methods, and models have been built and applied to many commercial applications and research successfully. Many social networks have been invested and developed in the strongest way for the recent years in the world because they have had many big benefits as follows: they have been used by lots of users in the world and they have been applied to many business fields successfully. Thus, a lot of different techniques for the social networks have been generated. Unsurprisingly, the social network analysis is crucial at the present time in the world. To support this process, in this book chapter we have presented many simple concepts about data mining and social networking. In addition, we have also displayed a novel model of the data mining for the social network analysis using a CLIQUE algorithm successfully.

Download Full-text

Uniform Adaptive Testing Using Maximum Clique Algorithm

Lecture Notes in Computer Science - Artificial Intelligence in Education ◽

10.1007/978-3-030-23204-7_40 ◽

2019 ◽

pp. 482-493

Author(s):

Maomi Ueno ◽

Yoshimitsu Miyazawa

Keyword(s):

Maximum Clique ◽

Adaptive Testing ◽

Clique Algorithm

Download Full-text

MAPSkew: Metaheuristic Approaches for Partitioning Skew in MapReduce

Algorithms ◽

10.3390/a12010005 ◽

2018 ◽

Vol 12 (1) ◽

pp. 5

Author(s):

Matheus Pericini ◽

Lucas Leite ◽

Francisco de Carvalho-Junior ◽

Javam Machado ◽

Cenez Rezende

Keyword(s):

Distribution Function ◽

Parallel Computing ◽

Simulated Annealing ◽

Processing Time ◽

Computational Effort ◽

Beam Search ◽

Large Dataset ◽

Computing Model ◽

Clique Algorithm ◽

Multiple Machines

MapReduce is a parallel computing model in which a large dataset is split into smaller parts and executed on multiple machines. Due to its simplicity, MapReduce has been widely used in various applications domains. MapReduce can significantly reduce the processing time of a large amount of data by dividing the dataset into smaller parts and processing them in parallel in multiple machines. However, when data are not uniformly distributed, we have the so called partitioning skew, where the allocation of tasks to machines becomes unbalanced, either by the distribution function splitting the dataset unevenly or because a part of the data is more complex and requires greater computational effort. To solve this problem, we propose an approach based on metaheuristics. For evaluating purposes, three metaheuristics were implemented: Simulated Annealing, Local Beam Search and Stochastic Beam Search. Our experimental evaluation, using a MapReduce implementation of the Bron-Kerbosch Clique Algorithm, shows that the proposed method can find good partitionings while better balancing data among machines.

Download Full-text