Gravitational community detection by predicting diameter

Community detection is a pivotal part of network analysis and is classified as an NP-hard problem. This paper proposes a novel community detection algorithm that probabilistically predicts the diameter of communities using the local information of random seed nodes. A gravitation method is then applied to discover the communities surrounding the seed nodes, and the individual communities are combined to obtain the community structure of the whole network. The proposed algorithm, named the Local Gravitational Community Detection Algorithm (LGCDA), can also handle overlapping communities. LGCDA is evaluated with quality metrics and ground-truth data by comparing it with several widely used community detection algorithms on synthetic and real-world networks.
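The abstract does not say which ground-truth measure is used. As an illustration only, the sketch below computes normalized mutual information (NMI), a common choice for comparing a detected partition with ground-truth labels. It assumes flat, non-overlapping labelings, so it does not cover the overlapping case mentioned above; the function name and the toy labelings are hypothetical.

```python
from collections import Counter
from math import log


def normalized_mutual_information(labels_a, labels_b):
    """NMI between two flat (non-overlapping) labelings of the same nodes."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same nodes")
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    joint = Counter(zip(labels_a, labels_b))

    # Mutual information: sum over label pairs of p(a,b) * log(p(a,b) / (p(a) * p(b))).
    mutual_info = sum(
        (n_ab / n) * log(n_ab * n / (count_a[a] * count_b[b]))
        for (a, b), n_ab in joint.items()
    )
    entropy_a = -sum((c / n) * log(c / n) for c in count_a.values())
    entropy_b = -sum((c / n) * log(c / n) for c in count_b.values())
    if entropy_a + entropy_b == 0:
        return 1.0  # both labelings place every node in a single community
    # Normalize by the arithmetic mean of the two entropies.
    return 2 * mutual_info / (entropy_a + entropy_b)


# Hypothetical toy data: six nodes, two ground-truth communities.
ground_truth = [0, 0, 0, 1, 1, 1]
detected_same = [5, 5, 5, 9, 9, 9]  # same grouping under different label names
detected_off = [0, 0, 1, 1, 1, 1]   # one node assigned to the wrong community

score_same = normalized_mutual_information(ground_truth, detected_same)
score_off = normalized_mutual_information(ground_truth, detected_off)
```

NMI depends only on how nodes are grouped, not on the label names, so `score_same` reaches the maximum of 1 while `score_off` falls strictly between 0 and 1.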

Community detection algorithm evaluation with ground-truth data

Physica A Statistical Mechanics and its Applications ◽

10.1016/j.physa.2017.10.018 ◽

2018 ◽

Vol 492 ◽

pp. 651-706 ◽

Cited By ~ 23

Author(s):

Malek Jebabli ◽

Hocine Cherifi ◽

Chantal Cherifi ◽

Atef Hamouda

Keyword(s):

Community Detection ◽

Ground Truth ◽

Detection Algorithm ◽

Ground Truth Data ◽

Algorithm Evaluation ◽

Community Detection Algorithm

Overlapping Community Detection Based on Attribute Augmented Graph

Entropy ◽

10.3390/e23060680 ◽

2021 ◽

Vol 23 (6) ◽

pp. 680

Author(s):

Hanyang Lin ◽

Yongzhao Zhan ◽

Zizheng Zhao ◽

Yuzhong Chen ◽

Chen Dong

Keyword(s):

Community Detection ◽

Real World ◽

Detection Algorithm ◽

Overlapping Community Detection ◽

Overlapping Communities ◽

Adjustment Strategy ◽

Topology Information ◽

Overlapping Community ◽

Real World Datasets ◽

Community Detection Algorithm

There is a wealth of information in real-world social networks. In addition to the topology information, the vertices or edges of a social network often have attributes, with many of the overlapping vertices belonging to several communities simultaneously. It is challenging to fully utilize the additional attribute information to detect overlapping communities. In this paper, we first propose an overlapping community detection algorithm based on an augmented attribute graph. An improved weight adjustment strategy for attributes is embedded in the algorithm to help detect overlapping communities more accurately. Second, we enhance the algorithm to automatically determine the number of communities by a node-density-based fuzzy k-medoids process. Extensive experiments on both synthetic and real-world datasets demonstrate that the proposed algorithms can effectively detect overlapping communities with fewer parameters compared to the baseline methods.

WattScale

ACM/IMS Transactions on Data Science ◽

10.1145/3406961 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-25

Author(s):

Srinivasan Iyengar ◽

Stephen Lee ◽

David Irwin ◽

Prashant Shenoy ◽

Benjamin Weil

Keyword(s):

Cooling System ◽

Large Population ◽

Ground Truth ◽

Detection Algorithm ◽

Building Envelope ◽

Ground Truth Data ◽

Heating And Cooling ◽

Data Driven Approach ◽

Execution Mode ◽

The Individual

Buildings consume over 40% of the total energy in modern societies, and improving their energy efficiency can significantly reduce our energy footprint. In this article, we present WattScale, a data-driven approach to identify the least energy-efficient buildings from a large population of buildings in a city or a region. Unlike previous methods such as least-squares that use point estimates, WattScale uses Bayesian inference to capture the stochasticity in the daily energy usage by estimating the distribution of parameters that affect a building. Further, it compares them with similar homes in a given population. WattScale also incorporates a fault detection algorithm to identify the underlying causes of energy inefficiency. We validate our approach using ground truth data from different geographical locations, which showcases its applicability in various settings. WattScale has two execution modes, (i) individual and (ii) region-based, which we highlight using two case studies. For the individual execution mode, we present results from a city containing >10,000 buildings and show that more than half of the buildings are inefficient in one way or another, indicating a significant potential for energy improvement measures. Additionally, we provide the probable cause of inefficiency and find that 41%, 23.73%, and 0.51% of homes have a poor building envelope, heating system faults, and cooling system faults, respectively. For the region-based execution mode, we show that WattScale can be extended to millions of homes in the U.S. due to the recent availability of representative energy datasets.

Community detection in complex network by network embedding and density clustering

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202961 ◽

2021 ◽

pp. 1-12

Author(s):

JinFang Sheng ◽

Huaiyu Zuo ◽

Bin Wang ◽

Qiong Li

Keyword(s):

Complex Network ◽

Community Detection ◽

Dimensional Space ◽

Detection Algorithm ◽

Superior Performance ◽

Network Embedding ◽

Detection Algorithms ◽

Density Clustering ◽

Community Detection Algorithm ◽

Low Dimensional

In a complex network system, the structure of the network is an extremely important element for the analysis of the system, and the study of community detection algorithms is key to exploring that structure. Traditional community detection algorithms typically represent the network with an adjacency matrix based on observations, which may contain redundant information or noise that interferes with the detection results. In this paper, we propose a community detection algorithm based on density clustering. To improve the performance of density clustering, we consider an algorithmic framework for learning a continuous representation of network nodes in a low-dimensional space. The network structure is effectively preserved through network embedding, and density clustering is applied in the embedded low-dimensional space to compute the similarity of nodes in the network, which in turn reveals the implied structure of a given network. Experiments show that the algorithm has superior performance compared to other advanced community detection algorithms on real-world networks in multiple domains as well as on synthetic networks, especially when the network data are highly chaotic.

Using bootstrap procedures for testing the modular partition inferred via leading eigenvector community detection algorithm

10.1101/2021.10.04.462969 ◽

2021 ◽

Author(s):

Oksana Vertsimakha ◽

Igor Dzeverin

Keyword(s):

Community Detection ◽

Detection Algorithm ◽

Fine Tuning ◽

Modular Structure ◽

Biological Organization ◽

Detection Algorithms ◽

Community Detection Algorithm ◽

Numeric Data ◽

Bootstrap Algorithm ◽

Modular Structures

Modularity and modular structures can be recognized at various levels of biological organization and in various domains of study. Recently, algorithms based on network analysis have come into focus. While such a framework is a powerful tool for studying modular structure, these methods usually pose the problem of assessing statistical support for the modular structures obtained. One of the widely applied methods is the leading eigenvector, or Newman’s spectral community detection algorithm. We give a brief overview of the method, including a comparison with some other community detection algorithms, and explore a possible fine-tuning procedure. Finally, we propose an adapted bootstrap-based procedure, based on Shimodaira’s multiscale bootstrap algorithm, to derive approximately unbiased p-values for the module partitions of observational datasets. The proposed procedure also gives the researcher a lot of freedom in constructing the network from the raw numeric data, and it can be applied to various types of data and used in diverse problems concerning modular structure. We provide R code for all the calculations and for the visualization of the obtained results for researchers interested in using the procedure.
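A minimal sketch of the first bisection step of the leading eigenvector method may help make the idea concrete. It is not the authors' R code: it is a plain-Python illustration that builds the modularity matrix B_ij = A_ij - k_i k_j / (2m), finds its leading eigenvector by shifted power iteration, and splits the nodes by the sign of the eigenvector entries. The function name, the toy graph, and the iteration count are assumptions, the graph is assumed to be undirected, unweighted, and to contain at least one edge, and the fine-tuning and bootstrap steps described above are not included.

```python
from math import sqrt


def leading_eigenvector_split(edges, num_nodes, iterations=500):
    """Split nodes in two by the sign of the modularity matrix's leading eigenvector."""
    # Adjacency matrix and degrees of an undirected, unweighted graph.
    adj = [[0.0] * num_nodes for _ in range(num_nodes)]
    for u, v in edges:
        adj[u][v] = adj[v][u] = 1.0
    degree = [sum(row) for row in adj]
    two_m = sum(degree)  # twice the number of edges

    # Modularity matrix B_ij = A_ij - k_i * k_j / (2m).
    mod = [
        [adj[i][j] - degree[i] * degree[j] / two_m for j in range(num_nodes)]
        for i in range(num_nodes)
    ]

    # Shifting by the largest absolute row sum makes every eigenvalue
    # nonnegative, so power iteration converges to the eigenvector of the
    # algebraically largest eigenvalue of the modularity matrix.
    shift = max(sum(abs(x) for x in row) for row in mod)

    vec = [float(i + 1) for i in range(num_nodes)]  # deterministic start vector
    for _ in range(iterations):
        nxt = [
            sum(mod[i][j] * vec[j] for j in range(num_nodes)) + shift * vec[i]
            for i in range(num_nodes)
        ]
        norm = sqrt(sum(x * x for x in nxt))
        vec = [x / norm for x in nxt]

    group_a = {i for i, x in enumerate(vec) if x >= 0}
    group_b = set(range(num_nodes)) - group_a
    return group_a, group_b


# Hypothetical toy graph: two triangles joined by the single edge (2, 3).
toy_edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
part_a, part_b = leading_eigenvector_split(toy_edges, 6)
```

On this toy graph the sign pattern separates the two triangles. A complete implementation would also check that the leading eigenvalue is positive before accepting a split and would then repeat the bisection within each group.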

Ego-zones: non-symmetric dependencies reveal network groups with large and dense overlaps

Applied Network Science ◽

10.1007/s41109-019-0192-6 ◽

2019 ◽

Vol 4 (1) ◽

Cited By ~ 1

Author(s):

Milos Kudelka ◽

Eliska Ochodkova ◽

Sarka Zehnalova ◽

Jakub Plesnik

Keyword(s):

Network Structure ◽

Real World ◽

Ground Truth ◽

Structural Similarity ◽

Global Network ◽

Overlapping Communities ◽

Detection Algorithms ◽

First Case ◽

New Perspective ◽

The Individual

The existence of groups of nodes with common characteristics and the relationships between these groups are important factors influencing the structures of social, technological, biological, and other networks. Uncovering such groups and the relationships between them is, therefore, necessary for understanding these structures. Groups can either be found by detection algorithms based solely on structural analysis or identified on the basis of more in-depth knowledge of the processes taking place in networks. In the first case, these are mainly algorithms detecting non-overlapping communities or communities with small overlaps. The latter case is about identifying ground-truth communities, also on the basis of characteristics other than only network structure. Recent research into ground-truth communities shows that in real-world networks, there are nested communities or communities with large and dense overlaps which we are not yet able to detect satisfactorily only on the basis of structural network properties. In our approach, we present a new perspective on the problem of group detection using only the structural properties of networks. Its main contribution is pointing out the existence of large and dense overlaps of detected groups. We use the non-symmetric structural similarity between pairs of nodes, which we refer to as dependency, to detect groups that we call zones. Unlike other approaches, we are able, thanks to non-symmetry, to describe accurately the prominent nodes in the zones which are responsible for large zone overlaps and the reasons why overlaps occur. The individual zones that are detected provide new information associated in particular with the non-symmetric relationships within the group and the roles that individual nodes play in the zone. From the perspective of global network structure, because of the non-symmetric node-to-node relationships, we explore new properties of real-world networks that describe the differences between various types of networks.

Recent trends on community detection algorithms: A survey

Modern Physics Letters B ◽

10.1142/s0217984920504084 ◽

2020 ◽

Vol 34 (35) ◽

pp. 2050408

Author(s):

Sumit Gupta ◽

Dhirendra Pratap Singh

Keyword(s):

Community Detection ◽

Real Life ◽

Detection Algorithm ◽

Data Sets ◽

Detection Algorithms ◽

Life Problems ◽

Application Data ◽

Art Community ◽

Community Detection Algorithm ◽

Two Parameters

Many real-life problems and application data can be represented as graphs. As technology advances rapidly, applications generate vast amounts of valuable data, which increases the size of the graphs that represent them. How to extract meaningful information from these data has become a hot research topic, and methodical algorithms are required to do so. These unstructured graphs are not scattered in nature but show relationships between their basic entities. Identifying communities based on these relationships improves the understanding of the applications represented by the graphs. Community detection algorithms are one solution: they divide the graph into small clusters whose nodes are densely connected within the cluster and sparsely connected across clusters. During the last decade, many algorithms have been proposed, which fall into two broad categories: non-overlapping and overlapping community detection algorithms. The goal of this paper is to offer a comparative analysis of the various community detection algorithms. We bring together the state-of-the-art community detection algorithms of these two classes in a single article, along with their accessible benchmark data sets. Finally, we present a comparison of these algorithms with respect to two parameters: time efficiency and the accuracy with which communities are detected.

Evaluating the role of community detection in improving influence maximization heuristics

Social Network Analysis and Mining ◽

10.1007/s13278-021-00804-5 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

László Hajdu ◽

Miklós Krész ◽

András Bóta

Keyword(s):

Community Detection ◽

Real Life ◽

Detection Algorithm ◽

Influence Maximization ◽

Overlapping Community Detection ◽

Detection Algorithms ◽

Overlapping Community ◽

Community Detection Algorithm ◽

The Cost

Both community detection and influence maximization are well-researched fields of network science. Here, we investigate how several popular community detection algorithms can be used as part of a heuristic approach to influence maximization. The heuristic is based on the community value, a node-based metric defined on the outputs of overlapping community detection algorithms. This metric is used to select nodes as high influence candidates for expanding the set of influential nodes. Our aim in this paper is twofold. First, we evaluate the performance of eight frequently used overlapping community detection algorithms on this specific task to show how much improvement can be gained compared to the originally proposed method of Kempe et al. Second, selecting the community detection algorithm(s) with the best performance, we propose a variant of the influence maximization heuristic with significantly reduced runtime, at the cost of slightly reduced quality of the output. We use both artificial benchmarks and real-life networks to evaluate the performance of our approach.

Multilayer Social Network Overlapping Community Detection Algorithm Based on Trust Relationship

Wireless Communications and Mobile Computing ◽

10.1155/2021/9268039 ◽

2021 ◽

Vol 2021 ◽

pp. 1-14

Author(s):

Junjie Jia ◽

Pengtao Liu ◽

Xiaojin Du ◽

Yuchao Zhang

Keyword(s):

Social Network ◽

Community Detection ◽

Detection Algorithm ◽

Experimental Comparison ◽

Overlapping Community Detection ◽

Trust Value ◽

Overlapping Communities ◽

Trust Relationship ◽

Overlapping Community ◽

Community Detection Algorithm

To address the lack of user social attribute characteristics when dividing overlapping communities in multilayer social networks, we propose a multilayer social network overlapping community detection algorithm based on trust relationships. By combining structural trust and social attribute trust, we transform a complex multilayer social network into a single-layer trust network. We obtain the community structure with a community discovery algorithm based on trust value and merge communities with higher overlap. Experimental comparison and analysis are carried out on a synthetic network and a real network. The results show that the proposed algorithm achieves a higher harmonic mean and modularity than other algorithms of the same type.
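For readers unfamiliar with the modularity score mentioned above, the sketch below computes the standard Newman modularity of a non-overlapping partition of a single-layer, unweighted graph. The abstract does not state which modularity variant the paper uses, and an overlapping or multilayer extension may well apply there, so this is an illustration of the basic quantity only. The function name and the toy graph are hypothetical.

```python
from collections import Counter


def modularity(edges, community_of):
    """Newman modularity of a non-overlapping partition of an undirected graph.

    Q = sum over communities c of (L_c / m) - (D_c / (2m)) ** 2, where m is
    the number of edges, L_c the number of edges inside c, and D_c the total
    degree of the nodes in c.
    """
    m = len(edges)
    degree = Counter()
    intra_edges = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
        if community_of[u] == community_of[v]:
            intra_edges[community_of[u]] += 1

    degree_sum = Counter()
    for node, k in degree.items():
        degree_sum[community_of[node]] += k

    return sum(
        intra_edges[c] / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum
    )


# Hypothetical toy graph: two triangles joined by the single edge (2, 3).
toy_graph = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
by_triangle = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
all_in_one = {node: "a" for node in range(6)}

q_triangles = modularity(toy_graph, by_triangle)
q_single = modularity(toy_graph, all_in_one)
```

Grouping the nodes by triangle gives Q = 5/14, while placing every node in one community gives Q = 0, which matches the intuition that a higher score reflects denser connections within communities than across them.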

Clustering Algorithm for Community Detection in Complex Network: A Comprehensive Review

Recent Advances in Computer Science and Communications ◽

10.2174/2213275912666190710183635 ◽

2020 ◽

Vol 13 (4) ◽

pp. 542-549

Author(s):

Smita Agrawal ◽

Atul Patel

Keyword(s):

Complex Network ◽

Community Detection ◽

Large Scale ◽

Clustering Algorithm ◽

Detection Algorithm ◽

Connected Subgraph ◽

Comprehensive Review ◽

Detection Algorithms ◽

Community Detection Algorithm ◽

Large Scale Networks

Many real-world social networks exist in the form of complex networks, which include very large-scale networks with structured or unstructured data and sets of graphs. Such complex networks appear as brain graphs, protein structures, food webs, transportation systems, and the World Wide Web; they are sparsely connected overall, while most of their subgraphs are densely connected. Because of the scale of large graphs, efficient graph generation, complexity, the dynamic nature of graphs, and community detection are challenging tasks. Various community detection algorithms that use clustering techniques to find densely connected subgraphs in large-scale complex networks are discussed here. We discuss the taxonomy of community detection algorithms such as the Structural Clustering Algorithm for Networks (SCAN), Structural-Attribute based Cluster (SA-cluster), and Community Detection based on Hierarchical Clustering (CDHC). This comprehensive review classifies community detection algorithms by their approach, the datasets used in their experimental studies, and the measures used to evaluate them. Finally, insights into the future scope and research opportunities for community detection are discussed.
