clustering algorithm Latest Research Papers

Forecasting Trend of Coronavirus Disease 2019 using Multi-Task Weighted TSK Fuzzy System

ACM Transactions on Internet Technology ◽

10.1145/3475870 ◽

2022 ◽

Vol 22 (3) ◽

pp. 1-24

Author(s):

Yizhang Jiang ◽

Xiaoqing Gu ◽

Lei Hua ◽

Kang Li ◽

Yuwen Tao ◽

...

Keyword(s):

Infectious Disease ◽

Fuzzy System ◽

Clustering Algorithm ◽

Edge Computing ◽

Model Parameters ◽

Parameter Learning ◽

The Public ◽

Learning Framework ◽

Task Learning ◽

L2 Norm

Artificial intelligence– (AI) based fog/edge computing has become a promising paradigm for infectious disease. Various AI algorithms are embedded in cooperative fog/edge devices to construct medical Internet of Things environments, infectious disease forecast systems, smart health, and so on. However, these systems are usually done in isolation, which is called single-task learning. They do not consider the correlation and relationship between multiple/different tasks, so some common information in the model parameters or data characteristics is lost. In this study, each data center in fog/edge computing is considered as a task in the multi-task learning framework. In such a learning framework, a multi-task weighted Takagi-Sugeno-Kang (TSK) fuzzy system, called MW-TSKFS, is developed to forecast the trend of Coronavirus disease 2019 (COVID-19). MW-TSKFS provides a multi-task learning strategy for both antecedent and consequent parameters of fuzzy rules. First, a multi-task weighted fuzzy c-means clustering algorithm is developed for antecedent parameter learning, which extracts the public information among all tasks and the private information of each task. By sharing the public cluster centroid and public membership matrix, the differences of commonality and individuality can be further exploited. For consequent parameter learning of MW-TSKFS, a multi-task collaborative learning mechanism is developed based on ε-insensitive criterion and L2 norm penalty term, which can enhance the generalization and forecasting ability of the proposed fuzzy system. The experimental results on the real COVID-19 time series show that the forecasting tend model based on multi-task the weighted TSK fuzzy system has a high application value.

Root cause analysis of COVID-19 cases by enhanced text mining process

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v12i2.pp1807-1817 ◽

2022 ◽

Vol 12 (2) ◽

pp. 1807

Author(s):

Sujatha Arun Kokatnoor ◽

Balachandran Krishnan

Keyword(s):

Dirichlet Process ◽

Clustering Algorithm ◽

Latent Dirichlet Allocation ◽

Semantic Analysis ◽

Learning Approaches ◽

The Public ◽

Root Cause ◽

Document Frequency ◽

Coherence Score ◽

Index Value

<p>The main focus of this research is to find the reasons behind the fresh cases of COVID-19 from the public’s perception for data specific to India. The analysis is done using machine learning approaches and validating the inferences with medical professionals. The data processing and analysis is accomplished in three steps. First, the dimensionality of the vector space model (VSM) is reduced with improvised feature engineering (FE) process by using a weighted term frequency-inverse document frequency (TF-IDF) and forward scan trigrams (FST) followed by removal of weak features using feature hashing technique. In the second step, an enhanced K-means clustering algorithm is used for grouping, based on the public posts from Twitter®. In the last step, latent dirichlet allocation (LDA) is applied for discovering the trigram topics relevant to the reasons behind the increase of fresh COVID-19 cases. The enhanced K-means clustering improved Dunn index value by 18.11% when compared with the traditional K-means method. By incorporating improvised two-step FE process, LDA model improved by 14% in terms of coherence score and by 19% and 15% when compared with latent semantic analysis (LSA) and hierarchical dirichlet process (HDP) respectively thereby resulting in 14 root causes for spike in the disease.</p>

A PSO Enable Multi-Hop Clustering Algorithm for VANET

International Journal of Swarm Intelligence Research ◽

10.4018/ijsir.20220401.oa7 ◽

2022 ◽

Vol 13 (2) ◽

pp. 1-14

Author(s):

Ankit Temurnikar ◽

Pushpneel Verma ◽

Gaurav Dhiman

Keyword(s):

Clustering Algorithm ◽

Ad Hoc ◽

Cluster Head ◽

Delivery Ratio ◽

Intelligent Transport System ◽

Network Clustering ◽

Malicious Node ◽

The Road ◽

On The Road ◽

Type Node

VANET (Vehicle Ad-hoc Network) is an emerging technology in today’s intelligent transport system. In VANET, there are many moving nodes which are called the vehicle running on the road. They communicate with each other to provide the information to driver regarding the road condition, traffic, weather and parking. VANET is a kind of network where moving nodes talk with each other with the help of equipment. There are various other things which also make complete to VANET like OBU (onboard unit), RSU (Road Aside Unit) and CA (Certificate authority). In this paper, a new PSO enable multi-hop technique is proposed which helps in VANET to Select the best route and find the stable cluster head and remove the malicious node from the network to avoid the false messaging. The false can be occurred when there is the malicious node in a network. Clustering is a technique for making a group of the same type node. This proposed work is based on PSO enable clustering and its importance in VANET. While using this approach in VANET, it has increased the 20% packet delivery ratio.

Research On Parallel Association Rules Mining Of Big Data Based On Improved K-Means Clustering Algorithm

International Journal of Autonomous and Adaptive Communications Systems ◽

10.1504/ijaacs.2023.10042660 ◽

2023 ◽

Vol 16 (3) ◽

pp. 1

Author(s):

Chaoping Guo ◽

Tuanbu Wang ◽

Li Hao

Keyword(s):

Big Data ◽

Association Rules ◽

Clustering Algorithm ◽

Association Rules Mining

Multi-task Fuzzy Clustering–Based Multi-task TSK Fuzzy System for Text Sentiment Classification

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3476103 ◽

2022 ◽

Vol 21 (2) ◽

pp. 1-24

Author(s):

Xiaoqing Gu ◽

Kaijian Xia ◽

Yizhang Jiang ◽

Alireza Jolfaei

Keyword(s):

Language Processing ◽

Fuzzy Clustering ◽

Private Information ◽

Fuzzy System ◽

Clustering Algorithm ◽

Public Information ◽

Sentiment Classification ◽

Fuzzy Rules ◽

Proposed Model ◽

The Common

Text sentiment classification is an important technology for natural language processing. A fuzzy system is a strong tool for processing imprecise or ambiguous data, and it can be used for text sentiment analysis. This article proposes a new formulation of a multi-task Takagi-Sugeno-Kang fuzzy system (TSK FS) modeling, which can be used for text sentiment image classification. Using a novel multi-task fuzzy c-means clustering algorithm, the common (public) information among all tasks and the individual (private) information for each task are extracted. The information about clustering, for example, cluster centers, can be used to learn the antecedent parameters of multi-task TSK fuzzy systems. With the common and individual antecedent parameters obtained, a corresponding multi-task learning mechanism for learning consequent parameters is devised. Accordingly, a multi-task fuzzy clustering–based multi-task TSK fuzzy system (MTFCM-MT-TSK-FS) is proposed. When the proposed model is built, the information conveyed by the fuzzy rules formed is two-fold, including (1) common fuzzy rules representing the inter-task correlation information and (2) individual fuzzy rules depicting the independent information of each task. The experimental results on several text sentiment datasets demonstrate the validity of the proposed model.

BotSpot++: A Hierarchical Deep Ensemble Model for Bots Install Fraud Detection in Mobile Advertising

ACM Transactions on Information Systems ◽

10.1145/3476107 ◽

2022 ◽

Vol 40 (3) ◽

pp. 1-28

Author(s):

Yadong Zhu ◽

Xiliang Wang ◽

Qing Li ◽

Tianjun Yao ◽

Shangsong Liang

Keyword(s):

Domain Knowledge ◽

Clustering Algorithm ◽

State Of The Art ◽

Fraud Detection ◽

Ensemble Model ◽

Graph Structure ◽

Mobile Advertising ◽

The World ◽

Advertising Company ◽

Noisy Labels

Mobile advertising has undoubtedly become one of the fastest-growing industries in the world. The influx of capital attracts increasing fraudsters to defraud money from advertisers. Fraudsters can leverage many techniques, where bots install fraud is the most difficult to detect due to its ability to emulate normal users by implementing sophisticated behavioral patterns to evade from detection rules defined by human experts. Therefore, we proposed BotSpot 1 for bots install fraud detection previously. However, there are some drawbacks in BotSpot, such as the sparsity of the devices’ neighbors, weak interactive information of leaf nodes, and noisy labels. In this work, we propose BotSpot++ to improve these drawbacks: (1) for the sparsity of the devices’ neighbors, we propose to construct a super device node to enrich the graph structure and information flow utilizing domain knowledge and a clustering algorithm; (2) for the weak interactive information, we propose to incorporate a self-attention mechanism to enhance the interaction of various leaf nodes; and (3) for the noisy labels, we apply a label smoothing mechanism to alleviate it. Comprehensive experimental results show that BotSpot++ yields the best performance compared with six state-of-the-art baselines. Furthermore, we deploy our model to the advertising platform of Mobvista, 2 a leading global mobile advertising company. The online experiments also demonstrate the effectiveness of our proposed method.

Enhancing the Job Scheduling Procedure to Develop an Efficient Cloud Environment using Near Optimal Clustering Algorithm

International Journal of Cloud Computing ◽

10.1504/ijcc.2023.10033597 ◽

2023 ◽

Vol 12 (2) ◽

pp. 1

Author(s):

Ramamoorthy S ◽

Suganya R ◽

Rajadevi R ◽

NIJU P. JOSEPH

Keyword(s):

Clustering Algorithm ◽

Job Scheduling ◽

Cloud Environment

On a two-stage progressive clustering algorithm with graph-augmented density peak clustering

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2021.104566 ◽

2022 ◽

Vol 108 ◽

pp. 104566

Author(s):

Xinzheng Niu ◽

Yunhong Zheng ◽

Wuji Liu ◽

Chase Q. Wu

Keyword(s):

Clustering Algorithm ◽

Two Stage ◽

Density Peak ◽

Density Peak Clustering

Effective structural unit analysis in hexagonal close-packed alloys – reconstruction of parent β microstructures and crystal orientation post-processing analysis

Journal of Applied Crystallography ◽

10.1107/s1600576721011584 ◽

2022 ◽

Vol 55 (1) ◽

Author(s):

Ruth Birch ◽

Thomas Benjamin Britton

Keyword(s):

Phase Transformation ◽

Clustering Algorithm ◽

Electron Backscatter Diffraction ◽

Zirconium Alloys ◽

Data Sets ◽

Post Processing ◽

Orientation Relationships ◽

Hexagonal Close Packed ◽

Deformation Properties ◽

Backscatter Diffraction

Materials with an allotropic phase transformation can form microstructures where grains have orientation relationships determined by the transformation history. These microstructures influence the final material properties. In zirconium alloys, there is a solid-state body-centred cubic (b.c.c.) to hexagonal close-packed (h.c.p.) phase transformation, where the crystal orientations of the h.c.p. phase can be related to the parent b.c.c. structure via the Burgers orientation relationship (BOR). In the present work, a reconstruction code, developed for steels and which uses a Markov chain clustering algorithm to analyse electron backscatter diffraction maps, is adapted and applied to the h.c.p./b.c.c. BOR. This algorithm is released as open-source code (via github, as ParentBOR). The algorithm enables new post-processing of the original and reconstructed data sets to analyse the variants of the h.c.p. α phase that are present and understand shared crystal planes and shared lattice directions within each parent β grain; it is anticipated that this will assist in understanding the transformation-related deformation properties of the final microstructure. Finally, the ParentBOR code is compared with recently released reconstruction codes implemented in MTEX to reveal differences and similarities in how the microstructure is described.

A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU–GPU clusters

The Journal of Supercomputing ◽

10.1007/s11227-021-04204-6 ◽

2022 ◽

Author(s):

You Fu ◽

Wei Zhou

Keyword(s):

Biological Networks ◽

Large Scale ◽

Clustering Algorithm ◽

Parallel Implementation ◽

Gpu Clusters ◽

Markov Clustering

clustering algorithm
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Forecasting Trend of Coronavirus Disease 2019 using Multi-Task Weighted TSK Fuzzy System

Root cause analysis of COVID-19 cases by enhanced text mining process

A PSO Enable Multi-Hop Clustering Algorithm for VANET

Research On Parallel Association Rules Mining Of Big Data Based On Improved K-Means Clustering Algorithm

Multi-task Fuzzy Clustering–Based Multi-task TSK Fuzzy System for Text Sentiment Classification

BotSpot++: A Hierarchical Deep Ensemble Model for Bots Install Fraud Detection in Mobile Advertising

Enhancing the Job Scheduling Procedure to Develop an Efficient Cloud Environment using Near Optimal Clustering Algorithm

On a two-stage progressive clustering algorithm with graph-augmented density peak clustering

Effective structural unit analysis in hexagonal close-packed alloys – reconstruction of parent β microstructures and crystal orientation post-processing analysis

A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU–GPU clusters

Export Citation Format

clustering algorithmRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Forecasting Trend of Coronavirus Disease 2019 using Multi-Task Weighted TSK Fuzzy System

Root cause analysis of COVID-19 cases by enhanced text mining process

A PSO Enable Multi-Hop Clustering Algorithm for VANET

Research On Parallel Association Rules Mining Of Big Data Based On Improved K-Means Clustering Algorithm

Multi-task Fuzzy Clustering–Based Multi-task TSK Fuzzy System for Text Sentiment Classification

BotSpot++: A Hierarchical Deep Ensemble Model for Bots Install Fraud Detection in Mobile Advertising

Enhancing the Job Scheduling Procedure to Develop an Efficient Cloud Environment using Near Optimal Clustering Algorithm

On a two-stage progressive clustering algorithm with graph-augmented density peak clustering

Effective structural unit analysis in hexagonal close-packed alloys – reconstruction of parent β microstructures and crystal orientation post-processing analysis

A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU–GPU clusters

clustering algorithm
Recently Published Documents