incremental clustering Latest Research Papers

Cloud4NFICA-Nearness Factor-Based Incremental Clustering Algorithm Using Microsoft Azure for the Analysis of Intelligent Meter Data

10.4018/978-1-6684-3666-0.ch020 ◽

2022 ◽

pp. 423-442

Author(s):

Archana Yashodip Chaudhari ◽

Preeti Mulay

Keyword(s):

Smart Grids ◽

Clustering Algorithm ◽

Incremental Clustering ◽

Clustering Method ◽

Data Types ◽

Infrastructure As A Service ◽

Dynamic Data ◽

Processing Power ◽

Microsoft Azure ◽

Socioeconomic Data

Intelligent electricity meters (IEMs) form a key infrastructure necessary for the growth of smart grids. IEMs generate a considerable amount of electricity data incrementally. However, on an influx of new data, traditional clustering task re-cluster all of the data from scratch. The incremental clustering method is an essential way to solve the problem of clustering with dynamic data. Given the volume of IEM data and the number of data types involved, an incremental clustering method is highly complex. Microsoft Azure provide the processing power necessary to handle incremental clustering analytics. The proposed Cloud4NFICA is a scalable platform of a nearness factor-based incremental clustering algorithm. This research uses the real dataset of Irish households collected by IEMs and related socioeconomic data. Cloud4NFICA is incremental in nature, hence accommodates the influx of new data. Cloud4NFICA was designed as an infrastructure as a service. It is visible from the study that the developed system performs well on the scalability aspect.

Download Full-text

An incremental clustering method for anomaly detection in flight data

Transportation Research Part C Emerging Technologies ◽

10.1016/j.trc.2021.103406 ◽

2021 ◽

Vol 132 ◽

pp. 103406

Author(s):

Weizun Zhao ◽

Lishuai Li ◽

Sameer Alam ◽

Yanjun Wang

Keyword(s):

Anomaly Detection ◽

Incremental Clustering ◽

Clustering Method ◽

Flight Data

Download Full-text

An Incremental Clustering Algorithm with Pattern Drift Detection for IoT-Enabled Smart Grid System

Sensors ◽

10.3390/s21196466 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6466

Author(s):

Zigui Jiang ◽

Rongheng Lin ◽

Fangchun Yang

Keyword(s):

Smart Grid ◽

Data Clustering ◽

Clustering Algorithm ◽

Grid System ◽

Incremental Clustering ◽

Training Time ◽

Evolution Analysis ◽

Validity Indices ◽

Load Pattern ◽

Smart Grid System

The IoT-enabled smart grid system provides smart meter data for electricity consumers to record their energy consumption behaviors, the typical features of which can be represented by the load patterns extracted from load data clustering. The changeability of consumption behaviors requires load pattern update for achieving accurate consumer segmentation and effective demand response. In order to save training time and reduce computation scale, we propose a novel incremental clustering algorithm with probability strategy, ICluster-PS, instead of overall load data clustering to update load patterns. ICluster-PS first conducts new load pattern extraction based on the existing load patterns and new data. Then, it intergrades new load patterns with the existing ones. Finally, it optimizes the intergraded load pattern sets by a further modification. Moreover, ICluster-PS can be performed continuously with new coming data due to parameter updating and generalization. Extensive experiments are implemented on real-world dataset containing diverse consumer types in various districts. The experimental results are evaluated by both clustering validity indices and accuracy measures, which indicate that ICluster-PS outperforms other related incremental clustering algorithm. Additionally, according to the further case studies on pattern evolution analysis, ICluster-PS is able to present any pattern drifts through its incremental clustering results.

Download Full-text

Unleashing analytics to reduce electricity consumption using incremental clustering algorithm

International Journal of Energy Sector Management ◽

10.1108/ijesm-11-2019-0016 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Archana Yashodip Chaudhari ◽

Preeti Mulay

Keyword(s):

Real Time ◽

Clustering Algorithm ◽

Electricity Consumption ◽

Incremental Clustering ◽

Science Data ◽

Load Curve ◽

Data Set ◽

Content Type ◽

Validity Indices ◽

Reduce Electricity Consumption

Purpose To reduce the electricity consumption in our homes, a first step is to make the user aware of it. Reading a meter once in a month is not enough, instead, it requires real-time meter reading. Smart electricity meter (SEM) is capable of providing a quick and exact meter reading in real-time at regular time intervals. SEM generates a considerable amount of household electricity consumption data in an incremental manner. However, such data has embedded load patterns and hidden information to extract and learn consumer behavior. The extracted load patterns from data clustering should be updated because consumer behaviors may be changed over time. The purpose of this study is to update the new clustering results based on the old data rather than to re-cluster all of the data from scratch. Design/methodology/approach This paper proposes an incremental clustering with nearness factor (ICNF) algorithm to update load patterns without overall daily load curve clustering. Findings Extensive experiments are implemented on real-world SEM data of Irish Social Science Data Archive (Ireland) data set. The results are evaluated by both accuracy measures and clustering validity indices, which indicate that proposed method is useful for using the enormous amount of smart meter data to understand customers’ electricity consumption behaviors. Originality/value ICNF can provide an efficient response for electricity consumption patterns analysis to end consumers via SEMs.

Download Full-text

Topic Modelling and Clustering of Disaster-Related Tweets using Bilingual Latent Dirichlet Allocation and Incremental Clustering Algorithm with Support Vector Machines for Need Assessment

10.1109/icsecs52883.2021.00041 ◽

2021 ◽

Author(s):

Lady Angelica Buen Guerzo ◽

Hans Aaron O. Kilkenny ◽

Raphael Noel D. Osorio ◽

Andrei Hart E. Villegas ◽

Charmaine S. Ponay

Keyword(s):

Support Vector Machines ◽

Clustering Algorithm ◽

Latent Dirichlet Allocation ◽

Support Vector ◽

Topic Modelling ◽

Incremental Clustering ◽

Need Assessment ◽

Vector Machines ◽

Dirichlet Allocation

Download Full-text

Lane Detection Using Edge Detection and Spatio-Temporal Incremental Clustering

2021 International Seminar on Intelligent Technology and Its Applications (ISITIA) ◽

10.1109/isitia52817.2021.9502232 ◽

2021 ◽

Author(s):

Sayyidul Aulia Alamsyah ◽

Djoko Purwanto ◽

Muhammad Attamimi

Keyword(s):

Edge Detection ◽

Lane Detection ◽

Incremental Clustering ◽

Spatio Temporal

Download Full-text

Malware Variant Identification Using Incremental Clustering

Electronics ◽

10.3390/electronics10141628 ◽

2021 ◽

Vol 10 (14) ◽

pp. 1628

Author(s):

Paul Black ◽

Iqbal Gondal ◽

Adil Bagirov ◽

Md Moniruzzaman

Keyword(s):

Pattern Matching ◽

Clustering Algorithm ◽

Hybrid Approach ◽

The Novel ◽

Incremental Clustering ◽

Straightforward Method ◽

Matching Technique ◽

Matching Techniques ◽

Acting In Concert ◽

Variant Identification

Dynamic analysis and pattern matching techniques are widely used in industry, and they provide a straightforward method for the identification of malware samples. Yara is a pattern matching technique that can use sandbox memory dumps for the identification of malware families. However, pattern matching techniques fail silently due to minor code variations, leading to unidentified malware samples. This paper presents a two-layered Malware Variant Identification using Incremental Clustering (MVIIC) process and proposes clustering of unidentified malware samples to enable the identification of malware variants and new malware families. The novel incremental clustering algorithm is used in the identification of new malware variants from the unidentified malware samples. This research shows that clustering can provide a higher level of performance than Yara rules, and that clustering is resistant to small changes introduced by malware variants. This paper proposes a hybrid approach, using Yara scanning to eliminate known malware, followed by clustering, acting in concert, to allow the identification of new malware variants. F1 score and V-Measure clustering metrics are used to evaluate our results.

Download Full-text

Computational analysis of incremental clustering approaches for Large Data

International Journal of Computers and Communications ◽

10.46300/91013.2021.15.3 ◽

2021 ◽

Vol 15 ◽

pp. 14-18

Author(s):

Arun Pratap Singh Kushwah ◽

Shailesh Jaloree ◽

Ramjeevan Singh Thakur

Keyword(s):

Data Mining ◽

Clustering Algorithm ◽

Computational Analysis ◽

Large Data ◽

Distance Functions ◽

Spatial Density ◽

Incremental Clustering ◽

Clustering Method ◽

Density Method ◽

Incremental Approach

Clustering is an approach of data mining, which helps us to find the underlying hidden structure in the dataset. K-means is a clustering method which usages distance functions to find the similarities or dissimilarities between the instances. DBSCAN is a clustering algorithm, which discovers the arbitrary shapes & sizes of clusters from huge volume of using spatial density method. These two approaches of clustering are the classical methods for efficient clustering but underperform when the data is updated frequently in the databases so, the incremental or gradual clustering approaches are always preferred in this environment. In this paper, an incremental approach for clustering is introduced using K-means and DBSCAN to handle the new datasets dynamically updated in the database in an interval.

Download Full-text

A Comparative Review of Incremental Clustering Methods for Large Dataset

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/261022021 ◽

2021 ◽

Vol 10 (2) ◽

pp. 643-650

Keyword(s):

Real Time ◽

Search Space ◽

Incremental Clustering ◽

Clustering Methods ◽

Time Data ◽

Large Dataset ◽

Comparative Review ◽

Real Time Visualization ◽

Data Points ◽

Small Clusters

Several algorithms have developed for analyzing large incremental datasets. Incremental algorithms are relatively efficient in dynamic evolving environment to seek out small clusters in large datasets. Many algorithms have devised for limiting the search space, building, and updating arbitrary shaped clusters in large incremented datasets. Within the real time visualization of real time data, when data in motion and growing dynamically, new data points arrive that generates instant cluster labels. In this paper, the comparative review of Incremental clustering methods for large dataset has done.

Download Full-text

DISC: Density-Based Incremental Clustering by Striding over Streaming Data

2021 IEEE 37th International Conference on Data Engineering (ICDE) ◽

10.1109/icde51399.2021.00077 ◽

2021 ◽

Author(s):

Bogyeong Kim ◽

Kyoseung Koo ◽

Juhun Kim ◽

Bongki Moon

Keyword(s):

Streaming Data ◽

Incremental Clustering

Download Full-text

incremental clustering
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Cloud4NFICA-Nearness Factor-Based Incremental Clustering Algorithm Using Microsoft Azure for the Analysis of Intelligent Meter Data

An incremental clustering method for anomaly detection in flight data

An Incremental Clustering Algorithm with Pattern Drift Detection for IoT-Enabled Smart Grid System

Unleashing analytics to reduce electricity consumption using incremental clustering algorithm

Topic Modelling and Clustering of Disaster-Related Tweets using Bilingual Latent Dirichlet Allocation and Incremental Clustering Algorithm with Support Vector Machines for Need Assessment

Lane Detection Using Edge Detection and Spatio-Temporal Incremental Clustering

Malware Variant Identification Using Incremental Clustering

Computational analysis of incremental clustering approaches for Large Data

A Comparative Review of Incremental Clustering Methods for Large Dataset

DISC: Density-Based Incremental Clustering by Striding over Streaming Data

Export Citation Format

incremental clusteringRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Cloud4NFICA-Nearness Factor-Based Incremental Clustering Algorithm Using Microsoft Azure for the Analysis of Intelligent Meter Data

An incremental clustering method for anomaly detection in flight data

An Incremental Clustering Algorithm with Pattern Drift Detection for IoT-Enabled Smart Grid System

Unleashing analytics to reduce electricity consumption using incremental clustering algorithm

Topic Modelling and Clustering of Disaster-Related Tweets using Bilingual Latent Dirichlet Allocation and Incremental Clustering Algorithm with Support Vector Machines for Need Assessment

Lane Detection Using Edge Detection and Spatio-Temporal Incremental Clustering

Malware Variant Identification Using Incremental Clustering

Computational analysis of incremental clustering approaches for Large Data

A Comparative Review of Incremental Clustering Methods for Large Dataset

DISC: Density-Based Incremental Clustering by Striding over Streaming Data

incremental clustering
Recently Published Documents