An incremental clustering method of micro-blog topic detection

Intelligent electricity meters (IEMs) form a key infrastructure necessary for the growth of smart grids. IEMs generate a considerable amount of electricity data incrementally. However, on an influx of new data, traditional clustering task re-cluster all of the data from scratch. The incremental clustering method is an essential way to solve the problem of clustering with dynamic data. Given the volume of IEM data and the number of data types involved, an incremental clustering method is highly complex. Microsoft Azure provide the processing power necessary to handle incremental clustering analytics. The proposed Cloud4NFICA is a scalable platform of a nearness factor-based incremental clustering algorithm. This research uses the real dataset of Irish households collected by IEMs and related socioeconomic data. Cloud4NFICA is incremental in nature, hence accommodates the influx of new data. Cloud4NFICA was designed as an infrastructure as a service. It is visible from the study that the developed system performs well on the scalability aspect.

Download Full-text

Topic Detection from Short Text: A Term-based Consensus Clustering method

2016 13th International Conference on Service Systems and Service Management (ICSSSM) ◽

10.1109/icsssm.2016.7538624 ◽

2016 ◽

Author(s):

Hao Lin ◽

Bo Sun ◽

Junjie Wu ◽

Haitao Xiong

Keyword(s):

Topic Detection ◽

Consensus Clustering ◽

Clustering Method ◽

Short Text

Download Full-text

Automatic Topic Detection with an Incremental Clustering Algorithm

Web Information Systems and Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-642-16515-3_43 ◽

2010 ◽

pp. 344-351 ◽

Cited By ~ 1

Author(s):

Xiaoming Zhang ◽

Zhoujun Li

Keyword(s):

Clustering Algorithm ◽

Topic Detection ◽

Incremental Clustering

Download Full-text

Hot Topic Detection on Twitter Data Streams with Incremental Clustering Using Named Entities and Central Centroids

2019 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF) ◽

10.1109/rivf.2019.8713730 ◽

2019 ◽

Author(s):

Son Nguyen ◽

Bao Ngo ◽

Chau Vo ◽

Tru Cao

Keyword(s):

Data Streams ◽

Topic Detection ◽

Incremental Clustering ◽

Named Entities ◽

Twitter Data

Download Full-text

Computational analysis of incremental clustering approaches for Large Data

International Journal of Computers and Communications ◽

10.46300/91013.2021.15.3 ◽

2021 ◽

Vol 15 ◽

pp. 14-18

Author(s):

Arun Pratap Singh Kushwah ◽

Shailesh Jaloree ◽

Ramjeevan Singh Thakur

Keyword(s):

Data Mining ◽

Clustering Algorithm ◽

Computational Analysis ◽

Large Data ◽

Distance Functions ◽

Spatial Density ◽

Incremental Clustering ◽

Clustering Method ◽

Density Method ◽

Incremental Approach

Clustering is an approach of data mining, which helps us to find the underlying hidden structure in the dataset. K-means is a clustering method which usages distance functions to find the similarities or dissimilarities between the instances. DBSCAN is a clustering algorithm, which discovers the arbitrary shapes & sizes of clusters from huge volume of using spatial density method. These two approaches of clustering are the classical methods for efficient clustering but underperform when the data is updated frequently in the databases so, the incremental or gradual clustering approaches are always preferred in this environment. In this paper, an incremental approach for clustering is introduced using K-means and DBSCAN to handle the new datasets dynamically updated in the database in an interval.

Download Full-text