EMR: Scalable Clustering of Big HR Data using Evolutionary MapReduce

Companion Proceedings of the Web Conference 2021 ◽

10.1145/3442442.3453543 ◽

2021 ◽

Author(s):

Mahdi Bohlouli ◽

Zhonghua He

Keyword(s):

Scalable Clustering

Effective and Scalable Clustering on Massive Attributed Graphs

Proceedings of the Web Conference 2021 ◽

10.1145/3442381.3449875 ◽

2021 ◽

Author(s):

Renchi Yang ◽

Jieming Shi ◽

Yin Yang ◽

Keke Huang ◽

Shiqi Zhang ◽

...

Keyword(s):

Scalable Clustering ◽

Attributed Graphs

Multi-level clustering protocol for load-balanced and scalable clustering in large-scale wireless sensor networks

The Journal of Supercomputing ◽

10.1007/s11227-018-2727-5 ◽

2018 ◽

Vol 75 (7) ◽

pp. 3712-3739 ◽

Author(s):

Harmanpreet Singh ◽

Damanpreet Singh

Keyword(s):

Wireless Sensor Networks ◽

Sensor Networks ◽

Large Scale ◽

Wireless Sensor ◽

Scalable Clustering ◽

Clustering Protocol ◽

Multi Level ◽

DPM: Fast and scalable clustering algorithm for large scale high dimensional datasets

2014 10th International Computer Engineering Conference (ICENCO) ◽

10.1109/icenco.2014.7050427 ◽

2014 ◽

Author(s):

Tamer F. Ghanem ◽

Wail S. Elkilani ◽

Hatem S. Ahmed ◽

Mohiy M. Hadhoud

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

High Dimensional ◽

Scalable Clustering ◽

High Dimensional Datasets

Scalable Clustering of High Dimensional Data

Between Data Science and Applied Data Analysis - Studies in Classification, Data Analysis, and Knowledge Organization ◽

10.1007/978-3-642-18991-3_7 ◽

2003 ◽

pp. 57-64

Author(s):

David Littau ◽

Daniel Boley

Keyword(s):

High Dimensional Data ◽

High Dimensional ◽

Scalable Clustering

STiMR k-Means: An Efficient Clustering Method for Big Data

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001419500137 ◽

2019 ◽

Vol 33 (08) ◽

pp. 1950013 ◽

Author(s):

Mohamed Aymen Ben HajKacem ◽

Chiheb-Eddine Ben N′Cir ◽

Nadia Essoussi

Keyword(s):

Big Data ◽

Triangle Inequality ◽

Computational Cost ◽

Internal Validity ◽

Clustering Methods ◽

Clustering Method ◽

Scalable Clustering ◽

Acceleration Techniques ◽

Clustering Quality ◽

Important Challenge

Big Data clustering has become an important challenge in data analysis, since several applications require scalable clustering methods to organize such data into groups of similar objects. Given the computational cost of most existing clustering methods, we propose in this paper a new clustering method, referred to as STiMR k-means, that provides a good tradeoff between scalability and clustering quality. The proposed method combines three acceleration techniques: sampling, triangle inequality and MapReduce. Sampling reduces the number of data points used when building cluster prototypes, the triangle inequality reduces the number of comparisons needed when looking for the nearest clusters, and MapReduce provides a parallel framework for running the proposed method. Experiments performed on simulated and real datasets have shown the effectiveness of the proposed method, compared with the existing ones, in terms of running time, scalability and internal validity measures.
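
As a rough illustration of two of the three techniques named in this abstract, the following minimal Python sketch shows sampling before prototype construction and triangle-inequality pruning during nearest-center search. It is not the authors' STiMR implementation: the function names (`assign_with_pruning`, `brute_force_assign`, `kmeans_on_sample`), the parameters, and the specific pruning test (skip center c_j when d(c_best, c_j) >= 2 * d(p, c_best)) are assumptions chosen for illustration, and the MapReduce layer is omitted, so everything runs in a single process.

```python
import math
import random


def dist(a, b):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def assign_with_pruning(points, centers):
    """Assign each point to its nearest center, skipping distance
    computations that the triangle inequality proves unnecessary.

    Returns (labels, number_of_point_to_center_distances_computed).
    """
    k = len(centers)
    # Center-to-center distances are computed once per call.
    cc = [[dist(centers[i], centers[j]) for j in range(k)] for i in range(k)]
    labels, computed = [], 0
    for p in points:
        best, best_d = 0, dist(p, centers[0])
        computed += 1
        for j in range(1, k):
            # If d(c_best, c_j) >= 2 * d(p, c_best), then
            # d(p, c_j) >= d(c_best, c_j) - d(p, c_best) >= d(p, c_best),
            # so c_j cannot be strictly closer and is skipped.
            if cc[best][j] >= 2 * best_d:
                continue
            d = dist(p, centers[j])
            computed += 1
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels, computed


def brute_force_assign(points, centers):
    """Reference assignment that computes every distance."""
    return [min(range(len(centers)), key=lambda j: dist(p, centers[j]))
            for p in points]


def kmeans_on_sample(points, k, sample_size, iters=10, seed=0):
    """Build cluster prototypes from a random sample only (the sampling step)."""
    rng = random.Random(seed)
    sample = rng.sample(points, min(sample_size, len(points)))
    centers = rng.sample(sample, k)
    for _ in range(iters):
        labels, _ = assign_with_pruning(sample, centers)
        for j in range(k):
            members = [p for p, l in zip(sample, labels) if l == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centers
```

In this sketch the pruned assignment returns the same labels as the brute-force version while computing fewer point-to-center distances whenever the centers are well separated. In a MapReduce setting, the assignment step would plausibly run in the mappers and the center update in the reducers, but that split is an assumption here, not something the abstract specifies.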

Modeling of mobile satellite channels by scalable clustering algorithm

Vehicular Technology Conference. IEEE 55th Vehicular Technology Conference. VTC Spring 2002 (Cat. No.02CH37367) ◽

10.1109/vtc.2002.1002974 ◽

2003 ◽

Author(s):

L. Husson ◽

J.C. Dany ◽

S. Chambon ◽

K. Berradi ◽

A. Beffani

Keyword(s):

Clustering Algorithm ◽

Scalable Clustering ◽

Satellite Channels ◽

Mobile Satellite

Scalable Clustering for Large High-Dimensional Data Based on Data Summarization

2007 IEEE Symposium on Computational Intelligence and Data Mining ◽

10.1109/cidm.2007.368910 ◽

2007 ◽

Author(s):

Ying Lai ◽

Ratko Orlandic ◽

Wai Gen Yee ◽

Sachin Kulkarni

Keyword(s):

High Dimensional Data ◽

High Dimensional ◽

Data Summarization ◽

Scalable Clustering

Transform Residual K-Means Trees for Scalable Clustering

2013 IEEE 13th International Conference on Data Mining Workshops ◽

10.1109/icdmw.2013.110 ◽

2013 ◽

Author(s):

Jiangbo Yuan ◽

Xiuwen Liu

Keyword(s):

Scalable Clustering

Towards scalable clustering of infrastructured mobile ad hoc networks

IEEE/Sarnoff Symposium on Advances in Wired and Wireless Communication, 2005. ◽

10.1109/sarnof.2005.1426516 ◽

2006 ◽

Author(s):

A.M. Mahdy ◽

J.S. Deogun ◽

Jun Wang

Keyword(s):

Ad Hoc Networks ◽

Mobile Ad Hoc Networks ◽

Ad Hoc ◽

Scalable Clustering ◽

Mobile Ad Hoc

A Scalable Clustering Algorithm in Dense Mobile Sensor Networks

Journal of Networks ◽

10.4304/jnw.6.3.505-512 ◽

2011 ◽

Vol 6 (3) ◽

Author(s):

Jianbo Li ◽

Shan Jiang

Keyword(s):

Sensor Networks ◽

Clustering Algorithm ◽

Mobile Sensor Networks ◽

Mobile Sensor ◽

Scalable Clustering
