An efficient distributed hierarchical-clustering algorithm for large scale data

2010 International Computer Symposium (ICS2010) ◽

10.1109/compsym.2010.5685388 ◽

2010 ◽

Author(s):

Cheng-Hsien Tang ◽

An-Ching Huang ◽

Meng-Feng Tsai ◽

Wei-Jen Wang

Keyword(s):

Hierarchical Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Large Scale Data ◽

Hierarchical Clustering Algorithm ◽

Download Full-text

A stratified sampling based clustering algorithm for large-scale data

Knowledge-Based Systems ◽

10.1016/j.knosys.2018.09.007 ◽

2019 ◽

Vol 163 ◽

pp. 416-428 ◽

Author(s):

Xingwang Zhao ◽

Jiye Liang ◽

Chuangyin Dang

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Stratified Sampling ◽

Large Scale Data ◽

Download Full-text

The Research on Large Scale Data Set Clustering Algorithm Based on Tag Set

Communications in Computer and Information Science - Computational Intelligence and Intelligent Systems ◽

10.1007/978-981-10-0356-1_38 ◽

2016 ◽

pp. 365-372

Author(s):

Qiang Chen

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Data Set ◽

Large Scale Data ◽

Download Full-text

An Efficient K-Medoids Clustering Algorithm for Large Scale Data

Machine Learning-based Natural Scene Recognition for Mobile Robot Localization in An Unknown Environment ◽

10.1007/978-981-13-9217-7_5 ◽

2019 ◽

pp. 85-108

Author(s):

Xiaochun Wang ◽

Xiali Wang ◽

Don Mitchell Wilkes

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Large Scale Data ◽

Download Full-text

Using Uncertain DM-Chameleon Clustering Algorithm Based on Machine Learning to Predict Landslide Hazards

Journal of Robotics and Mechatronics ◽

10.20965/jrm.2019.p0329 ◽

2019 ◽

Vol 31 (2) ◽

pp. 329-338 ◽

Author(s):

Jian Hu ◽

Haiwan Zhu ◽

Yimin Mao ◽

Canlong Zhang ◽

Tian Liang ◽

...

Keyword(s):

Machine Learning ◽

Large Scale ◽

Clustering Algorithm ◽

Uncertain Data ◽

Landslide Hazard ◽

Data Sets ◽

Large Scale Data ◽

Landslide Hazards ◽

Hazard Levels ◽

Landslide hazard prediction is a difficult, time-consuming process when traditional methods are used. This paper presents a method that uses machine learning to predict landslide hazard levels automatically. Due to difficulties in obtaining and effectively processing rainfall in landslide hazard prediction, and to the existing limitation in dealing with large-scale data sets in the M-chameleon algorithm, a new method based on an uncertain DM-chameleon algorithm (developed M-chameleon) is proposed to assess the landslide susceptibility model. First, this method designs a new two-phase clustering algorithm based on M-chameleon, which effectively processes large-scale data sets. Second, the new E-H distance formula is designed by combining the Euclidean and Hausdorff distances, and this enables the new method to manage uncertain data effectively. The uncertain data model is presented at the same time to effectively quantify triggering factors. Finally, the model for predicting landslide hazards is constructed and verified using the data from the Baota district of the city of Yan’an, China. The experimental results show that the uncertain DM-chameleon algorithm of machine learning can effectively improve the accuracy of landslide prediction and has high feasibility. Furthermore, the relationships between hazard factors and landslide hazard levels can be extracted based on clustering results.

Download Full-text

Large-scale data clustering algorithm based on quantum immune regulation network

2017 IEEE Symposium Series on Computational Intelligence (SSCI) ◽

10.1109/ssci.2017.8285302 ◽

2017 ◽

Author(s):

Yangyang Li ◽

Xiaoyu Bai ◽

Xiaoju Hou ◽

Licheng Jiao

Keyword(s):

Immune Regulation ◽

Data Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Large Scale Data ◽

Regulation Network ◽

Download Full-text

A EM Probabilistic Clustering Algorithm for Large Scale Data Sets based on Partial Constraints Information

INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences ◽

10.4156/aiss.vol3.issue10.3 ◽

2011 ◽

Vol 3 (10) ◽

pp. 20-29

Author(s):

Shen Yan ◽

Song Shunlin ◽

Zhu Yuquan

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Data Sets ◽

Probabilistic Clustering ◽

Large Scale Data ◽

Large Scale Data Sets

Download Full-text

A distributed hierarchical clustering algorithm for large-scale dynamic networks

Proceedings of the 8th ACM workshop on Performance monitoring and measurement of heterogeneous wireless and wired networks - PM2HW2N '13 ◽

10.1145/2512840.2512868 ◽

2013 ◽

Author(s):

François Avril ◽

Alain Bui ◽

Devan Sohier

Keyword(s):

Hierarchical Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Dynamic Networks ◽

Hierarchical Clustering Algorithm

Download Full-text

Part Priority Clustering Algorithm for Large-Scale Data Set

2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics ◽

10.1109/ihmsc.2013.100 ◽

2013 ◽

Author(s):

Zhihao Yin ◽

Bencheng Yu ◽

Zhifeng Wang ◽

Wang Ran

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

Data Set ◽

Large Scale Data ◽

Download Full-text

Privacy-preserving constrained spectral clustering algorithm for large-scale data sets

IET Information Security ◽

10.1049/iet-ifs.2019.0255 ◽

2020 ◽

Vol 14 (3) ◽

pp. 321-331 ◽

Author(s):

Ji Li ◽

Jianghong Wei ◽

Mao Ye ◽

Wenfen Liu ◽

Xuexian Hu

Keyword(s):

Spectral Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Privacy Preserving ◽

Data Sets ◽

Large Scale Data ◽

Spectral Clustering Algorithm ◽

Large Scale Data Sets

Download Full-text

A fast hierarchical clustering algorithm for large-scale protein sequence data sets

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2014.02.016 ◽

2014 ◽

Vol 48 ◽

pp. 94-101 ◽

Author(s):

Sándor M. Szilágyi ◽

László Szilágyi

Keyword(s):

Hierarchical Clustering ◽

Protein Sequence ◽

Large Scale ◽

Clustering Algorithm ◽

Sequence Data ◽

Data Sets ◽

Protein Sequence Data ◽

Hierarchical Clustering Algorithm

Download Full-text