Cluster ensemble selection using balanced normalized mutual information

2020 ◽  
Vol 39 (3) ◽  
pp. 3033-3055
Author(s):  
Zecong Wang ◽  
Hamid Parvin ◽  
Sultan Noman Qasem ◽  
Bui Anh Tuan ◽  
Kim-Hung Pho

The main idea of cluster ensemble selection is to remove bad partitions from the final ensemble. However, a discarded partition may still contain some reliable clusters, so it can be more reasonable to apply the selection phase at the cluster level. Doing so requires a cluster evaluation metric. Several such metrics have recently been introduced, each with its own limitations; this paper addresses the weak points of each and then introduces a new cluster assessment measure, the Balanced Normalized Mutual Information (BNMI) criterion, which compensates for the deficiencies of traditional NMI-based criteria. In addition, a new cluster ensemble approach is proposed. To create the consensus partition from the selected clusters, several families of aggregation functions (also called consensus functions) are used: those based on the co-association matrix (CAM), those based on hypergraph partitioning algorithms, and those based on an intermediate feature space. The experimental study indicates that the proposed approach outperforms state-of-the-art cluster ensemble methods.
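
The BNMI criterion itself is defined in the paper and is not reproduced here; the sketch below only illustrates the CAM-based family of consensus functions mentioned above: average the co-cluster indicator over all base partitions, then cluster 1 − CAM as a distance matrix. The function names and the average-linkage choice are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform

def co_association_matrix(partitions):
    """Fraction of base partitions in which each pair of points co-clusters."""
    n = len(partitions[0])
    cam = np.zeros((n, n))
    for labels in partitions:
        labels = np.asarray(labels)
        cam += (labels[:, None] == labels[None, :]).astype(float)
    return cam / len(partitions)

def cam_consensus(partitions, n_clusters):
    """Consensus partition: average-linkage clustering of 1 - CAM distances."""
    dist = 1.0 - co_association_matrix(partitions)
    np.fill_diagonal(dist, 0.0)
    Z = linkage(squareform(dist, checks=False), method="average")
    return fcluster(Z, t=n_clusters, criterion="maxclust")

# Example: three base partitions of six points
parts = [[0, 0, 0, 1, 1, 1],
         [0, 0, 1, 1, 2, 2],
         [0, 0, 0, 0, 1, 1]]
print(cam_consensus(parts, n_clusters=2))
```

A cluster-level selection step, as the paper advocates, would filter individual clusters before accumulating them into the CAM rather than keeping or dropping whole partitions.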

2020 ◽  
Vol 176 (1) ◽  
pp. 79-102
Author(s):  
Chenyue Zhao ◽  
Hosein Alizadeh ◽  
Behrouz Minaei ◽  
Majid Mohamadpoor ◽  
Hamid Parvin ◽  
...  

This paper studies the cluster ensemble selection problem for unsupervised learning. Given a large ensemble of clustering solutions, our goal is to select a subset of solutions that forms a smaller yet better-performing cluster ensemble than using all available solutions. The common way of aggregating the chosen solutions is to accumulate the information of the selected results into a similarity matrix. This paper suggests transforming the similarity matrix into a modularity matrix and then applying a new consensus function that optimizes the modularity measure over it. We represent the modularity maximization problem as a 0-1 quadratic program, which can be solved exactly for small datasets. We also establish a new greedy algorithm, sum linkage, to optimize the objective function quickly on large-scale datasets. We show that the proposed consensus partition gets much closer to the actual cluster structure than the partitions obtained from the direct application of common cluster ensemble methods. Promising results compared with the most widely cited consensus functions demonstrate the efficiency of the proposed method.
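
As a rough illustration of the similarity-to-modularity idea, the sketch below applies the standard Newman transform B = S - k k^T / (2m) and a generic greedy agglomeration over the modularity gain. This is an assumption-laden stand-in for intuition only, not the paper's 0-1 quadratic program or its sum-linkage algorithm.

```python
import numpy as np

def modularity_matrix(S):
    """Newman's modularity matrix B = S - k k^T / (2m), where k is the
    weighted degree vector and 2m the total weight of S."""
    S = np.asarray(S, dtype=float)
    k = S.sum(axis=1)
    return S - np.outer(k, k) / k.sum()

def greedy_modularity_partition(S):
    """Agglomerative heuristic: repeatedly merge the pair of clusters whose
    union yields the largest modularity gain; stop when no merge helps.
    (Illustrative only; the paper's sum-linkage algorithm may differ.)"""
    S = np.asarray(S, dtype=float)
    B, two_m = modularity_matrix(S), S.sum()
    labels = np.arange(len(S))
    while True:
        best_gain, best_pair = 0.0, None
        ids = np.unique(labels)
        for ai, a in enumerate(ids):
            for b in ids[ai + 1:]:
                # Merging a and b adds the cross-cluster B mass to Q
                gain = 2.0 * B[np.ix_(labels == a, labels == b)].sum() / two_m
                if gain > best_gain:
                    best_gain, best_pair = gain, (a, b)
        if best_pair is None:
            return labels
        labels[labels == best_pair[1]] = best_pair[0]

# Example: a block-structured similarity matrix with two natural groups
S = np.array([[0, .9, .8, .1, .0, .1],
              [.9, 0, .7, .0, .1, .0],
              [.8, .7, 0, .1, .0, .1],
              [.1, .0, .1, 0, .9, .8],
              [.0, .1, .0, .9, 0, .7],
              [.1, .0, .1, .8, .7, 0]])
print(greedy_modularity_partition(S))
```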


2008 ◽  
Vol 1 (3) ◽  
pp. 128-141 ◽  
Author(s):  
Xiaoli Z. Fern ◽  
Wei Lin

2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Mostafa El Habib Daho ◽  
Nesma Settouti ◽  
Mohammed El Amine Bechar ◽  
Amina Boublenza ◽  
Mohammed Amine Chikh

Purpose
Ensemble methods have been widely used in the field of pattern recognition because of the difficulty of finding a single classifier that performs well on a wide variety of problems. Despite the effectiveness of these techniques, studies have shown that ensemble methods generate a large number of hypotheses and, in most cases, contain redundant classifiers. Several works in the state of the art attempt to reduce the set of hypotheses without affecting performance.

Design/methodology/approach
In this work, the authors propose a pruning method that takes into consideration the correlation between classifiers and classes, and of each classifier with the rest of the set. The authors use the random forest algorithm as the tree-based ensemble classifier, and pruning is performed by a technique inspired by the CFS (correlation-based feature selection) algorithm.

Findings
The proposed method, CES (Correlation-based Ensemble Selection), was evaluated on ten datasets from the UCI machine learning repository, and its performance was compared to six ensemble pruning techniques. The results show that the proposed pruning method selects a small ensemble in a small amount of time while improving classification rates compared to the state-of-the-art methods.

Originality/value
CES is a new ordering-based method that uses the CFS algorithm. CES selects, in a short time, a small sub-ensemble that outperforms results obtained from the whole forest and the other state-of-the-art techniques used in this study.
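
The exact CES procedure is not given in the abstract, so the following is only a hedged sketch of the general recipe it describes: score subsets of a random forest's trees with a CFS-style merit (high tree-class agreement, low tree-tree correlation) and grow the sub-ensemble greedily. The merit formula, the use of prediction agreement as the "correlation", and the validation-split protocol are all illustrative assumptions, not the authors' method.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

def cfs_merit(subset, tree_preds, y, corr):
    """CFS-style merit: rewards high mean tree-class agreement and
    penalizes high mean tree-tree correlation within the subset."""
    k = len(subset)
    r_cf = np.mean([np.mean(tree_preds[i] == y) for i in subset])
    if k == 1:
        return r_cf
    r_ff = np.mean([corr[i, j] for i in subset for j in subset if i < j])
    return (k * r_cf) / np.sqrt(k + k * (k - 1) * r_ff)

X, y = load_iris(return_X_y=True)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)
forest = RandomForestClassifier(n_estimators=50, random_state=0).fit(X_tr, y_tr)
preds = np.array([t.predict(X_val) for t in forest.estimators_])
# Pairwise prediction agreement between trees, standing in for correlation
corr = np.mean(preds[:, None, :] == preds[None, :, :], axis=2)

# Greedy forward selection maximizing the merit; stop when it declines
selected, remaining = [], list(range(len(preds)))
while remaining:
    best_merit, best_i = max(
        (cfs_merit(selected + [i], preds, y_val, corr), i) for i in remaining)
    if selected and best_merit <= cfs_merit(selected, preds, y_val, corr):
        break
    selected.append(best_i)
    remaining.remove(best_i)
print(f"kept {len(selected)} of {len(preds)} trees")
```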

