MiCS-P:Parallel Mutual-information Computation of Big Categorical Data on Spark

Journal of Parallel and Distributed Computing ◽

10.1016/j.jpdc.2021.12.002 ◽

2021 ◽

Author(s):

Junli Li ◽

Chaowei Zhang ◽

Jifu Zhang ◽

Xiao Qin ◽

Lihua Hu

Keyword(s):

Mutual Information ◽

Categorical Data

Download Full-text

Mutual information and redundancy for categorical data

Statistical Papers ◽

10.1007/s00362-009-0196-x ◽

2009 ◽

Vol 52 (1) ◽

pp. 17-31 ◽

Author(s):

Chong Sun Hong ◽

Beom Jun Kim

Keyword(s):

Mutual Information ◽

Categorical Data

Download Full-text

Unsupervised feature selection for outlier detection in categorical data using mutual information

2012 12th International Conference on Hybrid Intelligent Systems (HIS) ◽

10.1109/his.2012.6421343 ◽

2012 ◽

Author(s):

N N R Ranga Suri ◽

M Narasimha Murty ◽

G Athithan

Keyword(s):

Feature Selection ◽

Mutual Information ◽

Outlier Detection ◽

Categorical Data ◽

Unsupervised Feature Selection ◽

Download Full-text

Time Series of Categorical Data Using Auto-Mutual Information with Application of Fitting an AR(2) Model

Advances in Multivariate Statistical Methods - Statistical Science and Interdisciplinary Research ◽

10.1142/9789812838247_0025 ◽

2009 ◽

pp. 421-435 ◽

Author(s):

Atanu Biswas ◽

Apratim Guha

Keyword(s):

Time Series ◽

Mutual Information ◽

Categorical Data

Download Full-text

k-ANMI: A mutual information based clustering algorithm for categorical data

Information Fusion ◽

10.1016/j.inffus.2006.05.006 ◽

2008 ◽

Vol 9 (2) ◽

pp. 223-233 ◽

Author(s):

Zengyou He ◽

Xiaofei Xu ◽

Shengchun Deng

Keyword(s):

Mutual Information ◽

Categorical Data ◽

Clustering Algorithm

Download Full-text

Mutual Information and Redundancy for Categorical Data

Communications for Statistical Applications and Methods ◽

10.5351/ckss.2006.13.2.297 ◽

2006 ◽

Vol 13 (2) ◽

pp. 297-307

Author(s):

Chong-Sun Hong ◽

Beom-Jun Kim

Keyword(s):

Mutual Information ◽

Categorical Data

Download Full-text

G-ANMI: A mutual information based genetic clustering algorithm for categorical data

Knowledge-Based Systems ◽

10.1016/j.knosys.2009.11.001 ◽

2010 ◽

Vol 23 (2) ◽

pp. 144-149 ◽

Author(s):

Shengchun Deng ◽

Zengyou He ◽

Xiaofei Xu

Keyword(s):

Mutual Information ◽

Categorical Data ◽

Clustering Algorithm ◽

Genetic Clustering

Download Full-text

Time series analysis of categorical data using auto-mutual information

Journal of Statistical Planning and Inference ◽

10.1016/j.jspi.2009.02.009 ◽

2009 ◽

Vol 139 (9) ◽

pp. 3076-3087 ◽

Author(s):

Atanu Biswas ◽

Apratim Guha

Keyword(s):

Time Series ◽

Mutual Information ◽

Time Series Analysis ◽

Categorical Data ◽

Series Analysis

Download Full-text

Mutual Information Kullback-Leibler Divergence based for Clustering Categorical Data

JOIV International Journal on Informatics Visualization ◽

10.30630/joiv.5.1.462 ◽

2021 ◽

Vol 5 (1) ◽

Author(s):

Iwan Tri Riyadi Yanto ◽

Ririn Setiyowati ◽

Edi Sutoyo ◽

Nur Azizah ◽

Rasyidah

Keyword(s):

Mutual Information ◽

Categorical Data ◽

Euclidean Distance ◽

Clustering Algorithm ◽

Product Distribution ◽

Comparison Results ◽

Leibler Divergence ◽

Clustering Quality ◽

Multiple Clusters

Clustering is a process of grouping a set of objects into multiple clusters, so that the collection of similar objects will be grouped into the same cluster and dissimilar objects will be grouped into other clusters. Fuzzy k-means algorithm is one of clustering algorithm by partitioning data into k clusters employing Euclidean distance as a distance function. This research discusses clustering categorical data using Fuzzy k-Means Kullback-Leibler Divergence. In the determination of the distance between data and center of cluster uses mutual information known as Kullback-Leibler Divergence distance between the joint distribution and the product distribution from two marginal distributions. Extensive theoretical analysis was performed to show the effectiveness of the proposed method. Moreover, the comparison results of the proposed method with Fuzzy Centroid and Fuzzy k-Partition approaches in terms of response time and clustering accuracy were also performed employing several datasets from UCI Machine Learning. The experiment results show that the proposed algorithm provides good results both from clustering quality and accuracy for clustering categorical data as compared to Fuzzy Centroid and Fuzzy k-Partition.

Download Full-text

Computing Mutual Information of Big Categorical Data and Its Application to Feature Grouping

2020 IEEE 36th International Conference on Data Engineering (ICDE) ◽

10.1109/icde48307.2020.00210 ◽

2020 ◽

Author(s):

Junli Li ◽

Chaowei Zhang ◽

Jifu Zhang ◽

Xiao Qin

Keyword(s):

Mutual Information ◽

Categorical Data ◽

Feature Grouping

Download Full-text

The maximum mutual information without coding for binary quantum-state signals

Journal of Modern Optics ◽

10.1080/095003498151960 ◽

1998 ◽

Vol 45 (2) ◽

pp. 269-282 ◽

Author(s):

MASAO OSAKI, OSAMU HIROTA MASASHI BAN

Keyword(s):

Mutual Information ◽

Quantum State ◽

Maximum Mutual Information

Download Full-text