Multiple Clusterings: Latest Research Papers

Inductive Multi-view Multiple Clusterings

10.1109/bigdia53151.2021.9619704 ◽

2021 ◽

Author(s):

Shaowei Wei ◽

Guangyang Han ◽

Runmin Wang ◽

Yuanlin Yang ◽

Huiling Zhang ◽

...

Keyword(s):

Multiple Clusterings

Multiple clusterings of heterogeneous information networks

Machine Learning ◽

10.1007/s10994-021-06000-y ◽

2021 ◽

Author(s):

Shaowei Wei ◽

Guoxian Yu ◽

Jun Wang ◽

Carlotta Domeniconi ◽

Xiangliang Zhang

Keyword(s):

Information Networks ◽

Heterogeneous Information ◽

Heterogeneous Information Networks ◽

Multiple Clusterings

Tensor Train-Based Multiple Clusterings for Big Data in Cyber-Physical-Social Systems and Its Efficient Implementations

IEEE Transactions on Network Science and Engineering ◽

10.1109/tnse.2021.3119324 ◽

2021 ◽

pp. 1-1

Author(s):

Yaliang Zhao ◽

Laurence T. Yang ◽

Yiwen Zhang ◽

Jiayu Sun ◽

Xiaojing Wang ◽

...

Keyword(s):

Big Data ◽

Social Systems ◽

Multiple Clusterings

EpiMC: Detecting Epistatic Interactions using Multiple Clusterings

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2021.3080462 ◽

2021 ◽

pp. 1-1

Author(s):

Jun Wang ◽

Huiling Zhang ◽

Wei Ren ◽

Maozu Guo ◽

Guoxian Yu

Keyword(s):

Epistatic Interactions ◽

Multiple Clusterings

Deep Incomplete Multi-view Multiple Clusterings

2020 IEEE International Conference on Data Mining (ICDM) ◽

10.1109/icdm50108.2020.00074 ◽

2020 ◽

Author(s):

Shaowei Wei ◽

Jun Wang ◽

Guoxian Yu ◽

Carlotta Domeniconi ◽

Xiangliang Zhang

Keyword(s):

Multiple Clusterings

A Clustering Refinement Approach for Revealing Urban Spatial Structure from Smart Card Data

Applied Sciences ◽

10.3390/app10165606 ◽

2020 ◽

Vol 10 (16) ◽

pp. 5606

Author(s):

Liyang Tang ◽

Yang Zhao ◽

Kwok Leung Tsui ◽

Yuxin He ◽

Liwei Pan

Keyword(s):

Spatial Structure ◽

Smart Card ◽

Rapid Development ◽

Urban Spatial Structure ◽

Data Intensive ◽

Multiple Clusterings ◽

Sensing Technology ◽

Smart Card Data ◽

Government Planning ◽

Subway Stations

Facilitated by the rapid development of data-intensive techniques together with communication and sensing technology, we can take advantage of smart card data collected through Automatic Fare Collection (AFC) systems to establish connections between public transit and urban spatial structure. In this paper, with a case study of the Shenzhen metro system in China, we investigate the agglomeration pattern of passenger flow among subway stations. Specifically, leveraging inbound and outbound passenger flows at subway stations, we propose a clustering refinement approach based on cluster member stability among multiple clusterings produced by isomorphic or heterogeneous clusterers. Furthermore, we validate and elaborate on five clusters of subway stations in terms of regional functionality and urban planning by comparing the station clusters against the government planning policies and regulations of Shenzhen city. Additionally, outlier stations with ambiguous functionalities are detected using the proposed clustering refinement framework.
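The abstract does not spell out how cluster member stability is computed. As a rough illustration only, the sketch below assumes one common proxy: a co-association matrix over several clusterings, with a per-item score that is high when the item's pairwise relationships are consistent across runs. The function names, the scoring rule, and the toy labelings are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def co_association(labelings: Sequence[Sequence[int]]) -> List[List[float]]:
    """Fraction of clusterings in which items i and j share a cluster."""
    n = len(labelings[0])
    m = len(labelings)
    return [
        [sum(labels[i] == labels[j] for labels in labelings) / m for j in range(n)]
        for i in range(n)
    ]


def member_stability(labelings: Sequence[Sequence[int]]) -> List[float]:
    """Assumed stability score: for each item, the average agreement level
    max(c, 1 - c) over its co-association values c with every other item.
    A score of 1.0 means every clustering treats the item the same way."""
    co = co_association(labelings)
    n = len(co)
    return [
        sum(max(co[i][j], 1.0 - co[i][j]) for j in range(n) if j != i) / (n - 1)
        for i in range(n)
    ]


# Three hypothetical clusterings of five items (cluster ids are arbitrary per run).
labelings = [
    [0, 0, 1, 1, 0],
    [0, 0, 1, 1, 1],
    [1, 1, 0, 0, 0],
]
stability = member_stability(labelings)
# The least stable item is a candidate "ambiguous" member.
least_stable = min(range(len(stability)), key=stability.__getitem__)
```

In this toy input, the last item switches groups between runs, so it receives the lowest score; in the paper's setting such items would correspond to stations with ambiguous functionality, although the authors' actual criterion may differ.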

Multi-View Multiple Clusterings Using Deep Matrix Factorization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6104 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6348-6355 ◽

Cited By ~ 1

Author(s):

Shaowei Wei ◽

Jun Wang ◽

Guoxian Yu ◽

Carlotta Domeniconi ◽

Xiangliang Zhang

Keyword(s):

Matrix Factorization ◽

State Of The Art ◽

Optimization Procedure ◽

Experimental Results ◽

Layer By Layer ◽

Complementary Information ◽

Iterative Optimization ◽

Multiple Clusterings ◽

Benchmark Datasets ◽

Data Matrices

Multi-view clustering aims at integrating complementary information from multiple heterogeneous views to improve clustering results. Existing multi-view clustering solutions can only output a single clustering of the data. Due to their multiplicity, multi-view data can have different groupings that are reasonable and interesting from different perspectives. However, how to find multiple, meaningful, and diverse clustering results from multi-view data is still a rarely studied and challenging topic in multi-view clustering and multiple clusterings. In this paper, we introduce a deep matrix factorization-based solution (DMClusts) to discover multiple clusterings. DMClusts gradually factorizes multi-view data matrices into representational subspaces layer by layer and generates one clustering in each layer. To enforce diversity between the generated clusterings, it minimizes a new redundancy quantification term derived from the proximity between samples in these subspaces. We further introduce an iterative optimization procedure to simultaneously seek multiple clusterings with quality and diversity. Experimental results on benchmark datasets confirm that DMClusts outperforms state-of-the-art multiple clustering solutions.
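The abstract does not give the form of the redundancy term. Purely as an illustration of the idea of measuring redundancy through sample proximity, the sketch below assumes a simple stand-in: the normalized inner product between the pairwise dot-product proximity matrices of two subspace representations. This is not the DMClusts objective; `proximity`, `redundancy`, and the toy embeddings are hypothetical.

```python
import math
from typing import List, Sequence

Matrix = List[List[float]]


def proximity(embedding: Sequence[Sequence[float]]) -> Matrix:
    """Pairwise dot-product proximity between sample representations."""
    return [
        [sum(a * b for a, b in zip(u, v)) for v in embedding]
        for u in embedding
    ]


def redundancy(emb_a: Sequence[Sequence[float]],
               emb_b: Sequence[Sequence[float]]) -> float:
    """Assumed redundancy score: cosine similarity between the two
    flattened proximity matrices. Values near 1.0 mean both subspaces
    place the same samples close together."""
    pa, pb = proximity(emb_a), proximity(emb_b)
    dot = sum(x * y for ra, rb in zip(pa, pb) for x, y in zip(ra, rb))
    norm_a = math.sqrt(sum(x * x for row in pa for x in row))
    norm_b = math.sqrt(sum(x * x for row in pb for x in row))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy one-hot representations of four samples in two hypothetical layers.
layer_1 = [[1, 0], [1, 0], [0, 1], [0, 1]]  # groups {0, 1} and {2, 3}
layer_2 = [[1, 0], [0, 1], [1, 0], [0, 1]]  # groups {0, 2} and {1, 3}

same = redundancy(layer_1, layer_1)
different = redundancy(layer_1, layer_2)
```

A diversity-seeking method would penalize pairs of layers with a high score; here the two different groupings score lower than a layer compared with itself.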

EpIntMC: Detecting Epistatic Interactions Using Multiple Clusterings

Bioinformatics Research and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-030-57821-3_6 ◽

2020 ◽

pp. 56-67

Author(s):

Huiling Zhang ◽

Guoxian Yu ◽

Wei Ren ◽

Maozu Guo ◽

Jun Wang

Keyword(s):

Epistatic Interactions ◽

Multiple Clusterings

Pruning High-Similarity Clusters to Optimize Data Diversity when Building Ensemble Classifiers

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026819500275 ◽

2019 ◽

Vol 18 (04) ◽

pp. 1950027

Author(s):

Sam Fletcher ◽

Brijesh Verma

Keyword(s):

State Of The Art ◽

Computation Time ◽

Ensemble Classifier ◽

Classification Error ◽

Ensemble Classifiers ◽

High Similarity ◽

New Approach ◽

The Past ◽

Multiple Clusterings ◽

Benchmark Datasets

Diversity is a key component for building a successful ensemble classifier. One approach to diversifying the base classifiers in an ensemble classifier is to diversify the data they are trained on. While sampling approaches such as bagging have been used for this task in the past, we argue that since they maintain the global distribution, they do not create diversity. Instead, we make a principled argument for the use of k-means clustering to create diversity. Expanding on previous work, we observe that when creating multiple clusterings with multiple k values, there is a risk of different clusterings discovering the same clusters, which would in turn train the same base classifiers. This would bias the ensemble voting process. We propose a new approach that uses the Jaccard Index to detect and remove similar clusters before training the base classifiers, not only saving computation time, but also reducing classification error by removing repeated votes. We empirically demonstrate the effectiveness of the proposed approach compared to the state of the art on 19 UCI benchmark datasets.
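The pruning step described above can be sketched as follows. This is a minimal illustration, assuming clusters are represented as sets of sample ids pooled from several clustering runs and that a cluster is dropped when its Jaccard Index with an already kept cluster reaches a threshold. The threshold value of 0.8, the greedy keep-first order, and the toy clusters are assumptions, not details from the paper.

```python
from typing import FrozenSet, List, Sequence


def jaccard(a: FrozenSet[int], b: FrozenSet[int]) -> float:
    """Jaccard Index: |A intersect B| / |A union B|."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def prune_similar_clusters(clusters: Sequence[FrozenSet[int]],
                           threshold: float = 0.8) -> List[FrozenSet[int]]:
    """Greedily keep a cluster only if it is not too similar to any
    cluster kept so far (threshold is a hypothetical choice)."""
    kept: List[FrozenSet[int]] = []
    for cluster in clusters:
        if all(jaccard(cluster, other) < threshold for other in kept):
            kept.append(cluster)
    return kept


# Clusters pooled from two hypothetical runs (k = 2 and k = 3) over ids 0..5.
clusters_k2 = [frozenset({0, 1, 2}), frozenset({3, 4, 5})]
clusters_k3 = [frozenset({0, 1, 2}), frozenset({3, 4}), frozenset({5})]

kept = prune_similar_clusters(clusters_k2 + clusters_k3, threshold=0.8)
```

Here the cluster {0, 1, 2} appears in both runs, so only one copy survives and a single base classifier would be trained on it, which avoids the repeated votes the abstract mentions.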

Privacy-Preserving Tensor-Based Multiple Clusterings on Cloud for Industrial IoT

IEEE Transactions on Industrial Informatics ◽

10.1109/tii.2018.2871174 ◽

2019 ◽

Vol 15 (4) ◽

pp. 2372-2381 ◽

Cited By ~ 10

Author(s):

Yaliang Zhao ◽

Laurence T. Yang ◽

Jiayu Sun

Keyword(s):

Privacy Preserving ◽

Multiple Clusterings ◽

Industrial IoT
