Spectral clustering and the high-dimensional stochastic blockmodel

Design space exploration can reveal the underlying structure of a design problem. In a set-based approach, for example, exploration can identify sets of designs or regions of the design space that meet specific performance requirements. For some problems, promising designs cluster in multiple regions of the design space, and the boundaries of those clusters can be irregularly shaped and difficult to predict. Visualizing the promising regions can clarify the structure of the design space, but design spaces are typically high-dimensional, which makes them difficult to visualize in three dimensions. Techniques have been introduced to map high-dimensional design spaces to low-dimensional, visualizable spaces. Before the promising regions can be visualized, however, the first task is to identify how many clusters of promising designs exist in the high-dimensional design space. Unsupervised machine learning methods, such as spectral clustering, have been used for this task. Spectral clustering is generally accurate but becomes computationally intractable for large sets of candidate designs. This paper therefore introduces a technique for accurately identifying clusters of promising designs that remains viable for large sets of designs. The technique is based on spectral clustering but reduces its computational cost by incorporating the Nyström method into the formulation of self-tuning spectral clustering. After being validated on a simplified example, the method is applied to identify clusters of high-performance designs for a high-dimensional negative stiffness metamaterials design problem.
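The Nyström idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' method: it only shows how an n x n affinity matrix can be approximated from m landmark points, and it assumes a plain Gaussian kernel with a fixed `gamma` rather than the self-tuning local scaling the abstract refers to. The function names, the value of `gamma`, and the uniform random choice of landmarks are all illustrative assumptions.

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    """Gaussian affinities exp(-gamma * ||x - y||^2) between rows of X and rows of Y."""
    sq_dist = (X ** 2).sum(axis=1)[:, None] + (Y ** 2).sum(axis=1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))  # clip tiny negative round-off

def nystrom_affinity(X, landmark_idx, gamma=1.0):
    """Nystrom approximation K ~ C W^+ C^T built from a subset of landmark rows.

    Only the n x m block C and the m x m block W are evaluated with the kernel.
    The dense n x n product is formed here purely for inspection; a large-scale
    implementation would keep the factors instead (assumption of this sketch).
    """
    C = rbf_kernel(X, X[landmark_idx], gamma)  # affinities of all points to the landmarks
    W = C[landmark_idx]                        # affinities among the landmarks
    return C @ np.linalg.pinv(W) @ C.T

# Hypothetical data: 40 candidate designs in 5 dimensions, 10 random landmarks.
rng = np.random.default_rng(0)
X = rng.standard_normal((40, 5))
landmarks = rng.choice(len(X), size=10, replace=False)
K_approx = nystrom_affinity(X, landmarks)
K_exact = rbf_kernel(X, X, 1.0)
```

The approximation reproduces the landmark rows and columns of the exact matrix and interpolates the rest, so its quality depends on how well the landmarks cover the data.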


Regression of High Dimensional Data on the Grassmann Manifold using Spectral Clustering

Proceedings of the 8th International Conference on Computational Stochastic Mechanics (CSM 8) ◽

10.3850/978-981-11-2723-6_25-cd ◽

2018 ◽

Author(s):

Dimitris G. Giovanis ◽

Michael D. Shields

Keyword(s):

Spectral Clustering ◽

Grassmann Manifold ◽

High Dimensional Data ◽

High Dimensional


Minimum Similarity Sampling Scheme for Nyström Based Spectral Clustering on Large Scale High-Dimensional Data

Modern Advances in Applied Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-319-07467-2_28 ◽

2014 ◽

pp. 260-269 ◽

Cited By ~ 4

Author(s):

Zhicheng Zeng ◽

Ming Zhu ◽

Hong Yu ◽

Honglian Ma

Keyword(s):

Spectral Clustering ◽

Large Scale ◽

High Dimensional Data ◽

Sampling Scheme ◽

High Dimensional


Spectral clustering of high-dimensional data via Nonnegative Matrix Factorization

2015 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2015.7280465 ◽

2015 ◽

Cited By ~ 3

Author(s):

Shulin Wang ◽

Fang Chen ◽

Jianwen Fang

Keyword(s):

Matrix Factorization ◽

Spectral Clustering ◽

Nonnegative Matrix Factorization ◽

High Dimensional Data ◽

Nonnegative Matrix ◽

High Dimensional


Spectral clustering of high-dimensional data exploiting sparse representation vectors

Neurocomputing ◽

10.1016/j.neucom.2013.12.027 ◽

2014 ◽

Vol 135 ◽

pp. 229-239 ◽

Cited By ~ 26

Author(s):

Sen Wu ◽

Xiaodong Feng ◽

Wenjun Zhou

Keyword(s):

Sparse Representation ◽

Spectral Clustering ◽

High Dimensional Data ◽

High Dimensional


Spectral Clustering and Vantage Point Indexing for Efficient Data Retrieval

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v8i4.pp2261-2271 ◽

2018 ◽

Vol 8 (4) ◽

pp. 2261

Author(s):

Pushpalatha R. ◽

K. Meenakshi Sundaram

Keyword(s):

Spectral Clustering ◽

High Dimensional Data ◽

Data Retrieval ◽

True Positive Rate ◽

Vantage Point ◽

Space Complexity ◽

High Dimensional ◽

Retrieval Time ◽

User Query ◽

Data Points

Data mining is an essential process for identifying patterns in large datasets through machine learning techniques and database systems. Clustering high-dimensional data is becoming a very challenging process because of the curse of dimensionality. In addition, space complexity and data retrieval performance have not been improved. To overcome these limitations, a Spectral Clustering Based VP Tree Indexing Technique is introduced. The technique clusters and indexes densely populated high-dimensional data points for effective data retrieval based on a user query. A Normalized Spectral Clustering Algorithm is used to group similar high-dimensional data points. A Vantage Point Tree is then constructed to index the clustered data points with minimum space complexity. Finally, the indexed data are retrieved based on the user query using a Vantage Point Tree based Data Retrieval Algorithm. This in turn helps to improve the true positive rate with minimum retrieval time. Performance is measured in terms of space complexity, true positive rate, and data retrieval time on the El Nino weather data sets from the UCI Machine Learning Repository. Experimental results show that the proposed technique reduces space complexity by 33% and data retrieval time by 24% compared with state-of-the-art works.
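The indexing step described in this abstract can be illustrated with a generic vantage-point tree. The sketch below is not the paper's algorithm: it assumes Euclidean distance, picks the first remaining point as the vantage point, splits at the median distance, and answers a single nearest-neighbor query. The class and function names are hypothetical.

```python
import numpy as np

class VPNode:
    def __init__(self, index, radius, inside, outside):
        self.index = index      # row index of the vantage point in the data array
        self.radius = radius    # median distance from the vantage point to the remaining points
        self.inside = inside    # subtree of points with distance <= radius
        self.outside = outside  # subtree of points with distance > radius

def build_vp_tree(X, indices=None):
    """Recursively split the points by their distance to a vantage point."""
    if indices is None:
        indices = np.arange(len(X))
    if len(indices) == 0:
        return None
    vp, rest = indices[0], indices[1:]
    if len(rest) == 0:
        return VPNode(vp, 0.0, None, None)
    d = np.linalg.norm(X[rest] - X[vp], axis=1)
    radius = float(np.median(d))
    return VPNode(vp, radius,
                  build_vp_tree(X, rest[d <= radius]),
                  build_vp_tree(X, rest[d > radius]))

def nearest(node, X, q, best=(np.inf, -1)):
    """Return (distance, index) of the stored point closest to the query q."""
    if node is None:
        return best
    d = float(np.linalg.norm(X[node.index] - q))
    if d < best[0]:
        best = (d, int(node.index))
    # Visit the side containing q first; visit the other side only if the
    # triangle inequality cannot rule out a closer point there.
    if d <= node.radius:
        best = nearest(node.inside, X, q, best)
        if d + best[0] > node.radius:
            best = nearest(node.outside, X, q, best)
    else:
        best = nearest(node.outside, X, q, best)
        if d - best[0] <= node.radius:
            best = nearest(node.inside, X, q, best)
    return best

# Hypothetical data: 200 points in 8 dimensions and one query.
rng = np.random.default_rng(1)
X = rng.standard_normal((200, 8))
tree = build_vp_tree(X)
query = rng.standard_normal(8)
dist, idx = nearest(tree, X, query)
```

In a pipeline like the one the abstract describes, one such tree could be built per cluster, so a query is compared only against points in the relevant cluster; that arrangement is an assumption here, not a detail given in the abstract.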


The Approach of Adaptive Spectral Clustering Analyze on High Dimensional Data

2010 International Conference on Computational and Information Sciences ◽

10.1109/iccis.2010.45 ◽

2010 ◽

Author(s):

Liping Cai ◽

Xuchuan Zhou ◽

Jiancheng Song

Keyword(s):

Spectral Clustering ◽

High Dimensional Data ◽

High Dimensional


Clustering High-Dimensional Data via Spectral Clustering Using Collaborative Representation Coefficients

Intelligent Computing Theories and Methodologies - Lecture Notes in Computer Science ◽

10.1007/978-3-319-22186-1_25 ◽

2015 ◽

pp. 248-258 ◽

Cited By ~ 1

Author(s):

Shulin Wang ◽

Jinchao Gu ◽

Fang Chen

Keyword(s):

Spectral Clustering ◽

High Dimensional Data ◽

Collaborative Representation ◽

High Dimensional


Spectral Clustering of High-Dimensional Data via k-Nearest Neighbor Based Sparse Representation Coefficients

Lecture Notes in Computer Science - Advanced Intelligent Computing Theories and Applications ◽

10.1007/978-3-319-22053-6_40 ◽

2015 ◽

pp. 363-374

Author(s):

Fang Chen ◽

Shulin Wang ◽

Jianwen Fang

Keyword(s):

Sparse Representation ◽

Spectral Clustering ◽

Nearest Neighbor ◽

High Dimensional Data ◽

High Dimensional ◽

K Nearest Neighbor


Robust Structured Low-Rank Representation for Image Segmentation

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213018500203 ◽

2018 ◽

Vol 27 (05) ◽

pp. 1850020 ◽

Cited By ~ 1

Author(s):

Cong-Zhe You ◽

Vasile Palade ◽

Xiao-Jun Wu

Keyword(s):

Clustering Analysis ◽

Spectral Clustering ◽

Optimization Problem ◽

Subspace Clustering ◽

Low Rank ◽

Joint Optimization ◽

High Dimensional ◽

Great Success ◽

Subspace Segmentation ◽

Low Rank Representation

Subspace clustering algorithms are often employed when dealing with high-dimensional data. As a representative approach, Low-Rank Representation (LRR) of data has achieved great success in subspace segmentation tasks in applications such as image processing. Traditional LRR-related methods consist of two separate tasks: first, an affinity graph is constructed using low-rank minimization techniques; then spectral clustering is performed on the affinity graph to obtain the final segmentation. Because these two steps are independent of each other, this approach does not guarantee that the results obtained are globally optimal. In this paper, a method called Robust Structured Low-Rank Representation (RSLRR) is proposed, which integrates the two tasks and solves a joint optimization problem. This paper also puts forward a method for solving the joint optimization problem that efficiently yields both the segmentation and the structured low-rank representation. Experiments on several standard datasets show that the proposed algorithm achieves better clustering results than other algorithms.
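The first of the two traditional steps can be sketched for the idealized noise-free case. This is the baseline affinity construction, not RSLRR. It relies on the known closed form for noise-free LRR, where the minimizer of the nuclear norm of Z subject to X = XZ is Z = V V^T, with V the right singular vectors of X. The synthetic data, the known rank, and the function name are assumptions made for illustration.

```python
import numpy as np

def lrr_affinity_noise_free(X, rank):
    """Affinity |Z| + |Z|^T from the closed-form noise-free LRR solution Z = V V^T.

    X has one data point per column; `rank` is the assumed rank of X.
    """
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    V = Vt[:rank].T          # right singular vectors for the nonzero singular values
    Z = V @ V.T              # projector onto the row space of X
    return np.abs(Z) + np.abs(Z).T

# Hypothetical data: 15 points from each of two independent 2-D subspaces of R^10.
rng = np.random.default_rng(2)
basis_a = rng.standard_normal((10, 2))
basis_b = rng.standard_normal((10, 2))
X = np.hstack([basis_a @ rng.standard_normal((2, 15)),
               basis_b @ rng.standard_normal((2, 15))])
A = lrr_affinity_noise_free(X, rank=4)
cross_block = A[:15, 15:]    # affinities between points from different subspaces
```

For independent subspaces and clean data, the cross-subspace affinities vanish up to round-off, so the graph separates into one component per subspace and spectral clustering on `A` is straightforward. With noise or outliers this closed form no longer applies, and an iterative solver for the regularized problem would be needed.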
