Enhancing Both Efficiency and Representational Capability of Isomap by Extensive Landmark Selection

The problems of improving computational efficiency and extending representational capability are the two hottest topics in approaches of global manifold learning. In this paper, a new method called extensive landmark Isomap (EL-Isomap) is presented, addressing both topics simultaneously. On one hand, originated from landmark Isomap (L-Isomap), which is known for its high computational efficiency property, EL-Isomap also possesses high computational efficiency through utilizing a small set of landmarks to embed all data points. On the other hand, EL-Isomap significantly extends the representational capability of L-Isomap and other global manifold learning approaches by utilizing only an available subset from the whole landmark set instead of all to embed each point. Particularly, compared with other manifold learning approaches, the data manifolds with intrinsic low-dimensional concave topologies and essential loops can be unwrapped by the new method more successfully, which are shown by simulation results on a series of synthetic and real-world data sets. Moreover, the accuracy, robustness, and computational complexity of EL-Isomap are analyzed in this paper, and the relation between EL-Isomap and L-Isomap is also discussed theoretically.

Download Full-text

Classification of Infrared Objects in Manifold Space Using Kullback-Leibler Divergence of Gaussian Distributions of Image Points

Symmetry ◽

10.3390/sym12030434 ◽

2020 ◽

Vol 12 (3) ◽

pp. 434 ◽

Cited By ~ 2

Author(s):

Huilin Ge ◽

Zhiyu Zhu ◽

Kang Lou ◽

Wei Wei ◽

Runbang Liu ◽

...

Keyword(s):

Manifold Learning ◽

Gaussian Distribution ◽

Classification Accuracy ◽

Infrared Image ◽

Data Sets ◽

Dimensional Manifold ◽

Infrared Images ◽

Leibler Divergence ◽

Data Points ◽

Low Dimensional

Infrared image recognition technology can work day and night and has a long detection distance. However, the infrared objects have less prior information and external factors in the real-world environment easily interfere with them. Therefore, infrared object classification is a very challenging research area. Manifold learning can be used to improve the classification accuracy of infrared images in the manifold space. In this article, we propose a novel manifold learning algorithm for infrared object detection and classification. First, a manifold space is constructed with each pixel of the infrared object image as a dimension. Infrared images are represented as data points in this constructed manifold space. Next, we simulate the probability distribution information of infrared data points with the Gaussian distribution in the manifold space. Then, based on the Gaussian distribution information in the manifold space, the distribution characteristics of the data points of the infrared image in the low-dimensional space are derived. The proposed algorithm uses the Kullback-Leibler (KL) divergence to minimize the loss function between two symmetrical distributions, and finally completes the classification in the low-dimensional manifold space. The efficiency of the algorithm is validated on two public infrared image data sets. The experiments show that the proposed method has a 97.46% classification accuracy and competitive speed in regards to the analyzed data sets.

Download Full-text

Manifold Alignment Aware Ants: A Markovian Process for Manifold Extraction

Neural Computation ◽

10.1162/neco_a_01478 ◽

2022 ◽

pp. 1-47

Author(s):

Mohammad Mohammadi ◽

Peter Tino ◽

Kerstin Bunte

Keyword(s):

Background Noise ◽

Globular Clusters ◽

Large Data ◽

Local Alignment ◽

Density Estimator ◽

Data Sets ◽

Real World Data ◽

Data Points ◽

Low Dimensional ◽

Food Seeking

Abstract The presence of manifolds is a common assumption in many applications, including astronomy and computer vision. For instance, in astronomy, low-dimensional stellar structures, such as streams, shells, and globular clusters, can be found in the neighborhood of big galaxies such as the Milky Way. Since these structures are often buried in very large data sets, an algorithm, which can not only recover the manifold but also remove the background noise (or outliers), is highly desirable. While other works try to recover manifolds either by pushing all points toward manifolds or by downsampling from dense regions, aiming to solve one of the problems, they generally fail to suppress the noise on manifolds and remove background noise simultaneously. Inspired by the collective behavior of biological ants in food-seeking process, we propose a new algorithm that employs several random walkers equipped with a local alignment measure to detect and denoise manifolds. During the walking process, the agents release pheromone on data points, which reinforces future movements. Over time the pheromone concentrates on the manifolds, while it fades in the background noise due to an evaporation procedure. We use the Markov chain (MC) framework to provide a theoretical analysis of the convergence of the algorithm and its performance. Moreover, an empirical analysis, based on synthetic and real-world data sets, is provided to demonstrate its applicability in different areas, such as improving the performance of t-distributed stochastic neighbor embedding (t-SNE) and spectral clustering using the underlying MC formulas, recovering astronomical low-dimensional structures, and improving the performance of the fast Parzen window density estimator.

Download Full-text

An Incremental Isomap Method for Hyperspectral Dimensionality Reduction and Classification

Photogrammetric Engineering & Remote Sensing ◽

10.14358/pers.87.7.445 ◽

2021 ◽

Vol 87 (6) ◽

pp. 445-455

Author(s):

Yi Ma ◽

Zezhong Zheng ◽

Yutang Ma ◽

Mingcang Zhu ◽

Ran Huang ◽

...

Keyword(s):

Manifold Learning ◽

Nearest Neighbor ◽

Hyperspectral Image ◽

Hyperspectral Data ◽

Training Data ◽

Support Vector ◽

Data Sets ◽

K Nearest Neighbor ◽

Data Set ◽

Data Points

Many manifold learning algorithms conduct an eigen vector analysis on a data-similarity matrix with a size of N×N, where N is the number of data points. Thus, the memory complexity of the analysis is no less than O(N2). We pres- ent in this article an incremental manifold learning approach to handle large hyperspectral data sets for land use identification. In our method, the number of dimensions for the high-dimensional hyperspectral-image data set is obtained with the training data set. A local curvature varia- tion algorithm is utilized to sample a subset of data points as landmarks. Then a manifold skeleton is identified based on the landmarks. Our method is validated on three AVIRIS hyperspectral data sets, outperforming the comparison algorithms with a k–nearest-neighbor classifier and achieving the second best performance with support vector machine.

Download Full-text

Learning Manifolds

Machine Learning in Computer-Aided Diagnosis - Advances in Bioinformatics and Biomedical Engineering ◽

10.4018/978-1-4666-0059-1.ch018 ◽

2012 ◽

pp. 374-402

Author(s):

Diana Mateus ◽

Christian Wachinger ◽

Selen Atasoy ◽

Loren Schwarz ◽

Nassir Navab

Keyword(s):

Manifold Learning ◽

Domain Knowledge ◽

Dimensional Space ◽

Human Motion ◽

Motion Modeling ◽

Learning Methods ◽

Data Representations ◽

Non Linear ◽

Data Points ◽

Low Dimensional

Computer aided diagnosis is often confronted with processing and analyzing high dimensional data. One alternative to deal with such data is dimensionality reduction. This chapter focuses on manifold learning methods to create low dimensional data representations adapted to a given application. From pairwise non-linear relations between neighboring data-points, manifold learning algorithms first approximate the low dimensional manifold where data lives with a graph; then, they find a non-linear map to embed this graph into a low dimensional space. Since the explicit pairwise relations and the neighborhood system can be designed according to the application, manifold learning methods are very flexible and allow easy incorporation of domain knowledge. The authors describe different assumptions and design elements that are crucial to building successful low dimensional data representations with manifold learning for a variety of applications. In particular, they discuss examples for visualization, clustering, classification, registration, and human-motion modeling.

Download Full-text

More faithfulness graph embedding

International Journal of Applied Mathematical Research ◽

10.14419/ijamr.v4i2.4419 ◽

2015 ◽

Vol 4 (2) ◽

pp. 336

Author(s):

Alaa Najim

Keyword(s):

Dimensionality Reduction ◽

Graph Embedding ◽

New Method ◽

Graph Representation ◽

Data Sets ◽

Graph Visualization ◽

Graph Data ◽

Original Space ◽

Data Points ◽

Effectiveness And Efficiency

<p><span lang="EN-GB">Using dimensionality reduction idea to visualize graph data sets can preserve the properties of the original space and reveal the underlying information shared among data points. Continuity Trustworthy Graph Embedding (CTGE) is new method we have introduced in this paper to improve the faithfulness of the graph visualization. We will use CTGE in graph field to find new understandable representation to be more easy to analyze and study. Several experiments on real graph data sets are applied to test the effectiveness and efficiency of the proposed method, which showed CTGE generates highly faithfulness graph representation when compared its representation with other methods.</span></p>

Download Full-text

Multi-View Multi-Label Learning with View-Specific Information Extraction

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/539 ◽

2019 ◽

Cited By ~ 4

Author(s):

Xuan Wu ◽

Qing-Guo Chen ◽

Yao Hu ◽

Dengbao Wang ◽

Xiaodong Chang ◽

...

Keyword(s):

Information Extraction ◽

Real World ◽

State Of The Art ◽

Specific Information ◽

Learning Approach ◽

Data Sets ◽

Learning Approaches ◽

Real World Data ◽

Learning Techniques ◽

Shared Information

Multi-view multi-label learning serves an important framework to learn from objects with diverse representations and rich semantics. Existing multi-view multi-label learning techniques focus on exploiting shared subspace for fusing multi-view representations, where helpful view-specific information for discriminative modeling is usually ignored. In this paper, a novel multi-view multi-label learning approach named SIMM is proposed which leverages shared subspace exploitation and view-specific information extraction. For shared subspace exploitation, SIMM jointly minimizes confusion adversarial loss and multi-label loss to utilize shared information from all views. For view-specific information extraction, SIMM enforces an orthogonal constraint w.r.t. the shared subspace to utilize view-specific discriminative information. Extensive experiments on real-world data sets clearly show the favorable performance of SIMM against other state-of-the-art multi-view multi-label learning approaches.

Download Full-text

LTSA Algorithm for Dimension Reduction of Microarray Data

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.645.192 ◽

2013 ◽

Vol 645 ◽

pp. 192-195 ◽

Cited By ~ 1

Author(s):

Xiao Zhou Chen

Keyword(s):

Dimensionality Reduction ◽

Dimension Reduction ◽

Manifold Learning ◽

Microarray Data ◽

Learning Algorithm ◽

Medical Applications ◽

Data Sets ◽

Learning Method ◽

Data Dimensionality Reduction ◽

Low Dimensional

Dimension reduction is an important issue to understand microarray data. In this study, we proposed a efficient approach for dimensionality reduction of microarray data. Our method allows to apply the manifold learning algorithm to analyses dimensionality reduction of microarray data. The intra-/inter-category distances were used as the criteria to quantitatively evaluate the effects of data dimensionality reduction. Colon cancer and leukaemia gene expression datasets are selected for our investigation. When the neighborhood parameter was effectivly set, all the intrinsic dimension numbers of data sets were low. Therefore, manifold learning is used to study microarray data in the low-dimensional projection space. Our results indicate that Manifold learning method possesses better effects than the linear methods in analysis of microarray data, which is suitable for clinical diagnosis and other medical applications.

Download Full-text

Affinity Learning for Mixed Data Clustering

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/302 ◽

2017 ◽

Cited By ~ 1

Author(s):

Nan Li ◽

Longin Jan Latecki

Keyword(s):

Data Clustering ◽

Mixed Type ◽

Original Data ◽

Mixed Data ◽

Abstract Objects ◽

Data Sets ◽

Process Data ◽

Real World Data ◽

Specific Data ◽

Data Points

In this paper, we propose a novel affinity learning based framework for mixed data clustering, which includes: how to process data with mixed-type attributes, how to learn affinities between data points, and how to exploit the learned affinities for clustering. In the proposed framework, each original data attribute is represented with several abstract objects defined according to the specific data type and values. Each attribute value is transformed into the initial affinities between the data point and the abstract objects of attribute. We refine these affinities and infer the unknown affinities between data points by taking into account the interconnections among the attribute values of all data points. The inferred affinities between data points can be exploited for clustering. Alternatively, the refined affinities between data points and the abstract objects of attributes can be transformed into new data features for clustering. Experimental results on many real world data sets demonstrate that the proposed framework is effective for mixed data clustering.

Download Full-text

AN ALGORITHMIC COMPUTATION OF CORRELATION DIMENSION FROM TIME SERIES

Modern Physics Letters B ◽

10.1142/s0217984907012517 ◽

2007 ◽

Vol 21 (02n03) ◽

pp. 129-138 ◽

Cited By ~ 5

Author(s):

K. P. HARIKRISHNAN ◽

G. AMBIKA ◽

R. MISRA

Keyword(s):

Time Series ◽

Hypothesis Testing ◽

Correlation Dimension ◽

Visual Inspection ◽

Chaotic Systems ◽

Synthetic Data ◽

Data Sets ◽

Data Points ◽

Scaling Region ◽

Low Dimensional

We present an algorithmic scheme to compute the correlation dimension D2 of a time series, without requiring the visual inspection of the scaling region in the correlation sum. It is based on the standard Grassberger–Proccacia [GP] algorithm for computing D2. The scheme is tested using synthetic data sets from several standard chaotic systems as well as by adding noise to low-dimensional chaotic data. We show that the scheme is efficient with a few thousand data points and is most suitable when a nonsubjective comparison of D2 values of two time series is required, such as, in hypothesis testing.

Download Full-text

An improved Isomap method for manifold learning

International Journal of Intelligent Computing and Cybernetics ◽

10.1108/ijicc-03-2016-0014 ◽

2017 ◽

Vol 10 (1) ◽

pp. 30-40 ◽

Cited By ~ 4

Author(s):

Taiguo Qu ◽

Zixing Cai

Keyword(s):

Manifold Learning ◽

High Speed ◽

Shortest Paths ◽

Computation Time ◽

Data Sets ◽

Content Type ◽

Data Points ◽

S Curve ◽

Neighbourhood Graph ◽

Isometric Feature Mapping

Purpose Isometric feature mapping (Isomap) is a very popular manifold learning method and is widely used in dimensionality reduction and data visualization. The most time-consuming step in Isomap is to compute the shortest paths between all pairs of data points based on a neighbourhood graph. The classical Isomap (C-Isomap) is very slow, due to the use of Floyd’s algorithm to compute the shortest paths. The purpose of this paper is to speed up Isomap. Design/methodology/approach Through theoretical analysis, it is found that the neighbourhood graph in Isomap is sparse. In this case, the Dijkstra’s algorithm with Fibonacci heap (Fib-Dij) is faster than Floyd’s algorithm. In this paper, an improved Isomap method based on Fib-Dij is proposed. By using Fib-Dij to replace Floyd’s algorithm, an improved Isomap method is presented in this paper. Findings Using the S-curve, the Swiss-roll, the Frey face database, the mixed national institute of standards and technology database of handwritten digits and a face image database, the performance of the proposed method is compared with C-Isomap, showing the consistency with C-Isomap and marked improvements in terms of the high speed. Simulations also demonstrate that Fib-Dij reduces the computation time of the shortest paths from O(N3) to O(N2lgN). Research limitations/implications Due to the limitations of the computer, the sizes of the data sets in this paper are all smaller than 3,000. Therefore, researchers are encouraged to test the proposed algorithm on larger data sets. Originality/value The new method based on Fib-Dij can greatly improve the speed of Isomap.

Download Full-text