Visualization of Non-vectorial Data Using Twin Kernel Embedding

GRAPH CLASSIFICATION BASED ON VECTOR SPACE EMBEDDING

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800140900748x ◽

2009 ◽

Vol 23 (06) ◽

pp. 1053-1081 ◽

Cited By ~ 48

Author(s):

KASPAR RIESEN ◽

HORST BUNKE

Keyword(s):

Data Sets ◽

Object Representations ◽

Graph Classification ◽

Graph Edit Distance ◽

Graph Kernels ◽

Symbolic Data ◽

Representational Power ◽

High Degree ◽

Vectorial Data ◽

Representation Formalism

Graphs provide us with a powerful and flexible representation formalism for pattern classification. Many classification algorithms have been proposed in the literature. However, the vast majority of these algorithms rely on vectorial data descriptions and cannot directly be applied to graphs. Recently, a growing interest in graph kernel methods can be observed. Graph kernels aim at bridging the gap between the high representational power and flexibility of graphs and the large amount of algorithms available for object representations in terms of feature vectors. In the present paper, we propose an approach transforming graphs into n-dimensional real vectors by means of prototype selection and graph edit distance computation. This approach allows one to build graph kernels in a straightforward way. It is not only applicable to graphs, but also to other kind of symbolic data in conjunction with any kind of dissimilarity measure. Thus it is characterized by a high degree of flexibility. With several experimental results, we prove the robustness and flexibility of our new method and show that our approach outperforms other graph classification methods on several graph data sets of diverse nature.

Fuzzy Clustering Methods for Categorical Multivariate Data Based on q-Divergence

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2018.p0524 ◽

2018 ◽

Vol 22 (4) ◽

pp. 524-536 ◽

Cited By ~ 2

Author(s):

Tadafumi Kondo ◽

◽

Yuchi Kanzawa

Keyword(s):

Conventional Method ◽

Fuzzy Clustering ◽

Optimization Problems ◽

Clustering Algorithms ◽

Multivariate Data ◽

Clustering Methods ◽

Kl Divergence ◽

Fuzzy Clustering Methods ◽

Conventional Methods ◽

Vectorial Data

This paper presents two fuzzy clustering algorithms for categorical multivariate data based on q-divergence. First, this study shows that a conventional method for vectorial data can be explained as regularizing another conventional method using q-divergence. Second, based on the known results that Kullback-Leibler (KL)-divergence is generalized into the q-divergence, and two conventional fuzzy clustering methods for categorical multivariate data adopt KL-divergence, two fuzzy clustering algorithms for categorical multivariate data that are based on q-divergence are derived from two optimization problems built by extending the KL-divergence in these conventional methods to the q-divergence. Through numerical experiments using real datasets, the proposed methods outperform the conventional methods in term of clustering accuracy.

FlowVis — a CAD based solution for the graphical post-processing of scalar and vectorial data

Environmental Modelling & Software ◽

10.1016/s1364-8152(97)00006-6 ◽

1997 ◽

Vol 12 (2-3) ◽

pp. 161-168 ◽

Cited By ~ 3

Author(s):

A.M.G. Lopes

Keyword(s):

Post Processing ◽

Vectorial Data

EZ-ROSE: a computer program for equal-area circular histograms and statistical analysis of two-dimensional vectorial data

Computers & Geosciences ◽

10.1016/s0098-3004(99)00072-2 ◽

2000 ◽

Vol 26 (2) ◽

pp. 153-166 ◽

Cited By ~ 39

Author(s):

Jaco H. Baas

Keyword(s):

Statistical Analysis ◽

Computer Program ◽

Two Dimensional ◽

Equal Area ◽

Vectorial Data ◽

Circular Histograms

SELF-ORGANIZING MAP-BASED DISCOVERY AND VISUALIZATION OF HUMAN ENDOGENOUS RETROVIRAL SEQUENCE GROUPS

International Journal of Neural Systems ◽

10.1142/s0129065705000177 ◽

2005 ◽

Vol 15 (03) ◽

pp. 163-179 ◽

Cited By ~ 23

Author(s):

MERJA OJA ◽

GÖRAN O. SPERBER ◽

JONAS BLOMBERG ◽

SAMUEL KASKI

Keyword(s):

Human Genome ◽

Common Origin ◽

Self Organizing Map ◽

Visualization Method ◽

Bootstrap Procedure ◽

Human Genes ◽

Vectorial Data ◽

Self Organizing

About 8 per cent of the human genome consists of human endogenous retroviral sequences (HERVs), which are remains from ancient infections. The HERVs may give rise to transcripts or affect the expression of human genes. The first step in understanding HERV function is to classify HERVs into families. In this work we study the relationships of existing HERV families and detect potentially new HERV families. A Median Self-Organizing Map (SOM), a SOM for non-vectorial data, is used to group and visualize a collection of 3661 HERVs. The SOM-based analysis is complemented with estimates of the reliability of the results. A novel trustworthiness visualization method is used to estimate which parts of the SOM visualization are reliable and which not. The reliability of extracted interesting HERV groups is verified by a bootstrap procedure suitable for SOM visualization-based analysis. The SOM detects a group of epsilonretroviral sequences and a group of ERV9, HERVW, and HUERSP3 sequences which suggests that ERV9 and HERVW sequences may have a common origin.

Estimating and modeling spatio-temporal complex-valued covariance functions

10.5194/egusphere-egu21-16382 ◽

2021 ◽

Author(s):

Sabrina Maggio ◽

Donato Posa ◽

Sandra De Iaco ◽

Claudia Cappello

Keyword(s):

Complex Data ◽

Temporal Dependence ◽

Covariance Functions ◽

Covariance Models ◽

Unit Of Measurement ◽

Oceanographic Data ◽

Spatio Temporal ◽

Complex Valued ◽

Vectorial Data

Oceanographic data belong to the wide class of vectorial data, for which the decomposition in modulus and direction is meaningful, and the vectorial components are characterized by homogeneous quantities, with the same unit of measurement. Another feature of oceanographic data is that they exhibit spatio-temporal dependence. In Geostatistics, such data can be properly modelled by recalling the theory of complex-valued random fields. However, in the literature, only techniques for modeling and predicting the spatial evolution of these phenomena were proposed; while the temporal dependence was analyzed separately from the spatial one, or just time-varying complex covariance models were used. Thus, the novelty of this paper regards some advances of the complex formalism for analyzing complex data in space-time and new classes of spatio-temporal complex covariance models. A case study on spatio-temporal complex estimating and modeling with oceanographic data is provided and a comparison between two classes of complex covariance models is also proposed.

Complex-Valued Random Fields for Vectorial Data: Estimating and Modeling Aspects

Mathematical Geosciences ◽

10.1007/s11004-013-9468-z ◽

2013 ◽

Vol 45 (5) ◽

pp. 557-573 ◽

Cited By ~ 3

Author(s):

S. De Iaco ◽

D. Posa ◽

M. Palma

Keyword(s):

Random Fields ◽

Complex Valued ◽

Vectorial Data

Dynamics and Topographic Organization of Recursive Self-Organizing Maps

Neural Computation ◽

10.1162/neco.2006.18.10.2529 ◽

2006 ◽

Vol 18 (10) ◽

pp. 2529-2567 ◽

Cited By ~ 16

Author(s):

Peter Tiňo ◽

Igor Farkaš ◽

Jort van Mourik

Keyword(s):

Receptive Fields ◽

Topographic Maps ◽

Sequential Data ◽

Self Organizing Map ◽

Self Organizing Maps ◽

Fixed Input ◽

Feedback Connections ◽

Superior Memory ◽

Vectorial Data ◽

Self Organizing

Recently there has been an outburst of interest in extending topographic maps of vectorial data to more general data structures, such as sequences or trees. However, there is no general consensus as to how best to process sequences using topographic maps, and this topic remains an active focus of neurocomputational research. The representational capabilities and internal representations of the models are not well understood. Here, we rigorously analyze a generalization of the self-organizing map (SOM) for processing sequential data, recursive SOM(RecSOM) (Voegtlin, 2002), as a nonautonomous dynamical system consisting of a set of fixed input maps. We argue that contractive fixed-input maps are likely to produce Markovian organizations of receptive fields on the RecSOM map. We derive bounds on parameter β (weighting the importance of importing past information when processing sequences) under which contractiveness of the fixed-input maps is guaranteed. Some generalizations of SOM contain a dynamic module responsible for processing temporal contexts as an integral part of the model. We show that Markovian topographic maps of sequential data can be produced using a simple fixed (nonadaptable) dynamic module externally feeding a standard topographic model designed to process static vectorial data of fixed dimensionality (e.g., SOM). However, by allowing trainable feedback connections, one can obtain Markovian maps with superior memory depth and topography preservation. We elaborate on the importance of non-Markovian organizations in topographic maps of sequential data.

Learning Out-Of Sample Mapping in Non-Vectorial Data Reduction using Constrained Twin Kernel Embedding

2007 International Conference on Machine Learning and Cybernetics ◽

10.1109/icmlc.2007.4370108 ◽

2007 ◽

Author(s):

Yi Guo ◽

Junbin Gao ◽

Paul W. Kwan

Keyword(s):

Data Reduction ◽

Out Of Sample ◽

Vectorial Data

Analysis Of Vectorial Data

Journal of Sedimentary Research ◽

10.1306/74d70d76-2b21-11d7-8648000102c1865d ◽

1962 ◽

Vol Vol. 32 ◽

Cited By ~ 7

Author(s):

Richard Steinmetz

Keyword(s):

Vectorial Data