scholarly journals An Efficient Method for Texture Feature Extraction and Recognition based on Contourlet Transform and Canonical Correlation Analysis

2018 ◽  
Vol 1 (29) ◽  
pp. 498-511
Author(s):  
Ali Mohsin Al-juboori

        Feature extraction is an important processing step in texture classification. For feature extraction in contourlet domain, statistical features for blocks of subband are computed. In this paper, we present an efficient feature vector extraction method for texture classification. For more discriminative feature a canonical correlation analysis method is propose for feature vector fused to the different sample of  texture in the same cluster. The KNN (K-Nearest Neighbor) classifier is utilizing to perform texture classification.  

Biometrika ◽  
2020 ◽  
Vol 107 (3) ◽  
pp. 609-625 ◽  
Author(s):  
Grace Yoon ◽  
Raymond J Carroll ◽  
Irina Gaynanova

Summary Canonical correlation analysis investigates linear relationships between two sets of variables, but it often works poorly on modern datasets because of high dimensionality and mixed data types such as continuous, binary and zero-inflated. To overcome these challenges, we propose a semiparametric approach to sparse canonical correlation analysis based on the Gaussian copula. The main result of this paper is a truncated latent Gaussian copula model for data with excess zeros, which allows us to derive a rank-based estimator of the latent correlation matrix for mixed variable types without estimation of marginal transformation functions. The resulting canonical correlation analysis method works well in high-dimensional settings, as demonstrated via numerical studies, and when applied to the analysis of association between gene expression and microRNA data from breast cancer patients.


2011 ◽  
Vol 18 (3) ◽  
pp. 399-436
Author(s):  
SAMI VIRPIOJA ◽  
MARI-SANNA PAUKKERI ◽  
ABHISHEK TRIPATHI ◽  
TIINA LINDH-KNUUTILA ◽  
KRISTA LAGUS

AbstractVector space models are used in language processing applications for calculating semantic similarities of words or documents. The vector spaces are generated with feature extraction methods for text data. However, evaluation of the feature extraction methods may be difficult. Indirect evaluation in an application is often time-consuming and the results may not generalize to other applications, whereas direct evaluations that measure the amount of captured semantic information usually require human evaluators or annotated data sets. We propose a novel direct evaluation method based on canonical correlation analysis (CCA), the classical method for finding linear relationship between two data sets. In our setting, the two sets are parallel text documents in two languages. A good feature extraction method should provide representations that reflect the semantic content of the documents. Assuming that the underlying semantic content is independent of the language, we can study feature extraction methods that capture the content best by measuring dependence between the representations of a document and its translation. In the case of CCA, the applied measure of dependence is correlation. The evaluation method is based on unsupervised learning, it is language- and domain-independent, and it does not require additional resources besides a parallel corpus. In this paper, we demonstrate the evaluation method on a sentence-aligned parallel corpus. The method is validated by showing that the obtained results with bag-of-words representations are intuitive and agree well with the previous findings. Moreover, we examine the performance of the proposed evaluation method with indirect evaluation methods in simple sentence matching tasks, and a quantitative manual evaluation of word translations. The results of the proposed method correlate well with the results of the indirect and manual evaluations.


Sign in / Sign up

Export Citation Format

Share Document