Natural Scene Image Annotation Using Local Semantic Concepts and Spatial Bag of Visual Words

Author(s): Yousef Alqasrawi

Author(s): Jalila Filali, Hajer Baazaoui Zghal, Jean Martinet

With the rapid growth of image collections, image classification and annotation have been active areas of research with notable recent progress. The Bag-of-Visual-Words (BoVW) model, which relies on building a visual vocabulary, has been widely used in this area. Recently, attention has shifted to advanced architectures characterized by multi-level processing; the Hierarchical Max-Pooling (HMAX) model, in particular, has attracted a great deal of attention in image classification. Several ontology-based approaches have also been proposed to improve image classification and annotation. However, both tasks remain challenging, notably because of ambiguity between classes, which can degrade the quality of classification and annotation results. In this paper, we propose an ontology-based image classification and annotation approach. Our contributions are threefold: (1) exploiting ontological relationships between classes during both the classification and annotation processes; (2) combining the outputs of hypernym and hyponym classifiers to achieve better discrimination between classes; and (3) annotating images by combining hypernym and hyponym classification results in order to improve annotation and to reduce ambiguous and inconsistent annotations. Several strategies have been evaluated experimentally, and the results show that using ontologies in this way improves both image classification and annotation.
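The second contribution, fusing hypernym-level and hyponym-level classifier outputs through the ontology's is-a links, can be illustrated with a short sketch. The class names, the two-level hierarchy, the probability values, and the simple product-fusion rule below are illustrative assumptions, not the authors' exact strategy.

```python
# Minimal sketch (not the authors' exact method) of fusing hypernym- and
# hyponym-level classifier outputs through an ontology's is-a links.

# Hypothetical ontology fragment: hyponym (leaf class) -> hypernym (superclass)
HYPERNYM_OF = {
    "beach": "coast", "cliff": "coast",
    "street": "urban", "highway": "urban",
}

def fuse_scores(hyponym_probs, hypernym_probs):
    """Re-score each leaf class by the probability of its parent concept.

    hyponym_probs : dict mapping leaf class -> P(leaf | image)
    hypernym_probs: dict mapping superclass -> P(superclass | image)
    Returns leaf-class scores renormalised to sum to 1.
    """
    fused = {c: p * hypernym_probs[HYPERNYM_OF[c]] for c, p in hyponym_probs.items()}
    total = sum(fused.values()) or 1.0
    return {c: s / total for c, s in fused.items()}

# Toy example: the hyponym classifier hesitates between "beach" and "street",
# but the hypernym classifier is confident the scene is a "coast", so the
# fused scores favour "beach" and suppress the inconsistent "street" label.
hyponym_probs = {"beach": 0.40, "cliff": 0.10, "street": 0.38, "highway": 0.12}
hypernym_probs = {"coast": 0.85, "urban": 0.15}
print(fuse_scores(hyponym_probs, hypernym_probs))
```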


2014, Vol. 5 (1), pp. 887-890
Author(s): Jun Li, Hongmei Zhang, Yuanjiang Liao, ...

2014, Vol. 2014, pp. 1-9
Author(s): Jing Zhang, Da Li, Weiwei Hu, Zhihua Chen, Yubo Yuan

Owing to the semantic gap between visual features and semantic concepts, automatic image annotation remains a difficult problem in computer vision. In this paper, we propose a new multilabel image annotation method based on double-layer probabilistic latent semantic analysis (PLSA). The double-layer PLSA model is constructed to bridge low-level visual features and high-level semantic concepts for effective image understanding. Low-level image features are represented as visual words with the Bag-of-Words model, and the first-layer PLSA extracts latent semantic topics separately from the visual and texture aspects. The second-layer PLSA then fuses the visual and texture topics into a top-layer latent semantic topic. In this way, the double-layer PLSA establishes the relationships between visual features and semantic concepts, allowing the labels of new images to be predicted from their low-level features. Experimental results on the standard Corel dataset demonstrate that the double-layer PLSA model achieves promising labeling performance and outperforms previous methods.
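As a rough illustration of the pipeline described above, the sketch below fits a small PLSA model with EM, runs it separately on visual-word and texture-word histograms (layer 1), and then runs it again on the concatenated layer-1 topic proportions as a stand-in for the fusion step (layer 2). The matrix sizes, topic counts, random toy data, and the pseudo-word treatment of layer-1 topics are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def plsa(X, n_topics, n_iter=50, seed=0):
    """Fit PLSA with EM on a non-negative document-word matrix X (docs x words).
    Returns P(z|d) with shape (docs, topics) and P(w|z) with shape (topics, words)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = X.shape
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        # E-step: responsibilities P(z|d,w) proportional to P(z|d) * P(w|z)
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]      # (docs, topics, words)
        joint /= joint.sum(axis=1, keepdims=True) + 1e-12
        weighted = X[:, None, :] * joint                   # n(d,w) * P(z|d,w)
        # M-step: re-estimate P(w|z) and P(z|d) from the responsibilities
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + 1e-12
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + 1e-12
    return p_z_d, p_w_z

# Layer 1: separate PLSA models on visual-word and texture-word histograms
# (random toy counts; real input would be BoW histograms extracted from images).
rng = np.random.default_rng(1)
visual_counts = rng.integers(0, 5, size=(30, 200)).astype(float)   # 30 images, 200 visual words
texture_counts = rng.integers(0, 5, size=(30, 120)).astype(float)  # 30 images, 120 texture words
z_visual, _ = plsa(visual_counts, n_topics=8)
z_texture, _ = plsa(texture_counts, n_topics=8)

# Layer 2: fuse the two per-image topic distributions and learn top-layer topics.
# Treating the concatenated layer-1 topic proportions as soft "pseudo-word" weights
# is one plausible reading of the fusion step, not necessarily the paper's recipe.
fused = np.hstack([z_visual, z_texture])
z_top, _ = plsa(fused, n_topics=5)
print(z_top.shape)  # (30, 5): top-layer topic distribution per image
```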

