Image Description Techniques

Continuous orthogonal moments, for which continuous functions are used as kernel functions, are invariant to rotation and scaling, and they have been greatly developed over the recent years. Among continuous orthogonal moments, polar harmonic Fourier moments (PHFMs) have superior performance and strong image description ability. In order to improve the performance of PHFMs in noise resistance and image reconstruction, PHFMs, which can only take integer numbers, are extended to fractional-order polar harmonic Fourier moments (FrPHFMs) in this paper. Firstly, the radial polynomials of integer-order PHFMs are modified to obtain fractional-order radial polynomials, and FrPHFMs are constructed based on the fractional-order radial polynomials; subsequently, the strong reconstruction ability, orthogonality, and geometric invariance of the proposed FrPHFMs are proven; and, finally, the performance of the proposed FrPHFMs is compared with that of integer-order PHFMs, fractional-order radial harmonic Fourier moments (FrRHFMs), fractional-order polar harmonic transforms (FrPHTs), and fractional-order Zernike moments (FrZMs). The experimental results show that the FrPHFMs constructed in this paper are superior to integer-order PHFMs and other fractional-order continuous orthogonal moments in terms of performance in image reconstruction and object recognition, as well as that the proposed FrPHFMs have strong image description ability and good stability.

Download Full-text

Chinese Image Captioning via Fuzzy Attention-based DenseNet-BiLSTM

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3422668 ◽

2021 ◽

Vol 17 (1s) ◽

pp. 1-18 ◽

Cited By ~ 1

Author(s):

Huimin Lu ◽

Rui Yang ◽

Zhenrong Deng ◽

Yonglin Zhang ◽

Guangwei Gao ◽

...

Keyword(s):

Feature Extraction ◽

Contextual Information ◽

Image Features ◽

Context Information ◽

Global Information ◽

Image Description ◽

Image Captioning ◽

Image Content ◽

Single Feature ◽

Proposed Model

Chinese image description generation tasks usually have some challenges, such as single-feature extraction, lack of global information, and lack of detailed description of the image content. To address these limitations, we propose a fuzzy attention-based DenseNet-BiLSTM Chinese image captioning method in this article. In the proposed method, we first improve the densely connected network to extract features of the image at different scales and to enhance the model’s ability to capture the weak features. At the same time, a bidirectional LSTM is used as the decoder to enhance the use of context information. The introduction of an improved fuzzy attention mechanism effectively improves the problem of correspondence between image features and contextual information. We conduct experiments on the AI Challenger dataset to evaluate the performance of the model. The results show that compared with other models, our proposed model achieves higher scores in objective quantitative evaluation indicators, including BLEU , BLEU , METEOR, ROUGEl, and CIDEr. The generated description sentence can accurately express the image content.

Download Full-text

Deep sequential fusion LSTM network for image description

Neurocomputing ◽

10.1016/j.neucom.2018.05.086 ◽

2018 ◽

Vol 312 ◽

pp. 154-164 ◽

Cited By ~ 12

Author(s):

Pengjie Tang ◽

Hanli Wang ◽

Sam Kwong

Keyword(s):

Image Description ◽

Lstm Network

Download Full-text

The MPEG-7 colour structure descriptor: image description using colour and local spatial information

Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205) ◽

10.1109/icip.2001.959134 ◽

2002 ◽

Cited By ~ 20

Author(s):

D.S. Messing ◽

P. van Beek ◽

J.H. Errico

Keyword(s):

Spatial Information ◽

Image Description ◽

Local Spatial Information ◽

Structure Descriptor

Download Full-text

Image description by Hierarchical Prioritised Fuzzy Systems

2009 IEEE International Conference on Computational Cybernetics (ICCC) ◽

10.1109/icccyb.2009.5393940 ◽

2009 ◽

Cited By ~ 1

Author(s):

Paulo Salgado ◽

Paulo Garrido

Keyword(s):

Fuzzy Systems ◽

Image Description

Download Full-text

Suggesting text alternatives for images in social media

ACM SIGACCESS Accessibility and Computing ◽

10.1145/3477315.3477318 ◽

2021 ◽

pp. 1-6

Author(s):

Letícia Seixas Pereira ◽

João Guerreiro ◽

André Rodrigues ◽

André Santos ◽

João Vicente ◽

...

Keyword(s):

Social Networks ◽

Social Media ◽

Shared Responsibility ◽

User Generated Content ◽

Media Content ◽

Android Application ◽

Image Description ◽

Visual Content ◽

Considerable Part ◽

The Web

Image description has been a recurrent topic on web accessibility over the years. With the increased use of social networks, this discussion is even more relevant. Social networks are responsible for a considerable part of the images available on the web. In this context, users are not only consuming visual content but also creating it. Due to this shared responsibility of providing accessible content, major platforms must go beyond accessible interfaces. Additional resources must also be available to support users in creating accessible content. Although many of today's services already support accessible media content authoring, current efforts still fail to properly integrate and guide their users through the authoring process. One of the consequences is that many users are still unaware of what an image description is, how to provide it, and why it is necessary. We present SONAAR, a project that aims to improve the accessibility of user-generated content on social networks. Our approach is to support the authoring and consumption of accessible social media content. Our prototypes currently focus on Twitter and Facebook and are available as an Android application and as a Chrome extension.

Download Full-text

Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description

IEEE Transactions on Image Processing ◽

10.1109/tip.2017.2733164 ◽

2018 ◽

Vol 27 (1) ◽

pp. 394-405 ◽

Cited By ~ 54

Author(s):

Ke Gu ◽

Vinit Jakhetiya ◽

Jun-Fei Qiao ◽

Xiaoli Li ◽

Weisi Lin ◽

...

Keyword(s):

Image Description ◽

Quality Metric ◽

Model Based ◽

Local Image

Download Full-text

Image Description Using Scale-Space Edge Pixel Directions Histogram

Advances in Semantic Media Adaptation and Personalization, Volume 2 ◽

10.1201/9781420076653-c15 ◽

2009 ◽

pp. 321-339 ◽

Cited By ~ 2

Author(s):

António Pinheiro

Keyword(s):

Scale Space ◽

Image Description ◽

Edge Pixel

Download Full-text

Energy analysis of image descriptors in texture classification

10.14293/s2199-1006.1.sor-.ppklvda.v1 ◽

2021 ◽

Author(s):

Faeze Kiani

Keyword(s):

Surface Defect ◽

Energy Analysis ◽

Texture Classification ◽

Medical Image Analysis ◽

Image Description ◽

Image Descriptors ◽

Discriminative Feature ◽

Surface Defect Detection ◽

Occurrence Matrix

Texture play important role in image description process. Texture classification is one of the problems which have been paid much attention on by computer vision scientists in last decade. If texture classification is done accurately, it can be used in many problems such as skin detection, surface defect detection, medical image analysis, gender identification, human identification, etc. Since now, many approaches are proposed to perform it. Most of them have tried to extract discriminative features to separate different texture types accurately. This paper has proposed an approach based on energy analysis of some efficient image descriptors such as median binary pattern, Local binary pattern and Gray Level Co-occurrence matrix. Next, by concatenating extracted features, a discriminative feature vector is defined. Finally, classifier is used to classify texture types. Although, this approach is a general one and is could be used in different applications. In the result part the proposed approach has been evaluated on some benchmark dataset. Next, the results have been compared with some of state-of-the-art approaches to prove the quality of the proposed approach.

Download Full-text

Image Description Techniques

A Fast Non-destructive Algorithm for Image Description Based on Improved Wavelet Moment Features

Invariant Image Representation Using Novel Fractional-Order Polar Harmonic Fourier Moments

Chinese Image Captioning via Fuzzy Attention-based DenseNet-BiLSTM

Deep sequential fusion LSTM network for image description

The MPEG-7 colour structure descriptor: image description using colour and local spatial information

Image description by Hierarchical Prioritised Fuzzy Systems

Suggesting text alternatives for images in social media

Model-Based Referenceless Quality Metric of 3D Synthesized Images Using Local Image Description

Image Description Using Scale-Space Edge Pixel Directions Histogram

Energy analysis of image descriptors in texture classification

Export Citation Format