image caption Latest Research Papers

Image Caption Generator using Siamese Graph Convolutional Networks and LSTM

10.1145/3493700.3493754 ◽

2022 ◽

Author(s):

Athul Kumar ◽

Aarchi Agrawal ◽

K S Ashin Shanly ◽

Sudip Das ◽

Nidhin Harilal

Keyword(s):

Convolutional Networks ◽

Image Caption

Enhanced Image Captioning with Color Recognition Using Deep Learning Methods

Applied Sciences ◽

10.3390/app12010209 ◽

2021 ◽

Vol 12 (1) ◽

pp. 209

Author(s):

Yeong-Hwa Chang ◽

Yen-Jen Chen ◽

Ren-Hung Huang ◽

Yi-Ting Yu

Keyword(s):

Object Detection ◽

Short Term Memory ◽

Color Analysis ◽

Image Captioning ◽

Short Term ◽

Term Memory ◽

Color Recognition ◽

Memory Network ◽

Long Short Term Memory ◽

Image Caption

Automatically describing the content of an image is an interesting and challenging task in artificial intelligence. In this paper, an enhanced image captioning model—including object detection, color analysis, and image captioning—is proposed to automatically generate the textual descriptions of images. In an encoder–decoder model for image captioning, VGG16 is used as an encoder and an LSTM (long short-term memory) network with attention is used as a decoder. In addition, Mask R-CNN with OpenCV is used for object detection and color analysis. The integration of the image caption and color recognition is then performed to provide better descriptive details of images. Moreover, the generated textual sentence is converted into speech. The validation results illustrate that the proposed method can provide more accurate description of images.

Visual sentiment analysis based on image caption and adjective–noun–pair description

Soft Computing ◽

10.1007/s00500-021-06530-6 ◽

2021 ◽

Author(s):

Zuhe Li ◽

Qian Sun ◽

Qingbing Guo ◽

Huaiguang Wu ◽

Lujuan Deng ◽

...

Keyword(s):

Sentiment Analysis ◽

Noun Pair ◽

Image Caption

Computational Intelligence for Image Caption Generation

10.1007/978-981-16-2641-8_25 ◽

2021 ◽

pp. 263-270

Author(s):

Sahil Garg ◽

Parv Dahiya

Keyword(s):

Computational Intelligence ◽

Image Caption Generation ◽

Image Caption

Image Caption Generator Using Deep Learning

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38652 ◽

2021 ◽

Vol 9 (10) ◽

pp. 1554-1564

Author(s):

A. V. N. Kameswari

Keyword(s):

Deep Learning ◽

Visually Impaired ◽

Short Term Memory ◽

Image Understanding ◽

Research Area ◽

Short Term ◽

Visually Impaired People ◽

Learning Techniques ◽

Long Short Term Memory ◽

Image Caption

Abstract: When humans see an image, their brain can easily tell what the image is about, but a computer cannot do it easily. Computer vision researchers worked on this a lot and they considered it impossible until now! With the advancement in Deep learning techniques, availability of huge datasets and computer power, we can build models that can generate captions for an image. Image Caption Generator is a popular research area of Deep Learning that deals with image understanding and a language description for that image. Generating well-formed sentences requires both syntactic and semantic understanding of the language. Being able to describe the content of an image using accurately formed sentences is a very challenging task, but it could also have a great impact, by helping visually impaired people better understand the content of images. The biggest challenge is most definitely being able to create a description that must capture not only the objects contained in an image, but also express how these objects relate to each other. This paper uses Flickr_8K dataset and Flickr8k_text folder that contains Flickr8k.token which is the main file of our dataset that contains image name and their respective caption separated by newline(“\n”). CNN is used for extracting features from the image. We will use the pre-trained model Xception. LSTM will use the information from CNN to help generate a description of the image. In our Flickr8k_text folder, we have Flickr_8k.trainImages.txt file that contains a list of 6000 images names that we will use for training. After CNN-LSTM model is defined we give an image file as parameter through command prompt for testing image caption generator and it generates the caption of an image and its accuracy is observed by calculating bleu score for generated and reference captions. Keywords: Image Caption Generator, Convolutional Neural Network, Long Short-Term Memory, Bleu score, Flickr_8K

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c6439.0910321 ◽

2021 ◽

Vol 10 (3) ◽

pp. 131-139

Author(s):

Kota Akshith Reddy ◽

◽

Satish C J ◽

Jahnavi Polsani ◽

Teja Naveen Chintapalli ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Training Data ◽

Image Captioning ◽

The Core ◽

Augmentation Techniques ◽

Evaluation Metric ◽

Learning Data ◽

Image Caption Generation ◽

Image Caption

Automatic Image Caption Generation is one of the core problems in the field of Deep Learning. Data Augmentation is a technique which helps in increasing the amount of data at hand and this is done by augmenting the training data using various techniques like flipping, rotating, Zooming, Brightening, etc. In this work, we create an Image Captioning model and check its robustness on all the major types of Image Augmentation techniques. The results show the fuzziness of the model while working with the same image but a different augmentation technique and because of this, a different caption is produced every time a different data augmentation technique is employed. We also show the change in the performance of the model after applying these augmentation techniques. Flickr8k dataset is used for this study along with BLEU score as the evaluation metric for the image captioning model.

A topic-based multi-channel attention model under hybrid mode for image caption

Neural Computing and Applications ◽

10.1007/s00521-021-06557-8 ◽

2021 ◽

Author(s):

Kui Qian ◽

Lei Tian

Keyword(s):

Hybrid Mode ◽

Attention Model ◽

Image Caption

Behavior Detection in Supermarket Based on Image Caption with Unity

10.1109/icce-tw52618.2021.9603079 ◽

2021 ◽

Author(s):

Li-Min Chen ◽

Wei-Liang Lin

Keyword(s):

Behavior Detection ◽

Image Caption

Image Caption Generation Using Neural Network Models and LSTM Hierarchical Structure

Computational Intelligence in Pattern Recognition - Advances in Intelligent Systems and Computing ◽

10.1007/978-981-16-2543-5_10 ◽

2021 ◽

pp. 109-117

Author(s):

Prachi M. Waghmare ◽

Swati V. Shinde

Keyword(s):

Neural Network ◽

Hierarchical Structure ◽

Network Models ◽

Neural Network Models ◽

Image Caption Generation ◽

Image Caption

A Rapid Review of Image Captioning

Journal of Information Technology and Computer Science ◽

10.25126/jitecs.202162316 ◽

2021 ◽

Vol 6 (2) ◽

pp. 158-169

Author(s):

Adriyendi Adriyendi

Keyword(s):

Process Model ◽

Language Model ◽

Human Vision ◽

Rapid Review ◽

Future Research ◽

Automatic Process ◽

Image Captioning ◽

Input Model ◽

Cross Language ◽

Image Caption

Image captioning is an automatic process for generating text based on the content observed in an image. We do review, create framework, and build application model. We review image captioning into 4 categories based on input model, process model, output model, and lingual image caption. Input model is based on criteria caption, method, and dataset. Process model is based on type of learning, encoder-decoder, image extractor, and metric evaluation. Output model based on architecture, features extraction, feature aping, model, and number of caption. Lingual image caption based on language model with 2 groups: bilingual image caption and cross-language image caption. We also design framework with 3 framework model. Furthermore, we also build application with 3 application models. We also provide research opinions on trends and future research that can be developed with image caption generation. Image captioning can be further developed on computer vision versusÂ human vision.

image caption
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Image Caption Generator using Siamese Graph Convolutional Networks and LSTM

Enhanced Image Captioning with Color Recognition Using Deep Learning Methods

Visual sentiment analysis based on image caption and adjective–noun–pair description

Computational Intelligence for Image Caption Generation

Image Caption Generator Using Deep Learning

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

A topic-based multi-channel attention model under hybrid mode for image caption

Behavior Detection in Supermarket Based on Image Caption with Unity

Image Caption Generation Using Neural Network Models and LSTM Hierarchical Structure

A Rapid Review of Image Captioning

Export Citation Format

image captionRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Image Caption Generator using Siamese Graph Convolutional Networks and LSTM

Enhanced Image Captioning with Color Recognition Using Deep Learning Methods

Visual sentiment analysis based on image caption and adjective–noun–pair description

Computational Intelligence for Image Caption Generation

Image Caption Generator Using Deep Learning

Analysis of the Fuzziness of Image Caption Generation Models due to Data Augmentation Techniques

A topic-based multi-channel attention model under hybrid mode for image caption

Behavior Detection in Supermarket Based on Image Caption with Unity

Image Caption Generation Using Neural Network Models and LSTM Hierarchical Structure

A Rapid Review of Image Captioning

image caption
Recently Published Documents