Text Retrieval
Recently Published Documents

Total documents: 773 (past five years: 95)
H-index: 42 (past five years: 3)

2022 · Vol 2022 · pp. 1-12
Author(s): Xiuye Yin, Liyong Chen

Given the complexity of multimodal environments and the inability of existing shallow network structures to achieve high-precision image and text retrieval, a cross-modal image-text retrieval method is proposed that combines efficient feature extraction with an interactive-learning convolutional autoencoder (CAE). First, the convolution kernels of a residual network are improved by incorporating two-dimensional principal component analysis (2DPCA) to extract image features, while text features are extracted through long short-term memory (LSTM) networks and word vectors, yielding efficient feature extraction for both modalities. Then, cross-modal retrieval of images and text is realized with an interactive-learning CAE: the image and text features are fed to the two input terminals of the dual-modal CAE, and an image-text relationship model is obtained through interactive learning in the middle layer, enabling image-text retrieval. Finally, the proposed method is evaluated on the Flickr30K, MSCOCO, and Pascal VOC 2007 datasets. The results show that the method performs accurate image retrieval and text retrieval: the mean average precision (MAP) exceeds 0.3, and the areas under the precision-recall (PR) curves are larger than those of the comparison methods, demonstrating the method's applicability.
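The abstract gives no implementation details for the dual-modal CAE, so the following is only a minimal PyTorch sketch of one plausible reading: two modality-specific encoders feed a shared latent (middle) layer, and "interactive learning" is approximated by cross-reconstruction between modalities. All names, dimensions, and loss choices (DualModalCAE, img_dim, shared_dim, MSE reconstruction) are illustrative assumptions, not the authors' code; the 2DPCA-refined residual image features and the LSTM/word-vector text features are assumed to be precomputed vectors.

```python
# Hypothetical sketch of a dual-input autoencoder with a shared middle layer.
# All architecture and loss choices are assumptions made for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualModalCAE(nn.Module):
    def __init__(self, img_dim=2048, txt_dim=300, shared_dim=256):
        super().__init__()
        # Modality-specific encoders map into a shared latent ("middle") layer.
        self.img_enc = nn.Sequential(nn.Linear(img_dim, 512), nn.ReLU(),
                                     nn.Linear(512, shared_dim))
        self.txt_enc = nn.Sequential(nn.Linear(txt_dim, 512), nn.ReLU(),
                                     nn.Linear(512, shared_dim))
        # Decoders reconstruct each modality from the shared code.
        self.img_dec = nn.Sequential(nn.Linear(shared_dim, 512), nn.ReLU(),
                                     nn.Linear(512, img_dim))
        self.txt_dec = nn.Sequential(nn.Linear(shared_dim, 512), nn.ReLU(),
                                     nn.Linear(512, txt_dim))

    def forward(self, img_feat, txt_feat):
        z_img = self.img_enc(img_feat)
        z_txt = self.txt_enc(txt_feat)
        # "Interactive learning" is approximated here by cross-reconstruction:
        # each modality's latent code must also reconstruct the other modality,
        # pushing the two codes toward a common representation.
        loss = (F.mse_loss(self.img_dec(z_img), img_feat)
                + F.mse_loss(self.txt_dec(z_txt), txt_feat)
                + F.mse_loss(self.img_dec(z_txt), img_feat)
                + F.mse_loss(self.txt_dec(z_img), txt_feat))
        return z_img, z_txt, loss

# Toy usage with random stand-ins for the precomputed features.
model = DualModalCAE()
img = torch.randn(8, 2048)  # e.g., 2DPCA-refined residual-network features
txt = torch.randn(8, 300)   # e.g., LSTM/word-vector text features
z_img, z_txt, loss = model(img, txt)
loss.backward()
```

Under this reading, retrieval would encode both modalities into the shared space and rank candidates by cosine similarity of their latent codes.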


2021
Author(s): Zhixian Zeng, Jianjun Cao, Guoquan Jiang, Nianfeng Weng, Yuxin Xu, ...

2021
Author(s): Jayaprakash Akula, Abhishek, Rishabh Dabral, Preethi Jyothi, Ganesh Ramakrishnan

2021
Author(s): Hongying Liu, Ruyi Luo, Fanhua Shang, Mantang Niu, Yuanyuan Liu

2021
Author(s): Sungkwon Choo, Seong Jong Ha, Joonsoo Lee

2021
Author(s): Manh-Duy Nguyen, Binh T. Nguyen, Cathal Gurrin

Conventional approaches to image-text retrieval mainly focus on indexing the visual objects appearing in pictures but ignore the interactions between these objects. Such object occurrences and interactions are equally useful and important in this field, as they are usually mentioned in the text. Scene graph representation is well suited to the image-text matching challenge and has obtained good results thanks to its ability to capture this inter-relationship information: both images and text are represented at the scene graph level, reformulating retrieval as a scene graph matching problem. In this paper, we introduce the Local and Global Scene Graph Matching (LGSGM) model, which enhances the state-of-the-art method by integrating an extra graph convolution network to capture the general information of a graph. Specifically, for a pair of scene graphs of an image and its caption, two separate models are used to learn the features of each graph's nodes and edges. A Siamese-structure graph convolution model is then employed to embed the graphs into vector form. Finally, we combine the graph level and the vector level to calculate the similarity of the image-text pair. Empirical experiments show that this combination of levels improves the performance of the baseline method, increasing recall by more than 10% on the Flickr30k dataset. Our implementation code can be found at https://github.com/m2man/LGSGM.
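Since the abstract computes similarity at two levels, a brief sketch may clarify how a local (node-level matching) score and a global (pooled graph vector) score can be blended. This is a minimal PyTorch illustration, not the LGSGM implementation (which is available at the GitHub link above): the dense single-matrix GCN, the mean-pooling readout, and the names SimpleGCNLayer, SiameseGraphEncoder, and combined_similarity are assumptions made for the sketch, and edge features are omitted.

```python
# Hypothetical sketch of blending local (node-level) and global (vector-level)
# graph similarity with a shared-weight (Siamese) GCN encoder.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleGCNLayer(nn.Module):
    """One graph-convolution step: aggregate neighbor features through a
    row-normalized adjacency matrix, then apply a linear transform."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):
        # x: (num_nodes, in_dim); adj: (num_nodes, num_nodes).
        return F.relu(self.lin(adj @ x))

class SiameseGraphEncoder(nn.Module):
    """Shared-weight encoder applied to both the image scene graph and the
    caption scene graph (the Siamese structure mentioned in the abstract)."""
    def __init__(self, node_dim=300, hidden_dim=256):
        super().__init__()
        self.gcn1 = SimpleGCNLayer(node_dim, hidden_dim)
        self.gcn2 = SimpleGCNLayer(hidden_dim, hidden_dim)

    def forward(self, x, adj):
        h = self.gcn2(self.gcn1(x, adj), adj)
        return h, h.mean(dim=0)  # node embeddings + pooled graph vector

def combined_similarity(enc, img_graph, txt_graph, alpha=0.5):
    """Blend the local node-matching score with the global vector score."""
    h_img, g_img = enc(*img_graph)
    h_txt, g_txt = enc(*txt_graph)
    # Local score: match each caption node to its most similar image node.
    sim = F.normalize(h_txt, dim=1) @ F.normalize(h_img, dim=1).T
    local = sim.max(dim=1).values.mean()
    # Global score: cosine similarity of the pooled graph vectors.
    global_ = F.cosine_similarity(g_img, g_txt, dim=0)
    return alpha * local + (1 - alpha) * global_

# Toy usage with random node features and self-loop adjacency matrices.
enc = SiameseGraphEncoder()
x_i, adj_i = torch.randn(5, 300), torch.eye(5)  # image scene graph
x_t, adj_t = torch.randn(3, 300), torch.eye(3)  # caption scene graph
score = combined_similarity(enc, (x_i, adj_i), (x_t, adj_t))
```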


2021 · Vol 58 (5) · pp. 102672
Author(s): Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun, ...

2021
Author(s): Jie Cao, Shengsheng Qian, Huaiwen Zhang, Quan Fang, Changsheng Xu
