text query
Recently Published Documents


TOTAL DOCUMENTS

53
(FIVE YEARS 25)

H-INDEX

4
(FIVE YEARS 1)

2022 ◽  
pp. 1-1
Author(s):  
Gongmian Wang ◽  
Xing Xu ◽  
Fumin Shen ◽  
Huimin Lu ◽  
Yanli Ji ◽  
...  
Keyword(s):  

2021 ◽  
Vol 11 (3-4) ◽  
pp. 1-34
Author(s):  
Chris Kim ◽  
Xiao Lin ◽  
Christopher Collins ◽  
Graham W. Taylor ◽  
Mohamed R. Amer

While the computer vision problem of searching for activities in videos is usually addressed by using discriminative models, their decisions tend to be opaque and difficult for people to understand. We propose a case study of a novel machine learning approach for generative searching and ranking of motion capture activities with visual explanation. Instead of directly ranking videos in the database given a text query, our approach uses a variant of Generative Adversarial Networks (GANs) to generate exemplars based on the query and uses them to search for the activity of interest in a large database. Our model is able to achieve comparable results to its discriminative counterpart, while being able to dynamically generate visual explanations. In addition to our searching and ranking method, we present an explanation interface that enables the user to successfully explore the model’s explanations and its confidence by revealing query-based, model-generated motion capture clips that contributed to the model’s decision. Finally, we conducted a user study with 44 participants to show that by using our model and interface, participants benefit from a deeper understanding of the model’s conceptualization of the search query. We discovered that the XAI system yielded a comparable level of efficiency, accuracy, and user-machine synchronization as its black-box counterpart, if the user exhibited a high level of trust for AI explanation.


2021 ◽  
Author(s):  
Tien-Phat Nguyen ◽  
Ba-Thinh Tran-Le ◽  
Xuan-Dang Thai ◽  
Tam V. Nguyen ◽  
Minh N. Do ◽  
...  
Keyword(s):  

2021 ◽  
pp. 108027
Author(s):  
Xiao Sun ◽  
Xiang Long ◽  
Dongliang He ◽  
Shilei Wen ◽  
Zhouhui Lian
Keyword(s):  

2021 ◽  
Vol 4 (1) ◽  
pp. 87-89
Author(s):  
Janardan Bhatta

Searching images in a large database is a major requirement in Information Retrieval Systems. Expecting image search results based on a text query is a challenging task. In this paper, we leverage the power of Computer Vision and Natural Language Processing in Distributed Machines to lower the latency of search results. Image pixel features are computed based on contrastive loss function for image search. Text features are computed based on the Attention Mechanism for text search. These features are aligned together preserving the information in each text and image feature. Previously, the approach was tested only in multilingual models. However, we have tested it in image-text dataset and it enabled us to search in any form of text or images with high accuracy.


Author(s):  
Shaily Malik ◽  
Poonam Bansal

The real-world data is multimodal and to classify them by machine learning algorithms, features of both modalities must be transformed into common latent space. The high dimensional common space transformation of features lose their locality information and susceptible to noise. This research article has dealt with this issue of a semantic autoencoder and presents a novel algorithm with distinct mapped features with locality preservation into a commonly hidden space. We call it discriminative regularized semantic autoencoder (DRSAE). It maintains the low dimensional features in the manifold to manage the inter and intra-modality of the data. The data has multi labels, and these are transformed into an aware feature space. Conditional Principal label space transformation (CPLST) is used for it. With the two-fold proposed algorithm, we achieve a significant improvement in text retrieval form image query and image retrieval from the text query.


2021 ◽  
pp. 605-614
Author(s):  
Gia H. Ngo ◽  
Minh Nguyen ◽  
Nancy F. Chen ◽  
Mert R. Sabuncu

Author(s):  
Udit Singhania ◽  
B. K. Tripathy

This chapter is mainly an advanced version of the previous version of the chapter named “An Insight to Deep Learning Architectures” in the encyclopedia. This chapter mainly focusses on giving the insights of information retrieval after the year 2014, as the earlier part has been discussed in the previous version. Deep learning plays an important role in today's era, and this chapter makes use of such deep learning architectures which have evolved over time and have proved to be efficient in image search/retrieval nowadays. In this chapter, various techniques to solve the problem of natural language processing to process text query are mentioned. Recurrent neural nets, deep restricted Boltzmann machines, general adversarial nets have been discussed seeing how they revolutionize the field of information retrieval.


Sign in / Sign up

Export Citation Format

Share Document