scholarly journals End-to-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering

Author(s):  
Youngjae Yu ◽  
Hyungjin Ko ◽  
Jongwook Choi ◽  
Gunhee Kim
2007 ◽  
Vol 58 (8) ◽  
pp. 1082-1099 ◽  
Author(s):  
Nina Wacholder ◽  
Diane Kelly ◽  
Paul Kantor ◽  
Robert Rittman ◽  
Ying Sun ◽  
...  

2020 ◽  
Vol 53 (7) ◽  
pp. 5429-5453
Author(s):  
José Wellington Franco da Silva ◽  
Amanda Drielly Pires Venceslau ◽  
Juliano Efson Sales ◽  
José Gilvan Rodrigues Maia ◽  
Vládia Célia Monteiro Pinheiro ◽  
...  

Author(s):  
Luowei Zhou ◽  
Yingbo Zhou ◽  
Jason J. Corso ◽  
Richard Socher ◽  
Caiming Xiong
Keyword(s):  

Author(s):  
Sebastian Blank ◽  
Florian Wilhelm ◽  
Hans-Peter Zorn ◽  
Achim Rettinger

Almost all of today’s knowledge is stored in databases and thus can only be accessed with the help of domain specific query languages, strongly limiting the number of people which can access the data. In this work, we demonstrate an end-to-end trainable question answering (QA) system that allows a user to query an external NoSQL database by using natural language. A major challenge of such a system is the non-differentiability of database operations which we overcome by applying policy-based reinforcement learning. We evaluate our approach on Facebook’s bAbI Movie Dialog dataset and achieve a competitive score of 84.2% compared to several benchmark models. We conclude that our approach excels with regard to real-world scenarios where knowledge resides in external databases and intermediate labels are too costly to gather for non-end-to-end trainable QA systems.


Author(s):  
Min-je Choi ◽  
Sehun Jeong ◽  
Hakjoo Oh ◽  
Jaegul Choo

Detecting buffer overruns from a source code is one of the most common and yet challenging tasks in program analysis. Current approaches based on rigid rules and handcrafted features are limited in terms of flexible applicability and robustness due to diverse bug patterns and characteristics existing in sophisticated real-world software programs. In this paper, we propose a novel, data-driven approach that is completely end-to-end without requiring any hand-crafted features, thus free from any program language-specific structural limitations. In particular, our approach leverages a recently proposed neural network model called memory networks that have shown the state-of-the-art performances mainly in question-answering tasks. Our experimental results using source code samples demonstrate that our proposed model is capable of accurately detecting different types of buffer overruns. We also present in-depth analyses on how a memory network can learn to understand the semantics in programming languages solely from raw source codes, such as tracing variables of interest, identifying numerical values, and performing their quantitative comparisons.


2021 ◽  
Author(s):  
Devendra Sachan ◽  
Mostofa Patwary ◽  
Mohammad Shoeybi ◽  
Neel Kant ◽  
Wei Ping ◽  
...  

Symmetry ◽  
2020 ◽  
Vol 12 (6) ◽  
pp. 992
Author(s):  
Akshay Aggarwal ◽  
Aniruddha Chauhan ◽  
Deepika Kumar ◽  
Mamta Mittal ◽  
Sudipta Roy ◽  
...  

Traditionally, searching for videos on popular streaming sites like YouTube is performed by taking the keywords, titles, and descriptions that are already tagged along with the video into consideration. However, the video content is not utilized for searching of the user’s query because of the difficulty in encoding the events in a video and comparing them to the search query. One solution to tackle this problem is to encode the events in a video and then compare them to the query in the same space. A method of encoding meaning to a video could be video captioning. The captioned events in the video can be compared to the query of the user, and we can get the optimal search space for the videos. There have been many developments over the course of the past few years in modeling video-caption generators and sentence embeddings. In this paper, we exploit an end-to-end video captioning model and various sentence embedding techniques that collectively help in building the proposed video-searching method. The YouCook2 dataset was used for the experimentation. Seven sentence embedding techniques were used, out of which the Universal Sentence Encoder outperformed over all the other six, with a median percentile score of 99.51. Thus, this method of searching, when integrated with traditional methods, can help improve the quality of search results.


Sign in / Sign up

Export Citation Format

Share Document