End-to-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering

Almost all of today’s knowledge is stored in databases and thus can only be accessed with the help of domain specific query languages, strongly limiting the number of people which can access the data. In this work, we demonstrate an end-to-end trainable question answering (QA) system that allows a user to query an external NoSQL database by using natural language. A major challenge of such a system is the non-differentiability of database operations which we overcome by applying policy-based reinforcement learning. We evaluate our approach on Facebook’s bAbI Movie Dialog dataset and achieve a competitive score of 84.2% compared to several benchmark models. We conclude that our approach excels with regard to real-world scenarios where knowledge resides in external databases and intermediate labels are too costly to gather for non-end-to-end trainable QA systems.

Download Full-text

End-to-End Prediction of Buffer Overruns from Raw Source Code via Neural Memory Networks

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/214 ◽

2017 ◽

Cited By ~ 4

Author(s):

Min-je Choi ◽

Sehun Jeong ◽

Hakjoo Oh ◽

Jaegul Choo

Keyword(s):

Programming Languages ◽

Program Analysis ◽

Question Answering ◽

Source Code ◽

Source Codes ◽

Program Language ◽

Proposed Model ◽

Challenging Tasks ◽

Data Driven Approach ◽

End To End

Detecting buffer overruns from a source code is one of the most common and yet challenging tasks in program analysis. Current approaches based on rigid rules and handcrafted features are limited in terms of flexible applicability and robustness due to diverse bug patterns and characteristics existing in sophisticated real-world software programs. In this paper, we propose a novel, data-driven approach that is completely end-to-end without requiring any hand-crafted features, thus free from any program language-specific structural limitations. In particular, our approach leverages a recently proposed neural network model called memory networks that have shown the state-of-the-art performances mainly in question-answering tasks. Our experimental results using source code samples demonstrate that our proposed model is capable of accurately detecting different types of buffer overruns. We also present in-depth analyses on how a memory network can learn to understand the semantics in programming languages solely from raw source codes, such as tracing variables of interest, identifying numerical values, and performing their quantitative comparisons.

Download Full-text

Expanding End-to-End Question Answering on Differentiable Knowledge Graphs with Intersection

10.18653/v1/2021.emnlp-main.694 ◽

2021 ◽

Author(s):

Priyanka Sen ◽

Armin Oliya ◽

Amir Saffari

Keyword(s):

Question Answering ◽

Knowledge Graphs ◽

End To End

Download Full-text

End-to-End Training of Neural Retrievers for Open-Domain Question Answering

10.18653/v1/2021.acl-long.519 ◽

2021 ◽

Author(s):

Devendra Sachan ◽

Mostofa Patwary ◽

Mohammad Shoeybi ◽

Neel Kant ◽

Wei Ping ◽

...

Keyword(s):

Question Answering ◽

Open Domain ◽

End To End

Download Full-text

End-to-End Representation Learning for Question Answering with Weak Supervision

Semantic Web Challenges - Communications in Computer and Information Science ◽

10.1007/978-3-319-69146-6_7 ◽

2017 ◽

pp. 70-83 ◽

Cited By ~ 6

Author(s):

Daniil Sorokin ◽

Iryna Gurevych

Keyword(s):

Question Answering ◽

Representation Learning ◽

Weak Supervision ◽

End To End

Download Full-text

End-to-End Video Captioning

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW) ◽

10.1109/iccvw.2019.00185 ◽

2019 ◽

Author(s):

Silvio Olivastri ◽

Gurkirt Singh ◽

Fabio Cuzzolin

Keyword(s):

Video Captioning ◽

End To End

Download Full-text

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings

Symmetry ◽

10.3390/sym12060992 ◽

2020 ◽

Vol 12 (6) ◽

pp. 992

Author(s):

Akshay Aggarwal ◽

Aniruddha Chauhan ◽

Deepika Kumar ◽

Mamta Mittal ◽

Sudipta Roy ◽

...

Keyword(s):

Search Space ◽

Video Content ◽

Video Captioning ◽

The Past ◽

Percentile Score ◽

Universal Sentence ◽

End To End ◽

Video Searching ◽

Searching Method

Traditionally, searching for videos on popular streaming sites like YouTube is performed by taking the keywords, titles, and descriptions that are already tagged along with the video into consideration. However, the video content is not utilized for searching of the user’s query because of the difficulty in encoding the events in a video and comparing them to the search query. One solution to tackle this problem is to encode the events in a video and then compare them to the query in the same space. A method of encoding meaning to a video could be video captioning. The captioned events in the video can be compared to the query of the user, and we can get the optimal search space for the videos. There have been many developments over the course of the past few years in modeling video-caption generators and sentence embeddings. In this paper, we exploit an end-to-end video captioning model and various sentence embedding techniques that collectively help in building the proposed video-searching method. The YouCook2 dataset was used for the experimentation. Seven sentence embedding techniques were used, out of which the Universal Sentence Encoder outperformed over all the other six, with a median percentile score of 99.51. Thus, this method of searching, when integrated with traditional methods, can help improve the quality of search results.

Download Full-text

End-to-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering

A model for quantitative evaluation of an end-to-end question-answering system

A short survey on end-to-end simple question answering systems

End-to-End Dense Video Captioning with Masked Transformer

Querying NoSQL with Deep Learning to Answer Natural Language Questions

End-to-End Prediction of Buffer Overruns from Raw Source Code via Neural Memory Networks

Expanding End-to-End Question Answering on Differentiable Knowledge Graphs with Intersection

End-to-End Training of Neural Retrievers for Open-Domain Question Answering

End-to-End Representation Learning for Question Answering with Weak Supervision

End-to-End Video Captioning

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings

Export Citation Format