Towards coherent natural language description of video streams

Author(s):  
Muhammad Usman Ghani Khan ◽  
Lei Zhang ◽  
Yoshihiko Gotoh
IEEE Access ◽  
2018 ◽  
Vol 6 ◽  
pp. 16639-16645 ◽  
Author(s):  
Aniqa Dilawari ◽  
Muhammad Usman Ghani Khan ◽  
Ammarah Farooq ◽  
Zahoor-Ur Rehman ◽  
Seungmin Rho ◽  
...  

Author(s):  
Md. Asifuzzaman Jishan ◽  
Khan Raqib Mahmud ◽  
Abul Kalam Al Azad

We presented a learning model that generated natural language descriptions of images. The model exploited the connections between natural language and visual data by producing text-line-based content from a given image. Our Hybrid Recurrent Neural Network model builds on Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Bi-directional Recurrent Neural Network (BRNN) models. We conducted experiments on three benchmark datasets: Flickr8K, Flickr30K, and MS COCO. Our hybrid model used an LSTM to encode text lines or sentences independently of object location and a BRNN for word representation; this reduced the computational complexity without compromising the accuracy of the descriptor. The model achieved better accuracy in retrieving natural language descriptions on these datasets.


This research presents original algorithms for integrating a linguistic processor into the solving of planimetric problems. The linguistic processor translates the natural language description of a problem into a semantic representation based on an ontology that supports the axiomatics of geometry. It also synthesizes natural-language comments on the solution and on the objects in the drawing. A method for interactive visualization of the linguistic processor's operation is proposed. The method provides step-by-step dialog control over the construction of the syntactic structure and its mapping to the semantic representation. In the experiments, several dozen standard syntactic structures were obtained that mapped correctly onto the semantic structures of the subject area. Directions for further research on developing the proposed approach are outlined.


2015 ◽  
Vol 303 ◽  
pp. 61-82 ◽  
Author(s):  
Muhammad Usman Ghani Khan ◽  
Nouf Al Harbi ◽  
Yoshihiko Gotoh
