Multimodal Video Description

Proceedings of the 2016 ACM on Multimedia Conference - MM '16 ◽

10.1145/2964284.2984066 ◽

2016 ◽

Author(s):

Vasili Ramanishka ◽

Abir Das ◽

Dong Huk Park ◽

Subhashini Venugopalan ◽

Lisa Anne Hendricks ◽

...

Keyword(s):

Video Description

SMPTE Centennial: Closed Captioning and Video Description—A Brief Historical Perspective

SMPTE Motion Imaging Journal ◽

10.5594/jmi.2016.2587107 ◽

2016 ◽

Vol 125 (6) ◽

pp. 112-115

Author(s):

Mark Turits

Keyword(s):

Historical Perspective ◽

Closed Captioning ◽

Video Description

Automated Video Description for Blind and Low Vision Users

Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems ◽

10.1145/3411763.3451810 ◽

2021 ◽

Author(s):

Aditya Bodi ◽

Pooyan Fazli ◽

Shasta Ihorn ◽

Yue-Ting Siu ◽

Andrew T Scott ◽

...

Keyword(s):

Video Description

Laparoscopic glissonean pedicle approach: step by step video description of the technique from different centres (with video)

Updates in Surgery ◽

10.1007/s13304-021-01219-9 ◽

2022 ◽

Author(s):

Benedetto Ielpo ◽

Antonio Giuliani ◽

Patricia Sanchez ◽

Fernando Burdio ◽

Mikel Gastaka ◽

...

Keyword(s):

Video Description ◽

Glissonean Pedicle

Transatrial Implantation of the Sapien 3 Heart Valve in Severe Mitral Annular Calcification: Multi-Clinic Experience, Written and Video Description

Structural Heart ◽

10.1080/24748706.2018.1536836 ◽

2018 ◽

Vol 3 (1) ◽

pp. 74-76

Author(s):

Serge Kobsa ◽

Robert A. Sorabella ◽

Kyle Eudailey ◽

Raymond Lee ◽

Michael Borger ◽

...

Keyword(s):

Heart Valve ◽

Mitral Annular Calcification ◽

Video Description ◽

Clinic Experience ◽

Sapien 3 ◽

Annular Calcification

Rotation-Invariant Image and Video Description With Local Binary Pattern Features

IEEE Transactions on Image Processing ◽

10.1109/tip.2011.2175739 ◽

2012 ◽

Vol 21 (4) ◽

pp. 1465-1477

Author(s):

Guoying Zhao ◽

T. Ahonen ◽

J. Matas ◽

M. Pietikainen

Keyword(s):

Local Binary Pattern ◽

Rotation Invariant ◽

Video Description

Automatic video description generation via LSTM with joint two-stream encoding

2016 23rd International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2016.7900081 ◽

2016 ◽

Author(s):

Chenyang Zhang ◽

Yingli Tian

Keyword(s):

Video Description

Video description of our technique for VATS sleeve lobectomy resection of LLL endobronchial lesion

ASVIDE ◽

10.21037/asvide.2018.749 ◽

2018 ◽

Vol 5 ◽

pp. 749-749

Author(s):

Edward D. Percy ◽

Carlyn McNeely ◽

Tamara Coffin ◽

Mark J. Kearns ◽

Ajmal Hafizi ◽

...

Keyword(s):

Sleeve Lobectomy ◽

Video Description ◽

Endobronchial Lesion

Video Description Based YouTube Comment Classification

Algorithms for Intelligent Systems - Applications of Artificial Intelligence in Engineering ◽

10.1007/978-981-33-4604-8_51 ◽

2021 ◽

pp. 667-678

Author(s):

Asha Shetty ◽

Bryan Abreo ◽

Adline D’Souza ◽

Akarsha Kondana ◽

Kavitha Mahesh Karimbi

Keyword(s):

Video Description

Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms

Applied Sciences ◽

10.3390/app10124312 ◽

2020 ◽

Vol 10 (12) ◽

pp. 4312

Author(s):

Jie Xu ◽

Haoliang Wei ◽

Linke Li ◽

Qiuru Fu ◽

Jinhong Guo

Keyword(s):

Neural Network ◽

Spatial Attention ◽

Semantic Information ◽

Attention Mechanism ◽

Visual Features ◽

Feature Maps ◽

Global Features ◽

Model Based ◽

Video Description ◽

Video Visualization

Video description plays an important role in the field of intelligent imaging technology. Attention mechanisms are widely applied in video description models based on deep learning, and most existing models use a temporal-spatial attention mechanism to improve accuracy. Temporal attention mechanisms can obtain the global features of a video, whereas spatial attention mechanisms obtain local features. However, because each channel of the convolutional neural network (CNN) feature maps carries its own spatial semantic information, it is insufficient to merely divide the CNN features into regions and then apply a spatial attention mechanism. In this paper, we propose a temporal-spatial and channel attention mechanism that enables the model to take advantage of various video features and ensures the consistency of visual features between sentence descriptions, improving the model's performance. To demonstrate the effectiveness of the attention mechanism, this paper also proposes a video visualization model based on the video description. Experimental results show that our model achieves good performance on the Microsoft Video Description (MSVD) dataset and some improvement on the Microsoft Research-Video to Text (MSR-VTT) dataset.

Étude Français-Water : vidéo description d’une procédure standard d’aquablation [French-Water study: video description of a standard aquablation procedure]

Progrès en Urologie ◽

10.1016/j.purol.2019.08.012 ◽

2019 ◽

Vol 29 (13) ◽

pp. 773

Author(s):

V. Misrai ◽

E. Rijo ◽

K. Zorn ◽

N. Barry Delongchamps ◽

A. Descazeaud

Keyword(s):

Video Description
