Hallucinating Optical Flow Features for Video Classification

Author(s):  
Yongyi Tang ◽  
Lin Ma ◽  
Lianqiang Zhou

Appearance and motion are two key components used to depict and characterize video content. Currently, two-stream models achieve state-of-the-art performance on video classification. However, extracting motion information, specifically in the form of optical flow features, is extremely computationally expensive, especially for large-scale video classification. In this paper, we propose a motion hallucination network, namely MoNet, to imagine optical flow features from appearance features, without relying on optical flow computation. Specifically, MoNet models the temporal relationships of the appearance features and exploits the contextual relationships of the optical flow features with concurrent connections. Extensive experimental results demonstrate that the proposed MoNet can effectively and efficiently hallucinate optical flow features, which, together with the appearance features, consistently improve video classification performance. Moreover, MoNet can cut almost half of the computational and data-storage burden of two-stream video classification. Our code is available at: https://github.com/YongyiTang92/MoNet-Features
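For intuition, a minimal sketch of such a hallucination module is shown below. The LSTM temporal model, feature dimensions, and regression loss are illustrative assumptions; the actual MoNet architecture is described in the paper and the linked repository.

```python
# Minimal sketch of a motion-hallucination module (hypothetical; the real
# MoNet architecture differs; see https://github.com/YongyiTang92/MoNet-Features).
import torch
import torch.nn as nn

class MoNetSketch(nn.Module):
    def __init__(self, appearance_dim=1024, flow_dim=1024, hidden_dim=512):
        super().__init__()
        # Temporal model over the appearance features (assumption: an LSTM).
        self.temporal = nn.LSTM(appearance_dim, hidden_dim, batch_first=True)
        # Project the temporal states to pseudo optical-flow features.
        self.project = nn.Linear(hidden_dim, flow_dim)

    def forward(self, appearance_feats):
        # appearance_feats: (batch, time, appearance_dim)
        states, _ = self.temporal(appearance_feats)
        # Hallucinated flow features: (batch, time, flow_dim)
        return self.project(states)

# Training target: regress toward precomputed optical-flow features so that,
# at inference time, optical flow extraction can be skipped entirely.
model = MoNetSketch()
appearance = torch.randn(2, 16, 1024)       # e.g. 16 frames of CNN features
real_flow_feats = torch.randn(2, 16, 1024)  # teacher features (training only)
loss = nn.functional.mse_loss(model(appearance), real_flow_feats)
```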

Author(s):  
Hehe Fan ◽  
Zhongwen Xu ◽  
Linchao Zhu ◽  
Chenggang Yan ◽  
Jianjun Ge ◽  
...  

We aim to significantly reduce the computational cost of classifying temporally untrimmed videos while retaining similar accuracy. Existing video classification methods sample frames at a predefined frequency over the entire video. In contrast, we propose an end-to-end deep reinforcement learning approach that enables an agent to classify videos by watching only a very small portion of frames, much as humans do. We make two main contributions. First, information is not distributed equally across video frames over time. An agent needs to watch more carefully when a clip is informative and skip frames that are redundant or irrelevant. The proposed approach enables the agent to adapt its sampling rate to the video content and skip most of the frames without loss of information. Second, the number of frames an agent must watch to reach a confident decision varies greatly from one video to another. We incorporate an adaptive stop network that measures a confidence score and generates a timely trigger to stop the agent from watching further, which improves efficiency without loss of accuracy. Our approach reduces the computational cost significantly on the large-scale YouTube-8M dataset, while the accuracy remains the same.
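A toy sketch of the watch/skip/stop loop described above follows. The GRU agent, skip head, stop head, and confidence threshold are placeholders for illustration, not the authors' implementation.

```python
# Toy sketch of an agent that adapts its sampling rate and stops early.
# All module names and the threshold are illustrative assumptions.
import torch
import torch.nn as nn

class Agent(nn.Module):
    def __init__(self, feat_dim=256, n_classes=10, max_skip=8):
        super().__init__()
        self.rnn = nn.GRUCell(feat_dim, 256)
        self.classifier = nn.Linear(256, n_classes)
        self.skip_head = nn.Linear(256, max_skip)  # how many frames to jump
        self.stop_head = nn.Linear(256, 1)         # confidence to stop watching

def classify_video(agent, frame_feats, stop_threshold=0.9):
    """frame_feats: (num_frames, feat_dim) precomputed per-frame features.
    Assumes the video has at least one frame."""
    h = torch.zeros(1, 256)
    t = 0
    while t < frame_feats.size(0):
        h = agent.rnn(frame_feats[t:t + 1], h)
        logits = agent.classifier(h)
        # Adaptive stop: trigger once the confidence score is high enough.
        if torch.sigmoid(agent.stop_head(h)).item() > stop_threshold:
            break
        # Adaptive sampling: skip 1..max_skip frames based on content.
        t += 1 + agent.skip_head(h).argmax().item()
    return logits.argmax(dim=1)
```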


2019 ◽  
Vol 29 (01) ◽  
pp. 2050006 ◽  
Author(s):  
Qiuyu Li ◽  
Jun Yu ◽  
Toru Kurihara ◽  
Haiyan Zhang ◽  
Shu Zhan

A micro-expression is a brief, involuntary facial movement; it indicates that a person is consciously hiding their true emotion. Micro-expression recognition has various potential applications in public security and clinical medicine. Research focuses on automatic micro-expression recognition because micro-expressions are hard for people to recognize unaided. This research proposes a novel algorithm for automatic micro-expression recognition that combines a deep multi-task convolutional network for detecting facial landmarks with a fused deep convolutional network for estimating the optical flow features of the micro-expression. First, the deep multi-task convolutional network detects facial landmarks, together with manifold-related tasks, to divide the facial region. Then, the fused convolutional network extracts optical flow features from the facial regions that contain muscle changes when a micro-expression appears. Because each video clip has many frames, the original optical flow features of the whole clip are high-dimensional and redundant, so the method revises them to reduce the redundant dimensions. Finally, the revised optical flow features refine the feature information, and a support vector machine classifier recognizes the micro-expression. The main contributions of this work are combining the deep multi-task learning network with the fused optical flow network for micro-expression recognition, and revising the optical flow features to reduce redundant dimensions. Experiments on two spontaneous micro-expression databases show that the method achieves competitive performance in micro-expression recognition.
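The final stage of this pipeline, reducing the high-dimensional flow features and classifying with an SVM, might look like the sketch below. PCA stands in for the paper's feature-revision step, which is an assumption; the paper defines its own revision procedure.

```python
# Sketch of the final recognition stage: reduce the high-dimensional
# per-clip optical-flow features, then classify with an SVM.
# PCA is an assumed stand-in for the paper's revision step.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5000))   # flow features: 200 clips x 5000 dims (toy)
y = rng.integers(0, 3, size=200)   # 3 micro-expression classes (toy labels)

clf = make_pipeline(
    StandardScaler(),
    PCA(n_components=50),          # drop redundant dimensions
    SVC(kernel="linear"),
)
clf.fit(X[:150], y[:150])
print("accuracy:", clf.score(X[150:], y[150:]))
```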


2020 ◽  
Vol 14 (13) ◽  
pp. 1845-1854
Author(s):  
Yuan Hu ◽  
Hubert P. H. Shum ◽  
Edmond S. L. Ho

Author(s):  
Hongwei Ren ◽  
Chenghao Li ◽  
Xinyi Zhang ◽  
Chenchen Ding ◽  
Changhai Man ◽  
...  

Sensors ◽  
2019 ◽  
Vol 19 (23) ◽  
pp. 5142 ◽  
Author(s):  
Dong Liang ◽  
Jiaxing Pan ◽  
Han Sun ◽  
Huiyu Zhou

Foreground detection is an important theme in video surveillance. Conventional background modeling approaches build sophisticated temporal statistical models to detect the foreground from low-level features, whereas modern semantic/instance segmentation approaches generate high-level foreground annotations but ignore the temporal relevance among consecutive frames. In this paper, we propose a Spatio-Temporal Attention Model (STAM) for cross-scene foreground detection. To fill the semantic gap between low-level and high-level features, appearance and optical flow features are synthesized by attention modules during feature learning. Experimental results on the CDnet 2014 benchmark validate the model, which outperforms many state-of-the-art methods on seven evaluation metrics. With the attention modules and optical flow, the F-measure increases by 9% and 6%, respectively. Without any tuning, the model demonstrates cross-scene generalization on the Wallflower and PETS datasets. The processing speed is 10.8 fps at a frame size of 256 × 256.
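A schematic of attention-weighted fusion of appearance and optical-flow feature maps follows; the gating design is a generic assumption rather than the exact STAM modules from the paper.

```python
# Schematic attention fusion of appearance and optical-flow feature maps.
# The gating design is a generic assumption, not the exact STAM modules.
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        # Predict a per-pixel weight from the concatenated streams.
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, 1, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, appearance, flow):
        # appearance, flow: (batch, channels, H, W)
        a = self.gate(torch.cat([appearance, flow], dim=1))
        return a * appearance + (1 - a) * flow  # attention-weighted blend

fusion = AttentionFusion()
fused = fusion(torch.randn(1, 64, 64, 64), torch.randn(1, 64, 64, 64))
```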

