video scene Latest Research Papers

Abstract Inspired by the wide application of transformer in computer vision and its excellent ability in temporal feature learning. This paper proposes a novel and efficient spatio-temporal residual attention network for student action recognition in classroom teaching video. It first fuses 2D spatial convolution and 1D temporal convolution to study spatio-temporal feature, then combines the powerful Reformer to better study the deeper spatio-temporal characteristics with visual significance of student classroom action. Based on the spatio-temporal residual attention network, a single person action recognition model in classroom teaching video is proposed. Considering that there are often multiple students in the classroom video scene, on the basis of single person action recognition, combined with object detection and tracking technology, the association of temporal and spatial characteristics of the same student targets is established, so as to realize the multi-student action recognition in classroom video scene. The experimental results on classroom teaching video dataset and public video dataset show that the proposed model achieves higher action recognition performance than the existing excellent models and methods.

Download Full-text

Video Scene Information Detection Based on Entity Recognition

Wireless Communications and Mobile Computing ◽

10.1155/2021/1020044 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Hui Qian ◽

Mengxuan Dai ◽

Yong Ma ◽

Jiale Zhao ◽

Qinghua Liu ◽

...

Keyword(s):

Video Data ◽

Entity Recognition ◽

Original Video ◽

Video Scene ◽

Strong Relation ◽

Scene Information ◽

Information Detection ◽

High Level ◽

Situational Information ◽

Surveillance Analysis

Video situational information detection is widely used in the fields of video query, character anomaly detection, surveillance analysis, and so on. However, most of the existing researches pay much attention to the subject or video backgrounds, but little attention to the recognition of situational information. What is more, because there is no strong relation between the pixel information and the scene information of video data, it is difficult for computers to obtain corresponding high-level scene information through the low-level pixel information of video data. Video scene information detection is mainly to detect and analyze the multiple features in the video and mark the scenes in the video. It is aimed at automatically extracting video scene information from all kinds of original video data and realizing the recognition of scene information through “comprehensive consideration of pixel information and spatiotemporal continuity.” In order to solve the problem of transforming pixel information into scene information, this paper proposes a video scene information detection method based on entity recognition. This model integrates the spatiotemporal relationship between the video subject and object on the basis of entity recognition, so as to realize the recognition of scene information by establishing mapping relation. The effectiveness and accuracy of the model are verified by simulation experiments with the TV series as experimental data. The accuracy of this model in the simulation experiment can reach more than 85%.

Download Full-text

S3-Net: A Fast Scene Understanding Network by Single-Shot Segmentation for Autonomous Driving

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3470660 ◽

2021 ◽

Vol 12 (5) ◽

pp. 1-19

Author(s):

Yuan Cheng ◽

Yuchao Yang ◽

Hai-Bao Chen ◽

Ngai Wong ◽

Hao Yu

Keyword(s):

Scene Understanding ◽

Autonomous Driving ◽

Single Shot ◽

Shot Segmentation ◽

Video Scene ◽

Temporal Model ◽

Spatio Temporal ◽

3D Cnn ◽

Time Segmentation ◽

Scene Features

Real-time segmentation and understanding of driving scenes are crucial in autonomous driving. Traditional pixel-wise approaches extract scene information by segmenting all pixels in a frame, and hence are inefficient and slow. Proposal-wise approaches only learn from the proposed object candidates, but still require multiple steps on the expensive proposal methods. Instead, this work presents a fast single-shot segmentation strategy for video scene understanding. The proposed net, called S3-Net, quickly locates and segments target sub-scenes , and meanwhile extracts attention-aware time-series sub-scene features ( ats-features ) as inputs to an attention-aware spatio-temporal model (ASM) . Utilizing tensorization and quantization techniques, S3-Net is intended to be lightweight for edge computing. Experiments results on CityScapes, UCF11, HMDB51, and MOMENTS datasets demonstrate that the proposed S3-Net achieves an accuracy improvement of 8.1% versus the 3D-CNN based approach on UCF11, a storage reduction of 6.9× and an inference speed of 22.8 FPS on CityScapes with a GTX1080Ti GPU.

Download Full-text

The Efficacy of Collaborative Authoring of Video Scene Descriptions

10.1145/3441852.3471201 ◽

2021 ◽

Author(s):

Rosiana Natalie ◽

Jolene Loh ◽

Huei Suen Tan ◽

Joshua Tseng ◽

Ian Luke Yi-Ren Chan ◽

...

Keyword(s):

Video Scene

Download Full-text

Multi-Camera Video Scene Graphs for Surveillance Videos Indexing and Retrieval

10.1109/icip42928.2021.9506713 ◽

2021 ◽

Author(s):

Toshal Patel ◽

Alvin Yan Hong Yao ◽

Yu Qiang ◽

Wei Tsang Ooi ◽

Roger Zimmermann

Keyword(s):

Surveillance Videos ◽

Indexing And Retrieval ◽

Video Scene

Download Full-text

Subtitle Generation and Video Scene Indexing using Recurrent Neural Networks

10.1109/icirca51532.2021.9544837 ◽

2021 ◽

Author(s):

Sajjan Kiran ◽

Umesh Patil ◽

P Siddarth Shankar ◽

Poonam Ghuli

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Video Scene

Download Full-text

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

10.1109/cvpr46437.2021.00412 ◽

2021 ◽

Author(s):

Jiaxu Miao ◽

Yunchao Wei ◽

Yu Wu ◽

Chen Liang ◽

Guangrui Li ◽

...

Keyword(s):

Large Scale ◽

Scene Parsing ◽

Large Scale Dataset ◽

Video Scene ◽

In The Wild

Download Full-text

Video scene change detection based on histogram analysis for hiding message

Journal of Physics Conference Series ◽

10.1088/1742-6596/1918/4/042141 ◽

2021 ◽

Vol 1918 (4) ◽

pp. 042141

Author(s):

M Fuad ◽

F Ernawan ◽

L J Hui

Keyword(s):

Change Detection ◽

Histogram Analysis ◽

Scene Change ◽

Scene Change Detection ◽

Video Scene

Download Full-text

Using Sensor Network in Motion Detection Based on Deep Full Convolutional Network Model

Complexity ◽

10.1155/2021/3909522 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Qichang Xu

Keyword(s):

Sensor Network ◽

Target Detection ◽

Network Model ◽

Motion Detection ◽

Video Frame ◽

Detection Accuracy ◽

Moving Target ◽

Moving Target Detection ◽

Convolutional Network ◽

Video Scene

Aiming at the shortcomings of traditional moving target detection methods in complex scenes such as low detection accuracy and high complexity, and not considering the overall structure information of the video frame image, this paper proposes a moving-target detection based on sensor network. First, a low-power motion detection wireless sensor network node is designed to obtain motion detection information in real time. Secondly, the background of the video scene is quickly extracted by the time domain averaging method, and the video sequence and the background image are channel-merged to construct a deep full convolutional network model. Finally, the network model is used to learn the deep features of the video scene and output the pixel-level classification results to achieve moving target detection. This method not only can adapt to complex video scenes of different sizes but also has a simple background extraction method, which effectively improves the detection speed.

Download Full-text

video scene
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Kernel-Induced Possibilistic Fuzzy Associate Background Subtraction for Video Scene

Spatio-temporal Attention Network for Student Action Recognition in Classroom Teaching Videos

Video Scene Information Detection Based on Entity Recognition

S3-Net: A Fast Scene Understanding Network by Single-Shot Segmentation for Autonomous Driving

The Efficacy of Collaborative Authoring of Video Scene Descriptions

Multi-Camera Video Scene Graphs for Surveillance Videos Indexing and Retrieval

Subtitle Generation and Video Scene Indexing using Recurrent Neural Networks

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Video scene change detection based on histogram analysis for hiding message

Using Sensor Network in Motion Detection Based on Deep Full Convolutional Network Model

Export Citation Format

video sceneRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Kernel-Induced Possibilistic Fuzzy Associate Background Subtraction for Video Scene

Spatio-temporal Attention Network for Student Action Recognition in Classroom Teaching Videos

Video Scene Information Detection Based on Entity Recognition

S3-Net: A Fast Scene Understanding Network by Single-Shot Segmentation for Autonomous Driving

The Efficacy of Collaborative Authoring of Video Scene Descriptions

Multi-Camera Video Scene Graphs for Surveillance Videos Indexing and Retrieval

Subtitle Generation and Video Scene Indexing using Recurrent Neural Networks

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Video scene change detection based on histogram analysis for hiding message

Using Sensor Network in Motion Detection Based on Deep Full Convolutional Network Model

video scene
Recently Published Documents