Content-based retrieval of video data by the grammar of film

Author(s):  
A. Yoshitaka ◽  
T. Ishii ◽  
M. Hirakawa ◽  
T. Ichikawa
Author(s):  
Waleed E. Farag ◽  
Hussein Abdel-Wahab

The increasing use of multimedia streams nowadays necessitates the development of efficient and effective methodologies and systems for manipulating databases storing these streams. These systems have various areas of application such as video-on-demand and digital libraries. The importance of video content-based retrieval (CBR) systems motivates us to explain their basic components in this chapter and shed light on their underlying working principles. In general, a content-based retrieval system of video data consists of the following four stages: (1) Video Shot Boundary Detection, (2) Key Frames (KFs) selection, (3) features extraction (from selected KFs), and (4) retrieval stage (where similarity matching operations are performed). Each one of the above stages will be reviewed and expounded based on our experience in building a Video Content-based Retrieval (VCR) system that has been fully implemented from scratch in JAVA Language (2002). Moreover, current research directions and outstanding problems will be discussed for each stage in the context of our VCR system.


2020 ◽  
Vol 39 (6) ◽  
pp. 8927-8935
Author(s):  
Bing Zheng ◽  
Dawei Yun ◽  
Yan Liang

Under the impact of COVID-19, research on behavior recognition are highly needed. In this paper, we combine the algorithm of self-adaptive coder and recurrent neural network to realize the research of behavior pattern recognition. At present, most of the research of human behavior recognition is focused on the video data, which is based on the video number. At the same time, due to the complexity of video image data, it is easy to violate personal privacy. With the rapid development of Internet of things technology, it has attracted the attention of a large number of experts and scholars. Researchers have tried to use many machine learning methods, such as random forest, support vector machine and other shallow learning methods, which perform well in the laboratory environment, but there is still a long way to go from practical application. In this paper, a recursive neural network algorithm based on long and short term memory (LSTM) is proposed to realize the recognition of behavior patterns, so as to improve the accuracy of human activity behavior recognition.


2020 ◽  
pp. 1-12
Author(s):  
Hu Jingchao ◽  
Haiying Zhang

The difficulty in class student state recognition is how to make feature judgments based on student facial expressions and movement state. At present, some intelligent models are not accurate in class student state recognition. In order to improve the model recognition effect, this study builds a two-level state detection framework based on deep learning and HMM feature recognition algorithm, and expands it as a multi-level detection model through a reasonable state classification method. In addition, this study selects continuous HMM or deep learning to reflect the dynamic generation characteristics of fatigue, and designs random human fatigue recognition experiments to complete the collection and preprocessing of EEG data, facial video data, and subjective evaluation data of classroom students. In addition to this, this study discretizes the feature indicators and builds a student state recognition model. Finally, the performance of the algorithm proposed in this paper is analyzed through experiments. The research results show that the algorithm proposed in this paper has certain advantages over the traditional algorithm in the recognition of classroom student state features.


2020 ◽  
Vol 2020 (4) ◽  
pp. 116-1-116-7
Author(s):  
Raphael Antonius Frick ◽  
Sascha Zmudzinski ◽  
Martin Steinebach

In recent years, the number of forged videos circulating on the Internet has immensely increased. Software and services to create such forgeries have become more and more accessible to the public. In this regard, the risk of malicious use of forged videos has risen. This work proposes an approach based on the Ghost effect knwon from image forensics for detecting forgeries in videos that can replace faces in video sequences or change the mimic of a face. The experimental results show that the proposed approach is able to identify forgery in high-quality encoded video content.


2019 ◽  
Vol 85 (6) ◽  
pp. 53-63 ◽  
Author(s):  
I. E. Vasil’ev ◽  
Yu. G. Matvienko ◽  
A. V. Pankov ◽  
A. G. Kalinin

The results of using early damage diagnostics technique (developed in the Mechanical Engineering Research Institute of the Russian Academy of Sciences (IMASH RAN) for detecting the latent damage of an aviation panel made of composite material upon bench tensile tests are presented. We have assessed the capabilities of the developed technique and software regarding damage detection at the early stage of panel loading in conditions of elastic strain of the material using brittle strain-sensitive coating and simultaneous crack detection in the coating with a high-speed video camera “Video-print” and acoustic emission system “A-Line 32D.” When revealing a subsurface defect (a notch of the middle stringer) of the aviation panel, the general concept of damage detection at the early stage of loading in conditions of elastic behavior of the material was also tested in the course of the experiment, as well as the software specially developed for cluster analysis and classification of detected location pulses along with the equipment and software for simultaneous recording of video data flows and arrays of acoustic emission (AE) data. Synchronous recording of video images and AE pulses ensured precise control of the cracking process in the brittle strain-sensitive coating (tensocoating)at all stages of the experiment, whereas the use of structural-phenomenological approach kept track of the main trends in damage accumulation at different structural levels and identify the sources of their origin when classifying recorded AE data arrays. The combined use of oxide tensocoatings and high-speed video recording synchronized with the AE control system, provide the possibility of definite determination of the subsurface defect, reveal the maximum principal strains in the area of crack formation, quantify them and identify the main sources of AE signals upon monitoring the state of the aviation panel under loading P = 90 kN, which is about 12% of the critical load.


Sign in / Sign up

Export Citation Format

Share Document