Fast Object Detection in H264/AVC and HEVC Compressed Domains for Video Surveillance

Author(s):  
Sami Jaballah ◽  
Mohamed-Chaker Larabi
Author(s):  
Yuefeng Wang ◽  
Kuang Mao ◽  
Tong Chen ◽  
Yanglong Yin ◽  
Shuibing He ◽  
...  

Author(s):  
Jie Xu

Abstract Recent advances in the field of object detection and face recognition have made it possible to develop practical video surveillance systems with embedded object detection and face recognition functionalities that are accurate and fast enough for commercial uses. In this paper, we compare some of the latest approaches to object detection and face recognition and provide reasons why they may or may not be amongst the best to be used in video surveillance applications in terms of both accuracy and speed. It is discovered that Faster R-CNN with Inception ResNet V2 is able to achieve some of the best accuracies while maintaining real-time rates. Single Shot Detector (SSD) with MobileNet, on the other hand, is incredibly fast and still accurate enough for most applications. As for face recognition, FaceNet with Multi-task Cascaded Convolutional Networks (MTCNN) achieves higher accuracy than advances such as DeepFace and DeepID2+ while being faster. An end-to-end video surveillance system is also proposed which could be used as a starting point for more complex systems. Various experiments have also been attempted on trained models with observations explained in detail. We finish by discussing video object detection and video salient object detection approaches which could potentially be used as future improvements to the proposed system.


2020 ◽  
Vol 194 ◽  
pp. 105590 ◽  
Author(s):  
Francisco Pérez-Hernández ◽  
Siham Tabik ◽  
Alberto Lamas ◽  
Roberto Olmos ◽  
Hamido Fujita ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document