Surveillance video anomaly detection via non-local U-Net frame prediction

Author(s):  
Qianqian Zhang ◽  
Guorui Feng ◽  
Hanzhou Wu
Author(s):  
Jie Wu ◽  
Wei Zhang ◽  
Guanbin Li ◽  
Wenhao Wu ◽  
Xiao Tan ◽  
...  

In this paper, we introduce a novel task, referred to as Weakly-Supervised Spatio-Temporal Anomaly Detection (WSSTAD) in surveillance video. Specifically, given an untrimmed video, WSSTAD aims to localize a spatio-temporal tube (i.e., a sequence of bounding boxes at consecutive times) that encloses the abnormal event, with only coarse video-level annotations as supervision during training. To address this challenging task, we propose a dual-branch network which takes as input the proposals with multi-granularities in both spatial-temporal domains. Each branch employs a relationship reasoning module to capture the correlation between tubes/videolets, which can provide rich contextual information and complex entity relationships for the concept learning of abnormal behaviors. Mutually-guided Progressive Refinement framework is set up to employ dual-path mutual guidance in a recurrent manner, iteratively sharing auxiliary supervision information across branches. It impels the learned concepts of each branch to serve as a guide for its counterpart, which progressively refines the corresponding branch and the whole framework. Furthermore, we contribute two datasets, i.e., ST-UCF-Crime and STRA, consisting of videos containing spatio-temporal abnormal annotations to serve as the benchmarks for WSSTAD. We conduct extensive qualitative and quantitative evaluations to demonstrate the effectiveness of the proposed approach and analyze the key factors that contribute more to handle this task.


Sensors ◽  
2021 ◽  
Vol 21 (12) ◽  
pp. 3993
Author(s):  
Mohammad Ibrahim Sarker ◽  
Cristina Losada-Gutiérrez ◽  
Marta Marrón-Romera ◽  
David Fuentes-Jiménez ◽  
Sara Luengo-Sánchez

Surveillance cameras are being installed in many primary daily living places to maintain public safety. In this video-surveillance context, anomalies occur only for a very short time, and very occasionally. Hence, manual monitoring of such anomalies may be exhaustive and monotonous, resulting in a decrease in reliability and speed in emergency situations due to monitor tiredness. Within this framework, the importance of automatic detection of anomalies is clear, and, therefore, an important amount of research works have been made lately in this topic. According to these earlier studies, supervised approaches perform better than unsupervised ones. However, supervised approaches demand manual annotation, making dependent the system reliability of the different situations used in the training (something difficult to set in anomaly context). In this work, it is proposed an approach for anomaly detection in video-surveillance scenes based on a weakly supervised learning algorithm. Spatio-temporal features are extracted from each surveillance video using a temporal convolutional 3D neural network (T-C3D). Then, a novel ranking loss function increases the distance between the classification scores of anomalous and normal videos, reducing the number of false negatives. The proposal has been evaluated and compared against state-of-art approaches, obtaining competitive performance without fine-tuning, which also validates its generalization capability. In this paper, the proposal design and reliability is presented and analyzed, as well as the aforementioned quantitative and qualitative evaluation in-the-wild scenarios, demonstrating its high sensitivity in anomaly detection in all of them.


2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Jiangfan Feng ◽  
Yukun Liang ◽  
Lin Li

The growing interest in deep learning approaches to video surveillance raises concerns about the accuracy and efficiency of neural networks. However, fast and reliable detection of abnormal events is still a challenging work. Here, we introduce a two-stream approach that offers an autoencoder-based structure for fast and efficient detection to facilitate anomaly detection from surveillance video without labeled abnormal events. Furthermore, we present post hoc interpretability of feature map visualization to show the process of feature learning, revealing uncertain and ambiguous decision boundaries in the video sequence. Experimental results on Avenue, UCSD Ped2, and Subway datasets show that our method can detect abnormal events well and explain the internal logic of the model at the object level.


Conventional static surveillance has proved to be quite ineffective as the huge number of cameras to keep an eye on most often outstrips the monitor’s ability to do so. Furthermore, the amount of focus needed to constantly monitor the surveillance video cameras is often overbearing. The review paper focuses on solving the problem of anomaly detection in video sequence through semi-supervised techniques. Each video is defined as sequence of frames. The model is trained with goal to minimize the reconstruction error which later on is used to detect anomaly in the test sample videos. The model was trained and tested on most commonly used benchmarking datasetAvenue dataset. Experiment results confirm that the model detects anomaly in a video with a reasonably good accuracy in presence of some noise in dataset


2009 ◽  
Author(s):  
Chen Change Loy ◽  
Tao Xiang ◽  
Shaogang Gong

Anomaly detection in automated surveillance video is an extremely monotonous process for monitoring for crowded scenes and surveillance videos are capable to incarcerate a mixture of sensible anomalies. An appropriate machine learning technique can help to train the Anomaly Detection System (ADS) in identifying anomalous activities during surveillance. To this end, we present an anomaly detection system that can be used as a tool for anomaly detection in surveillance videos using the concept of artificial intelligence. The main intention of the proposed anomaly detection system is to improve the detection time and accuracy by using the concept of Convolutional Neural Network (CNN) as artificial intelligence technique. In this paper we present a CNN based Anomaly Detection System (CNN-ADS), which is the combination of multiple layer of hidden unit with the optimized MSER feature by using Genetic Algorithm (GA). Here CNN is used for classifying the activity into normal and abnormal from the surveillance videos based on the fitness function of GA which is used for the selection of optimal MSER feature sets. Further, Self adaptive genetic algorithm (SAGA) is adopted to efficiently solve optimization problems in the continuous search domain to select the best possible feature to segregate the pattern of normal and abnormal activities. The main contribution of this research is validation of proposed system for the large scale data and we introduce a new large-scale dataset of 128 hours of videos. Dataset consists of 1900 long and untrimmed real-world surveillance videos, with 13 sensible anomalies such as road accident, burglary, fighting, robbery, etc. as well as normal activities. The experimental results of the planned system show that our CNN-ADS for anomaly detection achieve essential improvement on anomaly detection presentation as compared to the state-of-the-art approaches. The dataset is available at: https://webpages.uncc.edu/cchen62/dataset.html. In this paper, to validate the proposed ADS we provide the comparison of existing results of several recent deep learning baselines on anomalous activity detection. The real-time ADS in surveillance video sequences using SAGA based CNN with MSER feature extraction technique is implemented using Image Processing Toolbox within Matlab Software.


Sign in / Sign up

Export Citation Format

Share Document