Chong, Yong Shean (2019) Abnormal Event Detection in Surveillance Videos Using Spatiotemporal Autoencoder. Master dissertation/thesis, UTAR.
Abstract
This research presents an efficient method for detecting anomalies in videos. Recent applications of convolutional neural networks have shown promises of convolutional layers for object detection and recognition, especially in images. However, convolutional neural networks are supervised and require labels as learning signals. Hence, a spatiotemporal autoencoder architecture is proposed for anomaly detection in videos including crowded scenes. The proposed architecture includes two main components, one spatial autoencoder for learning feature representation, and one temporal autoencoder for learning the temporal evolution of the spatial features. During training, the model is trained with only normal scenes, with the objective to minimise the reconstruction error between the input video volume and the output video volume reconstructed by the learned model. After the model is trained, normal video volume is expected to have low reconstruction error, whereas abnormal video volume is expected to have a high reconstruction error. By thresholding on the error produced by each testing input volumes, our system will be able to detect when an abnormal event occurs. The model is evaluated on four surveillance video datasets and compared using the area under ROC curve and abnormal event count. Experimental results on UMN, Avenue, and UCSD benchmarks confirm that the proposed method can detect more abnormal events with lower false alarm rate than some state-of-the-art methods. The advantage of the proposed method is that it is unsupervised — the only ingredient required is long video segments containing most normal events in a fixed view. Also, no feature engineering is required as the model automatically learns the most useful features from the training data. Further investigations will be carried out to improve the result of video anomaly detection by having human feedback to update the learned model for better detection and reduced false alarms.
Actions (login required)