论文标题
视频中异常事件检测的时空预测任务
Spatio-temporal predictive tasks for abnormal event detection in videos
论文作者
论文摘要
视频中的异常事件检测是一个具有挑战性的问题,部分原因是异常模式和缺乏相应的注释。在本文中,我们提出了新的受约束借口任务来学习对象级别正态性模式。我们的方法包括在原始分辨率下学习映射下尺度的视觉查询及其相应的正常外观和运动特性之间的映射。所提出的任务比重建和未来的框架预测任务更具挑战性,这些任务在文献中广泛使用,因为我们的模型学会了共同预测空间和时间特征而不是重建它们。我们认为,更受限制的借口任务可以更好地学习正态性模式。几个基准数据集的实验证明了我们在胜过或达到时空评估指标上的本地化和跟踪异常的有效性。
Abnormal event detection in videos is a challenging problem, partly due to the multiplicity of abnormal patterns and the lack of their corresponding annotations. In this paper, we propose new constrained pretext tasks to learn object level normality patterns. Our approach consists in learning a mapping between down-scaled visual queries and their corresponding normal appearance and motion characteristics at the original resolution. The proposed tasks are more challenging than reconstruction and future frame prediction tasks which are widely used in the literature, since our model learns to jointly predict spatial and temporal features rather than reconstructing them. We believe that more constrained pretext tasks induce a better learning of normality patterns. Experiments on several benchmark datasets demonstrate the effectiveness of our approach to localize and track anomalies as it outperforms or reaches the current state-of-the-art on spatio-temporal evaluation metrics.