首页> 外文会议>Youth Academic Annual Conference of Chinese Association of Automation >An Autoencoder with a Memory Module for Video Anomaly Detection
【24h】

An Autoencoder with a Memory Module for Video Anomaly Detection

机译:具有用于视频异常检测的内存模块的AutoEncoder

获取原文

摘要

With the rise of deep convolutional neural networks (CNNs), considerable attention has been paid to video anomaly detection (VAD). Autoencoders are a popular type of framework for VAD, and many existing VAD methods are based on it. However, these methods take an assumption of closed-world VAD, i.e., do not comprehensively consider the diversity of normal patterns. Besides, a competent CNN allows the autoencoder to reconstruct or predict abnormal video frames proficiently, resulting in missing anomalies. To mitigate these drawbacks, we propose an Autoencoder with a Memory Module (AMM) to realize video anomaly detection by predicting video frames. AMM consists of three modules: an encoder, a decoder, and a memory module. First, the consecutive frames are fed into the encoder to yield latent spatial features. Then, the features are utilized to retrieve corresponding memory items in the memory module to generate memory mapping features. Finally, the memory mapping features are adopted in the decoder for predicting the next frame. To match the queries against memory items accurately, we propose a memory triplet loss, which takes into account both size and angle discrepancies between the queries and memory items. At the training stage, AMM utilizes the memory triplet loss, a prediction loss, and multi-scale structure similarity measure. Moreover, the modes of retrieving and updating memory items are ameliorated by a scaled dot product model, which can alleviate vanishing gradient problems to a certain extent. Extensive experiments are conducted on three benchmark public datasets, and the results demonstrate the superior performance of AMM.
机译:随着深度卷积神经网络(CNNS)的兴起,已对视频异常检测(VAD)支付了相当大的关注。 AutoEncoders是VAD的流行框架,许多现有的VAD方法都是基于它的。然而,这些方法假设封闭世界的VAD,即,不要全面考虑正常模式的多样性。此外,竞争力的CNN允许AutoEncoder熟练地重建或预测异常的视频帧,从而导致异常缺失。为了缓解这些缺点,我们提出了一种具有存储器模块(AMM)的AutoEncoder来实现视频异常检测,通过预测视频帧。 AMM由三个模块组成:编码器,解码器和内存模块。首先,将连续帧馈入编码器以产生潜在的空间特征。然后,使用该特征来检索存储器模块中的相应存储器项以生成存储器映射特征。最后,在解码器中采用存储映射特征以预测下一帧。要准确地匹配查询,我们提出了内存三重态丢失,这考虑了查询和存储器项之间的大小和角度差异。在训练阶段,AMM利用内存三重态丢失,预测损失和多尺度结构相似度测量。此外,检索和更新存储器项的模式由缩放点产品模型进行了改善,可以在一定程度上缓解消失的梯度问题。广泛的实验是在三个基准公共数据集中进行的,结果表明了AMM的卓越性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号