首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
【2h】

Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos

机译:在监控视频中的多通道生成框架和监督学习异常检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy.
机译:最近,大多数最先进的异常检测方法基于表观运动和外观重建网络,并在生成和实际信息之间使用误差估计作为检测特征。这些方法仅通过使用正常样本来实现有前途的结果进行培训步骤。在本文中,我们的贡献是两倍。一方面,我们提出了一种灵活的多通道框架来生成多型帧级别功能。另一方面,我们研究了如何通过监督学习改善检测性能。多通道框架基于四个条件GANS(CGANs),将各种类型的外观和运动信息作为输入和产生预测信息作为输出。这些CGAN提供更好的特征空间,以表示正常和异常事件之间的区别。然后,通过峰值信噪比(PSNR)来编码那些生成和地面信息之间的差异。我们建议通过构建具有数据集的原始测试集的一些异常样本的小型训练集来对这些功能进行分类。二进制支持向量机(SVM)应用于帧级异常检测。最后,我们使用面膜R-CNN作为检测器来执行以对象为中心的异常本地化。我们的解决方案主要在大道,PED1,PED2和Shanghaitech Datasets上进行评估。我们的实验结果表明,PSNR功能与监督SVM相结合优于以前的方法计算的错误映射。我们实现了PED1和Shanghaitech的帧级AUC的最先进的性能。特别是,对于最具挑战性的上海学业数据集,监督培训模式优于9%的无监督策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号