...
首页> 外文期刊>Journal of visual communication & image representation >Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection
【24h】

Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

机译:基于骨架的多尺度时空注意力图卷积网络

获取原文
获取原文并翻译 | 示例

摘要

Anomaly behavior detection plays a significant role in emergencies such as robbery. Although a lot of works have been proposed to deal with this problem, the performance in real applications is still relatively low. Here, to detect abnormal human behavior in videos, we propose a multiscale spatial temporal attention graph convolution network (MSTA-GCN) to capture and cluster the features of the human skeleton. First, based on the human skeleton graph, a multiscale spatial temporal attention graph convolution block (MSTA-GCB) is built which contains multiscale graphs in temporal and spatial dimensions. MSTA-GCB can simulate the motion relations of human body components at different scales where each scale corresponds to different granularity of annotation levels on the human skeleton. Then, static, globally-learned and attention-based adjacency matrices in the graph convolution module are proposed to capture hierarchical representation. Finally, extensive experiments are carried out on the ShanghaiTech Campus and CUHK Avenue datasets, the final results of the frame-level AUC/EER are 0.759/0.311 and 0.876/0.192, respectively. Moreover, the frame-level AUC is 0.768 for the human-related ShanghaiTech subset. These results show that our MSTA-GCN outperforms most of methods in video anomaly detection and we have obtained a new state-of-the-art performance in skeleton-based anomaly behavior detection.
机译:异常行为检测在抢劫等紧急情况中起着重要作用。虽然已经提出了很多工作来解决这个问题,但在实际应用中的性能仍然相对较低。在这里,为了检测视频中的异常人类行为,我们提出了一种多尺度时空注意力图卷积网络(MSTA-GCN)来捕获和聚类人体骨骼的特征。首先,基于人体骨骼图,构建包含时空维度多尺度图的多尺度时空注意力图卷积块(MSTA-GCB);MSTA-GCB可以模拟人体各部件在不同尺度上的运动关系,每个尺度对应人体骨骼上不同粒度的标注级别。然后,在图卷积模块中提出静态的、全局学习的、基于注意力的邻接矩阵来捕获层次表示。最后,在上海科技大学校园和香港中文大学大道数据集上进行了大量实验,最终得到帧级AUC/EER的结果分别为0.759/0.311和0.876/0.192。此外,与人类相关的上海科技大学子集的帧级AUC为0.768。这些结果表明,我们的MSTA-GCN在视频异常检测方面优于大多数方法,并且在基于骨骼的异常行为检测方面获得了新的先进性能。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号