Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

Chen Xiaoyu; Kan Shichao; Zhang FanghuiCen YigangZhang LinnaZhang Damin

首页> 外文期刊>Journal of visual communication & image representation >Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

【24h】

Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

机译：基于骨架的多尺度时空注意力图卷积网络

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
引文网络
相关主题

摘要

Anomaly behavior detection plays a significant role in emergencies such as robbery. Although a lot of works have been proposed to deal with this problem, the performance in real applications is still relatively low. Here, to detect abnormal human behavior in videos, we propose a multiscale spatial temporal attention graph convolution network (MSTA-GCN) to capture and cluster the features of the human skeleton. First, based on the human skeleton graph, a multiscale spatial temporal attention graph convolution block (MSTA-GCB) is built which contains multiscale graphs in temporal and spatial dimensions. MSTA-GCB can simulate the motion relations of human body components at different scales where each scale corresponds to different granularity of annotation levels on the human skeleton. Then, static, globally-learned and attention-based adjacency matrices in the graph convolution module are proposed to capture hierarchical representation. Finally, extensive experiments are carried out on the ShanghaiTech Campus and CUHK Avenue datasets, the final results of the frame-level AUC/EER are 0.759/0.311 and 0.876/0.192, respectively. Moreover, the frame-level AUC is 0.768 for the human-related ShanghaiTech subset. These results show that our MSTA-GCN outperforms most of methods in video anomaly detection and we have obtained a new state-of-the-art performance in skeleton-based anomaly behavior detection.

机译：异常行为检测在抢劫等紧急情况中起着重要作用。虽然已经提出了很多工作来解决这个问题，但在实际应用中的性能仍然相对较低。在这里，为了检测视频中的异常人类行为，我们提出了一种多尺度时空注意力图卷积网络（MSTA-GCN）来捕获和聚类人体骨骼的特征。首先，基于人体骨骼图，构建包含时空维度多尺度图的多尺度时空注意力图卷积块（MSTA-GCB）;MSTA-GCB可以模拟人体各部件在不同尺度上的运动关系，每个尺度对应人体骨骼上不同粒度的标注级别。然后，在图卷积模块中提出静态的、全局学习的、基于注意力的邻接矩阵来捕获层次表示。最后，在上海科技大学校园和香港中文大学大道数据集上进行了大量实验，最终得到帧级AUC/EER的结果分别为0.759/0.311和0.876/0.192。此外，与人类相关的上海科技大学子集的帧级AUC为0.768。这些结果表明，我们的MSTA-GCN在视频异常检测方面优于大多数方法，并且在基于骨骼的异常行为检测方面获得了新的先进性能。

著录项

来源
《Journal of visual communication & image representation》 |2023年第2期|103707.1-103707.9|共9页
作者
Chen Xiaoyu; Kan Shichao; Zhang FanghuiCen YigangZhang LinnaZhang Damin;
展开▼
作者单位

Beijing Jiaotong Univ||Beijing Key Lab Adv Informat Sci & Network Technol;

Cent South Univ;

Guizhou Univ;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
Multiscale spatial temporal graph; Spatial attention graph convolution; Skeleton-based anomaly behavior detection;

机译：多尺度时空图;空间注意力图卷积;基于骨架的异常行为检测;

Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection

摘要

著录项

引文网络

相关主题

期刊订阅