首页> 外文会议>International Conference on Multimedia Modeling >Learning Multi-level Interaction Relations and Feature Representations for Group Activity Recognition
【24h】

Learning Multi-level Interaction Relations and Feature Representations for Group Activity Recognition

机译:学习组活动识别的多级交互关系和特征表示

获取原文

摘要

Group activity recognition is an challenging task with a major issue that reasons about complex interaction relations in the context of multi-person scenes. Most existing approaches concentrate on capturing interaction relations and learning features of the group activity at individual or group levels. These approaches lose sight of multi-level structures and interaction relations of the group activity. To overcome this challenge, we propose a Multi-level Interaction Relation model (MIR) to flexibly and efficiently learn multi-level structures of the group activity and capture multi-level interaction relations in the group activity. MIR employs graph pooling and unpooling networks to build multi-grained group relation graphs, and thus divide the group activity into multiple levels. Specifically, the Key Actor based Group Pooling layer (KeyPool) selects key persons in the activity to build the coarser-grained graph while the Key Actor based Group Unpooling layer (KeyUnPool) reconstructs the finer-grained graph according the corresponding KeyPool. Multiple KeyPool and KeyUnPool progressively build multi-grained graphs and learn multi-level structures of the group activity. Thanks to graph convolutions performed on multi-grained relation graphs, multi-level interactions are finally captured. In addition, graph readout (GR) layers are added to obtain multi-level spatio-temporal features of The group activity. Experimental results on two publicly available datasets demonstrate the effectiveness of KeyPool and KeyUnPool, and show our model can further improve the performance of group activity recognition.
机译:集团活动识别是一个具有挑战性的任务,主要问题是多人场景背景下复杂互动关系的原因。大多数现有方法集中在捕获个人或团体水平的群体活动的互动关系和学习特征。这些方法忽视了组活动的多级结构和互动关系。为了克服这一挑战,我们提出了一个多级交互关系模型(MIR),以灵活有效地学习组活动的多级结构,并捕获组活动中的多级交互关系。 MIR采用图形池和未加工网络来构建多颗粒组关系图,从而将组活动分为多个级别。具体地,基于关键演员的组池池层(Keypool)在活动中选择关键人员来构建较粗糙的粗糙图,而基于密钥actor基于的群体未降级层(keyunpool)根据相应的键池重建细粒图。多个Keypool和KeyUnPool逐步构建多粒图形并学习组活动的多级结构。由于对多粒度关系图执行的图表卷积,最终捕获了多级交互。此外,添加图表读数(GR)层以获得组活动的多级时空特征。两个公开数据集的实验结果展示了Keypool和KeyUnpool的有效性,并表明我们的模型可以进一步提高组活动识别的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号