首页> 外文会议>International Conference on Multimedia Analysis and Pattern Recognition >Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database
【24h】

Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database

机译:大型视频数据库中基于视觉和音频特征的所需人员搜索

获取原文

摘要

How to find a person doing an action in a video database is a challenging problem because the result must be correct at an instance level with the specific person doing the appropriate action. Even though there have been many works about face recognition and action recognition, they often focus on only one separate task. In this paper, the problem could be formulated into an instance retrieval where input is a query consisting of examples of the target person and examples of the desired action, and the result is a list of ranked positive shots. In this work, we proposed a simple but efficient person-action retrieval system by combining multimodal features including visual feature and audio feature to deal with various types of instances by making use of available visual or audio cues. The evaluation results on a large-scale BBC Eastenders dataset with 3rd rank in a total of 6 teams in TRECVID INS 2019 has proved the effectiveness of the proposed method.
机译:如何在视频数据库中找到正在做某事的人是一个具有挑战性的问题,因为结果必须在实例级别正确,并且特定人在做适当的行为。即使关于面部识别和动作识别的著作很多,但它们通常只专注于一项单独的任务。在本文中,可以将问题表达为实例检索,其中输入是一个由目标人的示例和所需动作的示例组成的查询,结果是一个排序的肯定镜头的列表。在这项工作中,我们提出了一个简单而有效的人员动作检索系统,该系统通过结合包括视觉功能和音频功能在内的多模式功能,利用可用的视觉或音频提示来处理各种类型的实例。在3个大型BBC Eastenders数据集上的评估结果 rd 在TRECVID INS 2019中总共6个团队中的排名证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号