首页> 外文会议>Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on >Recognizing manipulation actions in arts and crafts shows using domain-specific visual and textual cues
【24h】

Recognizing manipulation actions in arts and crafts shows using domain-specific visual and textual cues

机译:使用特定领域的视觉和文字提示识别手工艺品中的操纵动作

获取原文
获取原文并翻译 | 示例

摘要

We present an approach for automatic annotation of commercial videos from an arts-and-crafts domain with the aid of textual descriptions. The main focus is on recognizing both manipulation actions (e.g. cut, draw, glue) and the tools that are used to perform these actions (e.g. markers, brushes, glue bottle). We demonstrate how multiple visual cues such as motion descriptors, object presence, and hand poses can be combined with the help of contextual priors that are automatically extracted from associated transcripts or online instructions. Using these diverse features and linguistic information we propose several increasingly complex computational models for recognizing elementary manipulation actions and composite activities, as well as their temporal order. The approach is evaluated on a novel dataset of comprised of 27 episodes of PBS Sprout TV, each containing on average 8 manipulation actions.
机译:我们提供了一种借助文字描述自动注释来自工艺美术领域的商业视频的方法。主要重点是识别操作动作(例如,剪切,绘制,粘贴)和用于执行这些动作的工具(例如,标记,画笔,胶水瓶)。我们演示了如何借助上下文优先级(可以从关联的成绩单或在线说明中自动提取)来结合多个视觉提示(例如运动描述符,对象存在和手势)。利用这些多样化的功能和语言信息,我们提出了几种日益复杂的计算模型,用于识别基本的操纵动作和复合活动以及它们的时间顺序。该方法在一个新颖的数据集上进行了评估,该数据集包含27集PBS Sprout TV,每集平均包含8个操作动作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号