首页> 外文会议>Annual meeting of the Association for Computational Linguistics >An Analysis of Action Recognition Datasets for Language and Vision Tasks
【24h】

An Analysis of Action Recognition Datasets for Language and Vision Tasks

机译:语言和视觉任务的动作识别数据集分析

获取原文

摘要

A large amount of recent research has focused on tasks that combine language and vision, resulting in a proliferation of datasets and methods. One such task is action recognition, whose applications include image annotation, scene understanding and image retrieval. In this survey, we categorize the existing approaches based on how they conceptualize this problem and provide a detailed review of existing datasets, highlighting their diversity as well as advantages and disadvantages. We focus on recently developed datasets which link visual information with linguistic resources and provide a fine-grained syntactic and semantic analysis of actions in images.
机译:最近的大量研究集中在结合语言和视觉的任务上,从而导致数据集和方法的激增。一种这样的任务是动作识别,其应用包括图像注释,场景理解和图像检索。在本次调查中,我们根据现有方法对这个问题的概念进行归类,并对现有数据集进行详细审查,强调其多样性以及优缺点。我们专注于最近开发的数据集,这些数据集将视觉信息与语言资源联系在一起,并提供了图像中动作的细粒度句法和语义分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号