...
首页> 外文期刊>IEEE Transactions on Image Processing >Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction
【24h】

Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction

机译:通过具有查询重建的跨模型交互网络的时刻检索

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Moment retrieval aims to localize the most relevant moment in an untrimmed video according to the given natural language query. Existing works often only focus on one aspect of this emerging task, such as the query representation learning, video context modeling or multi-modal fusion, thus fail to develop a comprehensive system for further performance improvement. In this paper, we introduce a novel Cross-Modal Interaction Network (CMIN) to consider multiple crucial factors for this challenging task, including the syntactic dependencies of natural language queries, long-range semantic dependencies in video context and the sufficient cross-modal interaction. Specifically, we devise a syntactic GCN to leverage the syntactic structure of queries for fine-grained representation learning and propose a multi-head self-attention to capture long-range semantic dependencies from video context. Next, we employ a multi-stage cross-modal interaction to explore the potential relations of video and query contents, and we also consider query reconstruction from the cross-modal representations of target moment as an auxiliary task to strengthen the cross-modal representations. The extensive experiments on ActivityNet Captions and TACoS demonstrate the effectiveness of our proposed method.
机译:时刻检索旨在根据给定的自然语言查询本地化未经机视频中最相关的时刻。现有的作品通常只关注这个新兴任务的一个方面,例如查询表示学习,视频上下文建模或多模态融合,因此未能开发一个综合系统以进行进一步的性能改进。在本文中,我们介绍了一种新的跨模型交互网络(CMIN),以考虑这种具有挑战性的任务的多个关键因素,包括自然语言查询的句法依赖性,视频上下文中的远程语义依赖性以及足够的跨模型交互。具体而言,我们设计了一个句法GCN,以利用查询的句法结构进行细粒度的表示学习,并提出多人自我注意,从视频上下文捕获远程语义依赖关系。接下来,我们采用多阶段的跨模型交互来探索视频和查询内容的潜在关系,我们还考虑从目标时刻的跨模型表示作为辅助任务的查询重建,以加强跨模型表示。关于ActivityNet标题和炸玉米饼的广泛实验证明了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号