首页> 外文会议>European conference on computer vision >Attention Enhanced Single Stage Multimodal Reasoner
【24h】

Attention Enhanced Single Stage Multimodal Reasoner

机译:注意力增强单级多式联版推理

获取原文

摘要

In this paper, we propose an Attention Enhanced Single Stage Multimodal Reasoner (ASSMR) to tackle the object referral task in the self-driving car scenario. We extract features from each modality and establish attention mechanisms to jointly process them. The Key Words Extractor (KWE) is used to extract the attribute and position/scale information of the target in the command, which are used to score the corresponding features through the Position/Scale Attention Module (P/SAM) and the Object Attention Module (OAM). Based on the attention mechanism, the effective part of the position/scale feature, the object attribute feature and the semantic feature of the command is enhanced. Finally, we map different features to a common embedding space to predict the final result. Our method is based on the simplified version of the Talk2Car dataset, and scored on 66.4 AP50 on the test set, while using the official region proposals.
机译:在本文中,我们提出了注意力增强的单阶段多模式推理(ASSMR)来解决自动驾驶汽车场景中的对象推荐任务。 我们从每个模态提取特征,并建立注意机制以共同处理它们。 关键词提取器(kWe)用于提取目标中目标的属性和位置/比例信息,用于通过位置/级注意模块(P / SAM)和对象注意模块进行评分相应的特征 (奥姆)。 基于关注机制,增强了位置/比例特征的有效部分,对象属性功能和命令的语义特征。 最后,我们将不同的功能映射到常见的嵌入空间以预测最终结果。 我们的方法基于Talk2Car数据集的简化版本,并在测试集上的66.4 AP50上进行评分,同时使用官方区域提案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号