首页> 外文期刊>PeerJ Computer Science >Joint embedding VQA model based on dynamic word vector
【24h】

Joint embedding VQA model based on dynamic word vector

机译:基于动态词向量的联合嵌入VQA模型

获取原文
           

摘要

The existing joint embedding Visual Question Answering models use different combinations of image characterization, text characterization and feature fusion method, but all the existing models use static word vectors for text characterization. However, in the real language environment, the same word may represent different meanings in different contexts, and may also be used as different grammatical components. These differences cannot be effectively expressed by static word vectors, so there may be semantic and grammatical deviations. In order to solve this problem, our article constructs a joint embedding model based on dynamic word vector—none KB-Specific network (N-KBSN) model which is different from commonly used Visual Question Answering models based on static word vectors. The N-KBSN model consists of three main parts: question text and image feature extraction module, self attention and guided attention module, feature fusion and classifier module. Among them, the key parts of N-KBSN model are: image characterization based on Faster R-CNN, text characterization based on ELMo and feature enhancement based on multi-head attention mechanism. The experimental results show that the N-KBSN constructed in our experiment is better than the other 2017—winner (glove) model and 2019—winner (glove) model. The introduction of dynamic word vector improves the accuracy of the overall results.
机译:现有的联合嵌入视觉问题应答模型使用不同的图像特征,文本表征和特征融合方法的不同组合,但所有现有模型都使用静态字向量进行文本表征。然而,在实际语言环境中,相同的词可以代表不同上下文中的不同含义,并且也可以用作不同的语法组件。这些差异不能通过静态字向量有效地表达,因此可能存在语义和语法偏差。为了解决这个问题,我们的物品基于动态字矢量 - 无KB特定网络(N-KBSN)模型构建了一个联合嵌入模型,其与基于静态字向量的常用视觉问题应答模型不同。 N-KBSN模型由三个主要部分组成:问题文本和图像功能提取模块,自我注意和引导注意模块,功能融合和分类器模块。其中,N-KBSN模型的关键部分是:基于更快的R-CNN,基于ELMO的文本表征的图像表征,基于多针注意机制的特征增强。实验结果表明,我们的实验中构建的N-KBSN优于其他2017年 - 获奖者(手套)模型和2019年 - 赢家(手套)模型。动态字矢量的引入提高了整体结果的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号