首页> 外文期刊>Information Processing & Management >Learning multimodal word representation with graph convolutional networks
【24h】

Learning multimodal word representation with graph convolutional networks

机译:学习与图形卷积网络的多模式字表示

获取原文
       

摘要

Multimodal models have been proven to outperform text-based models on learning semantic word representations. According to psycholinguistic theory, there is a graphical relationship among the modalities of language, and in recent years, the graph convolution network (GCN) has been proven to have substantial advantages in the extraction of non-European spatial features. This inspires us to propose a new multimodal word representation model, namely, GCNW, which uses the graph convolutional network to incorporate the phonetic and syntactic information into the word representation. We use a greedy strategy to update the modality-relation matrix in the GCN, and we train the model through unsupervised learning. We evaluated the proposed model on multiple downstream NLP tasks, and various experimental results demonstrate that the GCNW outperforms strong unimodal baselines and state-of-the-art multimodal models. We make the source code of both models available to encourage reproducible research.
机译:已被证明多式联运模型以学习语义字表示越突出基于文本的模型。根据精神语言学理论,语言模式之间存在图形关系,近年来,图表卷积网络(GCN)被证明在提取非欧洲空间特征方面具有实质性优势。这激发了我们提出了一种新的多模式字表示模型,即GCNW,它使用图形卷积网络将语音和语法信息结合到单词表示中。我们使用贪婪的策略来更新GCN中的模态关系矩阵,我们通过无监督的学习培训模型。我们在多个下游NLP任务中评估了所提出的模型,各种实验结果表明,GCNW优于强大的单向基线和最先进的多模式模型。我们制作两种型号的源代码可以鼓励可重复的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号