首页> 外文会议>International conference on database systems for advanced applications >Leveraging Pattern Associations for Word Embedding Models
【24h】

Leveraging Pattern Associations for Word Embedding Models

机译:利用模式关联进行词嵌入模型

获取原文

摘要

Word embedding method has been shown powerful to capture words association, and facilitated numerous applications by effectively bridging lexical gaps. Word semantic is encoded with vectors and modeled based on n-gram language models, as a result it only takes into consideration of words co-occurrences in a shallow slide windows. However, the assumption of the language modelling ignores valuable associations between words in a long distance beyond n-gram coverage. In this paper, we argue that it is beneficial to jointly modeling both surrounding context and flexible associative patterns so that the model can cover long distance and intensive association. We propose a novel approach to combine associated patterns for word embedding method via joint training objection. We apply our model for query expansion in document retrieval task. Experimental results show that the proposed method can perform significantly better than the state-of-the-arts baseline models.
机译:词嵌入方法已显示出强大的功能,可以捕获词的关联,并通过有效地弥合词汇间隙来促进众多应用。单词语义通过矢量进行编码,并基于n-gram语言模型进行建模,因此,它仅考虑了浅滑动窗口中单词共现的情况。但是,语言建模的假设忽略了n-gram覆盖范围之外的长距离单词之间的宝贵关联。在本文中,我们认为对周围环境和灵活的关联模式进行联合建模是有益的,这样该模型可以涵盖长距离和密集的关联。我们提出了一种新颖的方法,通过联合训练反对,将相关模式组合在一起用于词嵌入方法。我们将模型用于文档检索任务中的查询扩展。实验结果表明,所提出的方法的性能明显优于最新的基线模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号