首页> 外文会议>International Joint Conference on Artificial Intelligence >Think Globally, Embed Locally—Locally Linear Meta-embedding of Words
【24h】

Think Globally, Embed Locally—Locally Linear Meta-embedding of Words

机译:全球思考,嵌入当地 - 局部线性的荟萃嵌入单词

获取原文

摘要

Distributed word embeddings have shown superior performances in numerous Natural Language Processing (NLP) tasks. However, their performances vary significantly across different tasks, implying that the word embeddings learnt by those methods capture complementary aspects of lexical semantics. Therefore, we believe that it is important to combine the existing word embeddings to produce more accurate and complete meta-embeddings of words. For this purpose, we propose an unsupervised locally linear meta-embedding learning method that takes pre-trained word embeddings as the input, and produces more accurate meta embeddings. Unlike previously proposed metaembedding learning methods that learn a global projection over all words in a vocabulary, our proposed method is sensitive to the differences in local neighbourhoods of the individual source word embeddings. Moreover, we show that vector concatenation, a previously proposed highly competitive baseline approach for integrating word embeddings, can be derived as a special case of the proposed method. Experimental results on semantic similarity, word analogy, relation classification, and short-text classification tasks show that our metaembeddings to significantly outperform prior methods in several benchmark dalasets. establishing a new state of the art for meta-embeddings.
机译:分布式Word Embeddings在许多自然语言处理(NLP)任务中表现出卓越的性能。然而,他们的表演在不同的任务中差异很大,这意味着这些方法捕获了这些方法的单词捕获了词汇语义的互补方面。因此,我们认为将现有的单词嵌入物结合起来是重要的,以产生更准确和完整的单词的元嵌入。为此目的,我们提出了一个无人监督的本地线性元嵌入学习方法,将预先训练的单词嵌入为输入,并产生更准确的元嵌入。与先前提出的MetaEmbedding学习方法不同,学习方法在词汇中的所有单词上学习全球预测,我们所提出的方法对各个源词嵌入的本地社区的差异敏感。此外,我们表明矢量连接,先前提出的用于集成单词嵌入的高竞争性基线方法,可以作为所提出的方法的特殊情况来推导出来。语义相似性,单词类比,关系分类和短文本分类任务的实验结果表明,我们的MetoEmbeddings在几个基准Dalasets中显着优于现有方法。为Meta-Embeddings建立新的艺术状态。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号