首页> 外文会议>International conference on neural information processing >Learning Phrase Representations Based on Word and Character Embeddings
【24h】

Learning Phrase Representations Based on Word and Character Embeddings

机译:基于单词和字符嵌入的短语学习

获取原文

摘要

Most phrase embedding methods consider a phrase as a basic term and learn embeddings according to phrases' external contexts, ignoring the internal structures of words and characters. There are some languages such as Chinese, a phrase is usually composed of several words or characters and contains rich internal information. The semantic meaning of a phrase is also related to the meanings of its composing words or characters. Therefore, we take Chinese for example, and propose a joint words and characters embedding model for learning phrase representation. In order to disambiguate the word and character and address the issue of non-compositional phrases, we present multiple-prototype word and character embeddings and an effective phrase selection method. We evaluate the effectiveness of the proposed model on phrase similarities computation and analogical reasoning. The empirical result shows that our model outperforms other baseline methods which ignore internal word and character information.
机译:大多数短语嵌入方法都将短语作为基本术语,并根据短语的外部上下文学习嵌入,而忽略了单词和字符的内部结构。有一些语言,例如中文,一个短语通常由几个单词或字符组成,并包含丰富的内部信息。短语的语义含义还与组成单词或字符的含义有关。因此,我们以汉语为例,提出了一种联合的词和字符嵌入模型来学习短语表示。为了消除单词和字符的歧义并解决非组合短语的问题,我们提出了多原型单词和字符的嵌入以及一种有效的短语选择方法。我们评估该模型在短语相似度计算和类比推理中的有效性。实证结果表明,我们的模型优于其他忽略内部单词和字符信息的基线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号