【24h】

The Coefficient of Synonymy

机译:同义词系数

获取原文

摘要

The measurement of synonymy between words is an essential function required by numerous NLP tasks. However, similarity measures often give word similarity values that are very difficult to interpret and compare when applied to popular word embeddings. We introduce a coefficient of synonymy Cs, which uses a new method based on word distance and the density of words around those being compared for measuring synonymy. This method provides consistent, comparable values indicating the level of synonymy between words. We compare measurements of Cswith cosine using four off-the-shelf word embeddings created by Word2Vec and GloVe and two 350,000 word pair datasets.
机译:单词之间的同义词的度量是许多NLP任务所需的基本功能。但是,相似性度量通常会给出单词相似性值,将其应用于流行词嵌入时很难解释和比较。我们引入一个同义词系数C s ,它使用一种基于单词距离和被比较单词周围单词密度的新方法来测量同义词。此方法提供一致,可比较的值,指示单词之间的同义词级别。我们比较C的量度 s 使用余弦,使用Word2Vec和GloVe创建的四个现成的词嵌入和两个350,000个词对数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号