首页> 外文会议>Computational Linguistics and Intelligent Text Processing >Probabilistic Word Vector and Similarity Based on Dictionaries
【24h】

Probabilistic Word Vector and Similarity Based on Dictionaries

机译:基于字典的概率词向量与相似度

获取原文

摘要

We propose a new method for computing the probabilistic vector expression of words based on dictionaries. This method provides a well-founded procedure based on stochastic process whose applicability is clear. The proposed method exploits the relationship between headwords and their explanatory notes in dictionaries. An explanatory note is a set of other words, each of which is expanded by its own explanatory note. This expansion is repeatedly applied, but even explanatory notes expanded infinitely can be computed under a simple assumption. The vector expression we obtain is a semantic expansion of the explanatory notes of words. We explain how to acquire the vector expression from these expanded explanatory notes. We also demonstrate a word similarity computation based on a Japanese dictionary and evaluate it in comparison with a known system based on TF·IDF. The results show the effectiveness and applicability of this probabilistic vector expression.
机译:我们提出了一种新的基于字典的单词概率向量表达方法。该方法提供了一个基于随机过程的有根据的程序,其适用性很明确。所提出的方法利用了字典中headwords和它们的解释性注释之间的关系。解释性注释是一组其他词,每个词都由其自己的解释性注释扩展。这种扩展是反复应用的,但即使是无限扩展的解释性注释,也可以在一个简单的假设下进行计算。我们获得的向量表达是单词解释性注释的语义扩展。我们将解释如何从这些扩展的说明中获取向量表达式。我们还演示了基于日语词典的单词相似度计算,并与基于TF·IDF的已知系统进行了比较。结果显示了这种概率载体表达的有效性和适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号