首页> 外文会议>Natural language understanding and intelligent applications >Chinese Word Similarity Computing Based on Combination Strategy
【24h】

Chinese Word Similarity Computing Based on Combination Strategy

机译:基于组合策略的中文词语相似度计算

获取原文
获取原文并翻译 | 示例

摘要

Chinese word similarity computing is a fundamental task for natural language processing. This paper presents a method to calculate the similarity between Chinese words based on combination strategy. We apply Baidubaike to train Word2Vector model, and then integrate different methods, semantic Dictionary-based method, Word2Vector-based method and Chinese FrameNet (CFN)-based method, to calculate the semantic similarity between Chinese words. The semantic Dictionary-based method includes dictionaries such as HowNet, DaCilin, Tongyici Cilin (Extended) and Antonym. The experiments are performed on 500 pairs of words and the Spearman correlation coefficient of test data is 0.524, which shows that the proposed method is feasible and effective.
机译:中文单词相似度计算是自然语言处理的基本任务。本文提出了一种基于组合策略的汉字相似度计算方法。我们将百度百科用于训练Word2Vector模型,然后将不同的方法,基于语义词典的方法,基于Word2Vector的方法和基于中文FrameNet(CFN)的方法进行集成,以计算汉字之间的语义相似度。基于语义词典的方法包括诸如HowNet,DaCilin,Tongyici Cilin(扩展)和反义词之类的词典。对500对单词进行了实验,测试数据的Spearman相关系数为0.524,表明该方法是可行和有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号