首页> 外文会议>International conference on information security and artificial intelligence >An Approach Based on Tongyici Cilin and Word Similarity for Chinese Word Sense Induction
【24h】

An Approach Based on Tongyici Cilin and Word Similarity for Chinese Word Sense Induction

机译:一种基于铜绿素辛格的方法和汉字感应诱导词相似性

获取原文

摘要

This paper presents a new approach of automatic unsupervised Word Sense Induction. This approach is based on Tongyici Cilin (a Chinese synonym dictionary). First we extract the neighbor words with consideration of the POS tags. Then, we calculate the word similarity according to the semantic code mapped in Tongyici Cilin (Extended). Finally, we design a greedy algorithm to accomplish clustering. The experimental results indicate that our approach is very potentially promising based on the benchmark data set provided by CIPS and SIGHAN. The work has an important significance for the usage of this thesaurus in Word Similarity Calculation and Word Sense Disambiguation.
机译:本文提出了一种自动无监督词感应诱导的新方法。这种方法是基于Tongyici Cilin(中国的同义词字典)。首先,我们考虑POS标签提取邻居单词。然后,我们根据在Tongyici Cilin(扩展)中映射的语义代码来计算单词相似度。最后,我们设计了一种贪婪的算法来完成聚类。实验结果表明,我们的方法非常有希望基于CIP和Sighan提供的基准数据集。这项工作对这个词库的使用具有重要意义,以词相似性计算和词语感歧义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号