首页> 外文会议>International Conference on Universal Digital Library(ICUDL2005); 20051031-1102; Hangzhou(CN) >Automatic Recognition and Mining of Chinese Synonyms for Information Retrieval
【24h】

Automatic Recognition and Mining of Chinese Synonyms for Information Retrieval

机译:信息检索中文同义词的自动识别和挖掘

获取原文
获取原文并翻译 | 示例

摘要

Automatic recognition and mining of synonyms play an important role in information retrieval. In addition, it is broadly used in other applications, such as auto-indexing, auto-classification, auto-abstracting and machine translation ,etc. In order to enhance the mining ability of the synonyms, this paper presents two methods. The first method is pattern matching algorithm based on the mode of dictionary definition, we set some digging rules by hands, the system then digs synonyms by pattern matching automatically. The second method is the PageRank algorithm based on the definition in dictionary. We analyze the relation links between a given words and other words, then construct the associated word graph. Finally, we use the PageRank algorithm to calculate the similarity degree and discover synonyms in the associated word graph. The mining practice of financial dictionaries show that the precision of pattern matching algorithm and PageRank algorithm reach 90% and 85.6% respectively.
机译:同义词的自动识别和挖掘在信息检索中起着重要作用。此外,它还广泛用于其他应用,例如自动索引,自动分类,自动抽象和机器翻译等。为了提高同义词的挖掘能力,提出了两种方法。第一种方法是基于字典定义模式的模式匹配算法,我们手动设置一些挖掘规则,然后系统通过模式匹配自动挖掘同义词。第二种方法是基于字典中定义的PageRank算法。我们分析给定单词与其他单词之间的关系链接,然后构造关联的单词图。最后,我们使用PageRank算法来计算相似度,并在关联的词图中发现同义词。金融词典的挖掘实践表明,模式匹配算法和PageRank算法的精度分别达到90%和85.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号