首页> 外文会议>2012 7th international conference on computer science amp; education >Information retrieval using Hellinger distance and sqrt-cos similarity
【24h】

Information retrieval using Hellinger distance and sqrt-cos similarity

机译:使用Hellinger距离和sqrt-cos相似度进行信息检索

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we propose a similarity measurement method based on the Hellinger distance and square-root cosine. Then use Hellinger distance as the distance metric for document clustering and a new square-root cosine similarity for query information retrieval. This new similarity/distance also bridges between traditional tf_idf weighting to binary weighting in vector space model. Finally, we conduct a comparison on performance between this method and the one based on Euclidean distance and cosine similarity. And from the results, we clearly observe that the precision and recall are improved by using the sqrt-cos similarity.
机译:本文提出了一种基于Hellinger距离和平方根余弦的相似度测量方法。然后,将Hellinger距离用作文档聚类的距离度量,并使用新的平方根余弦相似度进行查询信息检索。这种新的相似性/距离还在向量空间模型中的传统tf_idf加权与二进制加权之间架起了桥梁。最后,我们对该方法与基于欧氏距离和余弦相似度的方法之间的性能进行了比较。从结果中,我们可以清楚地看到,使用sqrt-cos相似度可以提高准确性和查全率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号