Conference: Asia International Conference on Modelling Simulation

Application of Query Sensitive Similarity Measure in IR systems

Abstract

Document clustering has been widely used in information retrieval (IR) systems to improve both the efficiency and the effectiveness of ranked-output systems by exploiting the cluster hypothesis. This hypothesis states that relevant documents tend to be more similar to each other than to non-relevant documents, and therefore tend to appear in the same clusters. So far, the cluster hypothesis has been examined experimentally only for static clustering and for query-specific clustering using the cosine similarity measure. Document clustering with a query-sensitive similarity measure (QSSM), on the other hand, has been studied only with the N-nearest-neighbor test on very small, topic-specific document collections. In this paper, the cluster hypothesis for query-specific clustering is investigated experimentally using a query-sensitive similarity measure and a large document collection. The results show that the cluster hypothesis holds for query-specific clustering with the employed QSSM, and that using this QSSM increases the effectiveness of query-specific clustering.
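
To make the contrast between plain cosine similarity and a query-sensitive measure concrete, here is a minimal sketch. The abstract does not define the QSSM it employs, so the product form below is an assumption chosen for illustration: the cosine similarity of two documents is scaled by the cosine similarity between the query and the terms the two documents share. The function names, the raw term-count weighting, and the toy documents are all hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Turn text into a sparse term-count vector (illustrative weighting only)."""
    return dict(Counter(text.lower().split()))


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def query_sensitive_similarity(doc_i, doc_j, query):
    """Assumed QSSM: cos(doc_i, doc_j) * cos(common, query).

    `common` holds the terms shared by both documents, weighted by the
    mean of their two weights. This is an assumption, not necessarily
    the measure used in the paper.
    """
    common = {t: (doc_i[t] + doc_j[t]) / 2 for t in doc_i if t in doc_j}
    return cosine(doc_i, doc_j) * cosine(common, query)


# Toy data: d2 and d3 are equally similar to d1 under plain cosine,
# but only d1 and d2 share the query term.
d1 = vectorize("jaguar speed car engine")
d2 = vectorize("jaguar car dealer price")
d3 = vectorize("jaguar cat jungle speed")
q = vectorize("car")

print(cosine(d1, d2), cosine(d1, d3))
print(query_sensitive_similarity(d1, d2, q), query_sensitive_similarity(d1, d3, q))
```

Under plain cosine the pairs (d1, d2) and (d1, d3) are indistinguishable, whereas the assumed query-sensitive form pulls together only the pair whose shared terms match the query. That is the kind of behavior the cluster hypothesis would reward in query-specific clustering.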
