Conference: Computer Conference, 2009. CSICC 2009

Quantitative similarity-based evaluation of text retrieval algorithms

Abstract

Text retrieval engines, such as search engines, return a list of documents in response to a given query. Existing evaluations of text retrieval algorithms mostly use the precision and recall of the returned list as the main quality measures of a search engine. In this paper, we propose a novel approach for comparing the algorithms adopted by different search engines and evaluating their performance. In our approach, the results of each algorithm are treated as an inter-related set of documents, and the effectiveness of the algorithm is evaluated based on the degree of relation within that set. We first verify the correctness of the evaluation measure by examining the results of two retrieval algorithms, BM25 and pivoted normalization, and comparing these results with an ideal ranking. We then compare the results of these algorithms and investigate the impact of certain major factors, such as stemming, on the results of the suggested algorithm. The experimental results support the effectiveness of the proposed method.
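
The abstract does not state how the "degree of relation" within a result set is computed. As a rough illustration only, the sketch below assumes one plausible choice: the mean pairwise cosine similarity between raw term-frequency vectors of the returned documents. The function names (`tf_vector`, `cosine`, `set_cohesion`) and the toy result lists are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def tf_vector(text):
    """Lowercase, split on whitespace, and count term frequencies."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def set_cohesion(documents):
    """Mean pairwise cosine similarity over a returned document set.

    Assumption: this stands in for the paper's similarity-based measure,
    whose exact definition is not given in the abstract.
    """
    vectors = [tf_vector(d) for d in documents]
    pairs = list(combinations(vectors, 2))
    if not pairs:  # fewer than two documents: no pairs to compare
        return 0.0
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


# Toy result lists for two hypothetical retrieval algorithms.
results_a = [
    "text retrieval ranking",
    "ranking text retrieval models",
    "retrieval ranking evaluation",
]
results_b = [
    "text retrieval ranking",
    "cooking pasta recipes",
    "football match results",
]

# Under this measure, the more topically coherent list scores higher.
print(set_cohesion(results_a) > set_cohesion(results_b))
```

Under this assumption, a list whose documents share vocabulary scores higher than one mixing unrelated topics. A preprocessing step such as stemming would change the term vectors and therefore the score, which is the kind of factor the abstract says the authors investigate.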