首页> 外文会议>WRI World Congress on Computer Science and Information Engineering >A Set-Covering-Based Approach for Overlapping Resource Selection in Distributed Information Retrieval
【24h】

A Set-Covering-Based Approach for Overlapping Resource Selection in Distributed Information Retrieval

机译:基于集覆盖的分布式信息检索中资源选择的重叠方法

获取原文

摘要

Resource selection, also called server selection, collection selection or database selection, is a foundational problem in distributed information retrieval (DIR). This paper introduces a set-covering-based algorithm for resource selection in DIR, with consideration of overlapping extent between resources. Give different document with different weight according to its position in merged results for question Q. Only results that have not appeared in some earlier selected resource are focused on in later selected resources. The score of each resource is decided by the total weights of those merged results included in, and only the resource with max score is selected in each selecting step. So, the selecting order is the actual rank of selected resources which are used to search the question Qpsila, which is similar to question Q. The approach saves big searching time due to overlapping between databases and, at the same time, enhances user's recall rate and precision.
机译:资源选择,也称为服务器选择,集合选择或数据库选择,是分布式信息检索(DIR)的基本问题。本文介绍了一种基于集合覆盖的DIR资源选择算法,其中考虑了资源之间的重叠程度。根据其在问题Q的合并结果中的位置,以不同的权重赋予其他文档。只有在某些较早选择的资源中未出现的结果才集中在较晚选择的资源中。每个资源的分数由其中包含的合并结果的总权重决定,并且在每个选择步骤中仅选择具有最大分数的资源。因此,选择顺序是用于搜索问题Qpsila的所选资源的实际等级,与问题Q相似。该方法由于数据库之间的重叠而节省了大量的搜索时间,同时提高了用户的召回率和精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号