首页> 外文会议>ACM international conference on information and knowledge management >Federated Search in the Wild: The Combined Power of over a Hundred Search Engines
【24h】

Federated Search in the Wild: The Combined Power of over a Hundred Search Engines

机译:在野外进行联合搜索:数百种搜索引擎的组合功能

获取原文

摘要

Federated search has the potential of improving web search: the user becomes less dependent on a single search provider and parts of the deep web become available through a unified interface, leading to a wider variety in the retrieved search results. However, a publicly available dataset for federated search reflecting an actual web environment has been absent. As a result, it has been difficult to assess whether proposed systems are suitable for the web setting. We introduce a new test collection containing the results from more than a hundred actual search engines, ranging from large general web search engines such as Google and Bing to small domain-specific engines. We discuss the design and analyze the effect of several sampling methods. For a set of test queries, we collected relevance judgements for the top 10 results of each search engine. The dataset is publicly available and is useful for researchers interested in resource selection for web search collections, result merging and size estimation of uncooperative resources.
机译:联合搜索具有改善Web搜索的潜力:用户变得不再依赖单个搜索提供程序,并且深层Web的一部分可通过统一界面使用,从而导致检索到的搜索结果种类更多。但是,缺少用于反映实际Web环境的联合搜索的公共可用数据集。结果,很难评估所提议的系统是否适合于网络设置。我们引入了一个新的测试集合,其中包含来自一百多个实际搜索引擎的结果,这些搜索引擎的范围从大型通用网络搜索引擎(例如Google和Bing)到小型特定于域的引擎。我们讨论设计并分析几种采样方法的效果。对于一组测试查询,我们收集了每个搜索引擎的前10个结果的相关性判断。该数据集是公开可用的,对于对网络搜索集合的资源选择,结果合并以及不合作的资源的大小感兴趣的研究人员很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号