首页> 外文会议>Advances in information retrieval theory >Probably Approximately Correct Search
【24h】

Probably Approximately Correct Search

机译:大概正确的搜索

获取原文
获取原文并翻译 | 示例

摘要

We consider the problem of searching a document collection using a set of independent computers. That is, the computers do not cooperate with one another either (ⅰ) to acquire their local index of documents or (ⅱ) during the retrieval of a document. During the acquisition phase, each computer is assumed to randomly sample a subset of the entire collection. During retrieval, the query is issued to a random subset of computers, each of which returns its results to the query-issuer, who consolidates the results. We examine how the number of computers, and the fraction of the collection that each computer indexes, affects performance in comparison to a traditional deterministic configuration. We provide analytic formulae that, given the number of computers and the fraction of the collection each computer indexes, provide the probability of an approximately correct search, where a "correct search" is defined to be the result of a deterministic search on the entire collection. We show that the randomized distributed search algorithm can have acceptable performance under a range of parameters settings. Simulation results confirm our analysis.
机译:我们考虑使用一组独立的计算机搜索文档集合的问题。即,计算机不相互协作(cooperate)以获取其本地文档索引,或者(ⅱ)在文档检索期间彼此不协作。在采集阶段,假定每台计算机随机采样整个集合的一个子集。在检索过程中,查询将发布到计算机的随机子集,每台计算机将其结果返回给查询发布者,查询发布者将合并结果。与传统的确定性配置相比,我们研究了计算机的数量以及每台计算机建立索引的集合的一部分对性能的影响。我们提供分析公式,给定计算机的数量和每台计算机索引的集合的比例,可以提供近似正确搜索的可能性,其中“正确搜索”定义为对整个集合进行确定性搜索的结果。我们表明,在一系列参数设置下,随机分布搜索算法可以具有可接受的性能。仿真结果证实了我们的分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号