Published in: Computer Networks

PCIR: Combining DHTs and peer clusters for efficient full-text P2P indexing

Abstract

Distributed hash tables (DHTs) are very efficient for querying based on key lookups. However, building huge term indexes, as required for IR-style keyword search, poses a scalability challenge for plain DHTs. Due to the large sizes of document term vocabularies, peers joining the network cause huge amounts of key inserts and, consequently, a large number of index maintenance messages. Thus, the key to exploiting DHTs for distributed information retrieval is to reduce index maintenance costs. Various approaches in this direction have been pursued, including the use of hybrid infrastructures, or changing the granularity of the inverted index to peer level. We show that indexing costs can be significantly reduced further by letting peers form groups in a self-organized fashion. Instead of each individual peer submitting index information separately, all peers of a group cooperate to publish the index updates to the DHT in batches. Our evaluation shows that this approach reduces index maintenance cost by an order of magnitude, while still keeping a complete and correct term index for query processing.
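
The batching idea in the abstract can be illustrated with a small sketch. It is not the protocol from the paper: the message-counting model (one DHT insert message per term per sender), the function names, and the sample peers and groups are all assumptions made for illustration. It compares each peer publishing its terms separately against group members merging their terms and publishing one batched entry per term, and checks that the resulting peer-level index is the same in both cases.

```python
from collections import defaultdict


def publish_individually(peer_terms):
    """Each peer inserts every one of its terms into the DHT on its own."""
    index = defaultdict(set)  # term -> set of peer ids (peer-level granularity)
    messages = 0
    for peer, terms in peer_terms.items():
        for term in terms:
            index[term].add(peer)
            messages += 1  # assumed cost: one insert message per (peer, term)
    return index, messages


def publish_in_groups(peer_terms, groups):
    """Group members merge their terms and publish one entry per term."""
    index = defaultdict(set)
    messages = 0
    for members in groups:
        # Collect the group's updates locally before touching the DHT.
        batch = defaultdict(set)
        for peer in members:
            for term in peer_terms[peer]:
                batch[term].add(peer)
        for term, peers in batch.items():
            index[term] |= peers
            messages += 1  # assumed cost: one insert message per (group, term)
    return index, messages


# Hypothetical sample data: four peers, two groups of two.
peer_terms = {
    "p1": {"dht", "index", "peer"},
    "p2": {"dht", "index", "query"},
    "p3": {"dht", "peer", "cluster"},
    "p4": {"index", "cluster", "query"},
}
groups = [["p1", "p2"], ["p3", "p4"]]

solo_index, solo_msgs = publish_individually(peer_terms)
group_index, group_msgs = publish_in_groups(peer_terms, groups)

print(solo_msgs, group_msgs)      # 12 9
print(solo_index == group_index)  # True
```

In this toy model the saving comes only from terms that group members share, so it grows with vocabulary overlap inside a group. The order-of-magnitude reduction reported in the abstract comes from the authors' evaluation, not from this sketch.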
