...
首页> 外文期刊>Computer networks >Associative search in peer to peer networks: Harnessing latent semantics
【24h】

Associative search in peer to peer networks: Harnessing latent semantics

机译:对等网络中的关联搜索:利用潜在语义

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The success of a P2P file-sharing network highly depends on the scalability and versatility of its search mechanism. Two particularly desirable search features are scope (ability to find infrequent items) and support for partial-match queries (queries that contain typos or include a subset of keywords). While centralized-index architectures (such as Napster) can support both these features, existing decentralized architectures seem to support at most one: prevailing unstructured P2P protocols (such as Gnutella and FastTrack) deploy a "blind" search mechanism where the set of peers probed is unrelated to the query; thus they support partial-match queries but have limited scope. On the other extreme, the recently-proposed distributed hash tables (DHTs) such as CAN and CHORD, couple index location with the item's hash value, and thus have good scope but can not effectively support partial-match queries. Another hurdle to DHTs deployment is their tight control of the overlay structure and the information (part of the index) each peer maintains, which makes them more sensitive to failures and frequent joins and disconnects. We develop a new class of decentralized P2P architectures. Our design is based on unstructured architectures such as Gnutella and FastTrack, and retains many of their appealing properties including support for partial match queries, and relative resilience to peer failures. Yet, we obtain orders of magnitude improvement in the efficiency of locating rare items. Our approach exploits associations inherent in human selections to steer the search process to peers that are more likely to have an answer to the query. We demonstrate the potential of associative search using models, analysis, and simulations.
机译:P2P文件共享网络的成功很大程度上取决于其搜索机制的可伸缩性和多功能性。两个特别理想的搜索功能是范围(查找不常见项目的能力)和对部分匹配查询的支持(包含错别字或包含关键字子集的查询)。尽管集中索引架构(例如Napster)可以同时支持这两种功能,但现有的分散式架构似乎最多支持一种功能:流行的非结构化P2P协议(例如Gnutella和FastTrack)部署了一种“盲”搜索机制,在该机制下,对等体集被探测到与查询无关;因此,它们支持部分匹配查询,但范围有限。另一方面,最近提出的分布式哈希表(DHT)(例如CAN和CHORD)将索引位置与项目的哈希值耦合在一起,因此具有良好的范围,但不能有效地支持部分匹配查询。 DHT部署的另一个障碍是他们对重叠结构和每个对等方维护的信息(索引的一部分)的严格控制,这使它们对故障和频繁的连接和断开连接更加敏感。我们开发了新型的分散式P2P架构。我们的设计基于Gnutella和FastTrack等非结构化架构,并保留了它们许多吸引人的特性,包括对部分匹配查询的支持以及对同级故障的相对弹性。但是,我们在查找稀有物品的效率上获得了数量级的提高。我们的方法利用人工选择中固有的关联性来将搜索过程引导到更可能对查询有答案的同级。我们展示了使用模型,分析和模拟进行关联搜索的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号