
A distributed full-text top-k document dissemination system in distributed hash tables


Abstract

Recent years have witnessed the explosive growth of 'live' web content on the World Wide Web, such as weblogs, RSS feeds, and real-time news. The popularity of RSS feeds and readers enables end users to subscribe to their favorite content by entering RSS URLs. However, the RSS feed/reader architecture suffers from (i) high bandwidth consumption and (ii) limited filtering semantics. In this paper, we propose a stateful full-text dissemination scheme over structured P2P networks to address both issues. Specifically, on the semantic side, end users can subscribe to their favorite content by entering keywords; on the bandwidth side, cooperative content polling, filtering, and dissemination over DHT-based P2P overlay networks reduce network bandwidth consumption. Our contributions include novel techniques to (i) reduce the unit publishing cost by pruning irrelevant documents along the forwarding path towards destinations, and (ii) reduce the number of publications by selecting a very small number of meaningful terms. Based on real data sets, our experimental results show that the proposed scheme significantly reduces the publishing cost while keeping maintenance overhead low and document quality high.
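
The keyword-subscription idea in the abstract can be illustrated with a small in-memory sketch. This is not the authors' algorithm: the abstract does not specify how terms are selected or how documents are pruned, so the sketch assumes plain term frequency as the selection rule and a sorted hash ring as a stand-in for the DHT. All names (`ToyDHT`, `home_node`, `num_terms`) are hypothetical.

```python
import hashlib
from bisect import bisect_right
from collections import Counter, defaultdict


def ring_hash(key: str) -> int:
    """Map a string to a position on the identifier ring using SHA-1."""
    return int(hashlib.sha1(key.encode("utf-8")).hexdigest(), 16)


class ToyDHT:
    """In-memory stand-in for a DHT: each term is owned by its successor node."""

    def __init__(self, node_names):
        self.ring = sorted((ring_hash(n), n) for n in node_names)
        self.positions = [pos for pos, _ in self.ring]
        # node name -> term -> set of subscriber ids
        self.subscriptions = defaultdict(lambda: defaultdict(set))

    def home_node(self, term: str) -> str:
        """Return the node responsible for a term (wraps around the ring)."""
        idx = bisect_right(self.positions, ring_hash(term)) % len(self.ring)
        return self.ring[idx][1]

    def subscribe(self, subscriber: str, keywords):
        """Register each keyword at the node that owns it."""
        for term in keywords:
            self.subscriptions[self.home_node(term)][term].add(subscriber)

    def publish(self, text: str, num_terms: int = 2):
        """Send the document only to the home nodes of a few selected terms.

        Assumption: the most frequent terms are the 'meaningful' ones.
        """
        counts = Counter(text.lower().split())
        selected = [t for t, _ in counts.most_common(num_terms)]
        matched = set()
        messages = 0
        for node in {self.home_node(t) for t in selected}:
            messages += 1  # one forwarded copy per distinct home node
            for term in selected:
                matched |= self.subscriptions[node].get(term, set())
        return selected, matched, messages


dht = ToyDHT(["n1", "n2", "n3", "n4"])
dht.subscribe("alice", ["dht"])
dht.subscribe("bob", ["football"])
selected, matched, messages = dht.publish("dht dht overlay dht routing overlay")
print(selected, matched, messages)
```

Publishing on only `num_terms` terms caps the number of forwarded copies per document, at the price that a subscriber whose keyword is not among the selected terms (for example "routing" above) would not receive the document. That trade-off between publishing cost and delivered document quality is what the abstract's experiments evaluate.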

Bibliographic information

  • Source
    World Wide Web, 2011, Issue 6, pp. 545-572 (28 pages)
  • Authors

    Weixiong Rao; Lei Chen;

  • Author affiliations

    Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China; School of Computer and Information Engineering, HeNan University, China

  • Original format: PDF
  • Language: English
  • Keywords

    content dissemination; distributed hash table; p2p networks; rss;

