A Pocket Guide to Web History

Abstract

Web archives like the Internet Archive preserve the evolutionary history of large portions of the Web. Access to them, however, is still limited to rather restrictive interfaces: search functionality is often missing or ignores the time axis. Time-travel search alleviates this shortcoming by enriching keyword queries with a temporal context of interest. To be effective, time-travel queries require historical PageRank scores. In this paper, we address this requirement and propose rank synopses as a novel structure to compactly represent and reconstruct historical PageRank scores. Rank synopses can reconstruct the PageRank score of a web page as of any point during its lifetime, even in the absence of a snapshot of the Web as of that time. We further devise a normalization scheme for PageRank scores to make them comparable across different graphs. Through a comprehensive evaluation over different datasets, we demonstrate the accuracy and space economy of the proposed methods.
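
The abstract does not say how a rank synopsis is stored or queried. As a rough illustration of the idea of reconstructing a score at a time for which no snapshot exists, the sketch below assumes a synopsis is simply a sorted list of `(timestamp, score)` pairs and that scores between stored points can be approximated by linear interpolation. The function name `reconstruct_score`, the data layout, and the interpolation choice are all assumptions made for this example, not the method described in the paper.

```python
from bisect import bisect_right


def reconstruct_score(synopsis, t):
    """Estimate a page's score at time t from a compact synopsis.

    Assumption (not from the paper): `synopsis` is a list of
    (timestamp, score) pairs sorted by strictly increasing timestamp,
    and the score changes linearly between consecutive stored points.
    """
    if not synopsis:
        raise ValueError("empty synopsis")
    times = [ts for ts, _ in synopsis]
    if t < times[0] or t > times[-1]:
        raise ValueError("t lies outside the page's recorded lifetime")

    # Index of the first stored point strictly after t.
    i = bisect_right(times, t)
    if i == len(times):
        # t coincides with the last stored timestamp.
        return synopsis[-1][1]

    (t0, s0), (t1, s1) = synopsis[i - 1], synopsis[i]
    # Linear interpolation between the two surrounding stored points.
    return s0 + (s1 - s0) * (t - t0) / (t1 - t0)


# Hypothetical synopsis: three stored points covering times 0 to 20.
example = [(0, 0.2), (10, 0.4), (20, 0.1)]
mid_score = reconstruct_score(example, 5)  # between the first two points
```

Under these assumptions, a query for any time inside the page's lifetime needs only the two neighboring stored points, which is what would keep such a structure far smaller than a full series of per-snapshot scores.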