首页> 外文会议>TPDL 2013 >Who and What Links to the Internet Archive
【24h】

Who and What Links to the Internet Archive

机译:谁以及哪些链接到互联网档案

获取原文

摘要

The Internet Archive's (IA) Wayback Machine is the largest and oldest public web archive and has become a significant repository of our recent history and cultural heritage. Despite its importance, there has been little research about how it is discovered and used. Based on web access logs, we analyze what users are looking for, why they come to IA, where they come from, and how pages link to IA. We find that users request English pages the most, followed by the European languages. Most human users come to web archives because they do not find the requested pages on the live web. About 65% of the requested archived pages no longer exist on the live web. We find that more than 82% of human sessions connect to the Wayback Machine via referrals from other web sites, while only 15% of robots have referrers. Most of the links (86%) from websites are to individual archived pages at specific points in time, and of those 83% no longer exist on the live web.
机译:Internet Archive(IA)的Wayback机器是最大,最古老的公共网络档案,已成为我们最近的历史和文化遗产的重要储存库。尽管重要的是,关于如何发现和使用时几乎没有研究。基于Web Access日志,我们分析了用户正在寻找的,为什么他们来到IA,他们来自哪里以及页面如何链接到IA。我们发现用户最多要求英文页面,其次是欧洲语言。大多数人类用户来到Web Archives,因为他们没有找到Live Web上的请求的页面。 Live Web上的约65%的被请求的存档页面不再存在。我们发现超过82%的人类会话通过来自其他网站的推荐连接到Warback机器,而只有15%的机器人有推荐人。来自网站的大多数链接(86%)是特定点的单个存档页面,并且在现场网络上的83%不再存在。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号