Published in: ACM/IEEE-CS Joint Conference on Digital Libraries
WARCreate - Create Wayback-Consumable WARC Files from Any Webpage

Abstract

The Internet Archive's Wayback Machine is the most common way that typical users interact with web archives. The Internet Archive uses the Heritrix web crawler to transform pages on the publicly available web into Web ARChive (WARC) files, which can then be accessed using the Wayback Machine. Because Heritrix can only access the publicly available web, many personal pages (e.g., password-protected pages, social media pages) cannot be easily archived into the standard WARC format. We have created a Google Chrome extension, WARCreate, that allows a user to create a WARC file from any webpage. Using this tool, any user can archive, in a standard format, content that might otherwise have been lost in time. The tool gives casual users an easy way to create archives of personal online content. This is one of the first steps in resolving issues of "long term storage, maintenance, and access of personal digital assets that have emotional, intellectual, and historical value to individuals" [3].
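
To make the idea of "creating a WARC file from a webpage" concrete, here is a minimal sketch of what a single WARC/1.0 "response" record looks like on disk. It is not WARCreate's code (WARCreate is a Chrome extension, and the abstract does not describe its internals). The helper name `build_warc_response_record`, the example URL, and the sample HTTP payload are all assumptions made for illustration.

```python
import uuid
from datetime import datetime, timezone


def build_warc_response_record(target_uri: str, http_response: bytes, when: datetime) -> bytes:
    """Serialize one WARC/1.0 'response' record (hypothetical helper, for illustration only).

    `http_response` is the raw HTTP response (status line, headers, blank line, body).
    `when` must be a timezone-aware datetime; it is written in UTC.
    """
    warc_date = when.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    header_lines = [
        "WARC/1.0",
        "WARC-Type: response",
        f"WARC-Target-URI: {target_uri}",
        f"WARC-Date: {warc_date}",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        "Content-Type: application/http; msgtype=response",
        f"Content-Length: {len(http_response)}",
    ]
    # The header block ends with an empty line; the record ends with two CRLFs.
    header_block = "\r\n".join(header_lines).encode("utf-8") + b"\r\n\r\n"
    return header_block + http_response + b"\r\n\r\n"


# Assumed sample payload standing in for a captured page.
http_response = (
    b"HTTP/1.1 200 OK\r\n"
    b"Content-Type: text/html\r\n"
    b"\r\n"
    b"<html><body>Hello</body></html>"
)
record = build_warc_response_record(
    "http://example.com/",
    http_response,
    datetime(2012, 6, 10, tzinfo=timezone.utc),
)
```

A complete archive would normally also include a `warcinfo` record and one record per embedded resource (images, stylesheets, scripts). This sketch covers only the page response itself.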