首页> 外文会议>International Conference on Theory and Practice of Digital Libraries >Evaluating the SiteStory Transactional Web Archive with the ApacheBench Tool
【24h】

Evaluating the SiteStory Transactional Web Archive with the ApacheBench Tool

机译:使用ApacheBench工具评估SiteEtty交易Web归档

获取原文
获取外文期刊封面目录资料

摘要

Conventional Web archives are created by periodically crawling a Web site and archiving the responses from the Web server. Although easy to implement and commonly deployed, this form of archiving typically misses updates and may not be suitable for all preservation scenarios, for example a site that is required (perhaps for records compliance) to keep a copy of all pages it has served. In contrast, transactional archives work in conjunction with a Web server to record all content that has been served. Los Alamos National Laboratory has developed SiteStory, an open-source transactional archive written in Java that runs on Apache Web servers, provides a Memento compatible access interface, and WARC file export features. We used Apache's ApacheBench utility on a pre-release version of SiteStory to measure response time and content delivery time in different environments. The performance tests were designed to determine the feasibility of SiteStory as a production-level solution for high fidelity automatic Web archiving. We found that SiteStory does not significantly affect content server performance when it is performing transactional archiving. Content server performance slows from 0.076 seconds to 0.086 seconds per Web page access when the content server is under load, and from 0.15 seconds to 0.21 seconds when the resource has many embedded and changing resources.
机译:通过定期爬行网站并将来自Web服务器的响应归类来创建传统的Web归档。虽然易于实现和常用,但这种归档通常会错过更新,并且可能不适合所有保存方案,例如,所需的站点(可能是记录合规性),以保留其所服务的所有页面的副本。相比之下,事务归档与Web服务器一起工作,以记录已提供的所有内容。 LOS Alamos National Laboratory已经开发了SiteTy,一个在Apache Web服务器上运行的Java中编写的开源交易存档,提供了Memento兼容的访问接口和WARC文件导出功能。我们使用Apache的ApacheBench Utility在SiteTory的预发布版本上测量不同环境中的响应时间和内容交货时间。性能测试旨在确定SITETY作为高保真自动WEB归档的生产级别解决方案的可行性。我们发现当在执行事务存档时,SeteTory不会显着影响内容服务器性能。内容服务器性能从内容服务器在负载下的加载时从0.076秒放大到0.086秒,并且在资源有许多嵌入和更改的资源时,0.15秒到0.21秒。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号