首页> 外文会议>ACM/IEEE-CS joint conference on digital libraries >A Study of Automation from Seed URL Generation to Focused Web Archive Development: The CTRnet Context
【24h】

A Study of Automation from Seed URL Generation to Focused Web Archive Development: The CTRnet Context

机译:从种子URL生成自动化对聚焦Web档案开发的研究:CtreNT语言

获取原文

摘要

In the event of emergencies and disasters, massive amounts of web resources are generated and shared. Due to the rapidly changing nature of those resources, it is important to start archiving them as soon as a disaster occurs. This led us to develop a prototype system for constructing archives with minimum human intervention using the seed URLs extracted from tweet collections. We present the details of our prototype system. We applied it to five tweet collections that had been developed in advance, for evaluation. We also identify five categories of non-relevant files and conclude with a discussion of findings from the evaluation.
机译:如果发生紧急情况和灾难,会生成和共享大量的Web资源。由于这些资源的快速变化,一旦发生灾难,就会开始归档它们是很重要的。这使我们开发了一个原型系统,用于构建档案,使用从Tweet集合中提取的种子URL构建档案。我们介绍了我们的原型系统的详细信息。我们将其应用于提前开发的五个推文集合,以进行评估。我们还通过评估的调查结果讨论了五类非相关文件和结论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号