Towards Building a Collection of Web Archiving Research Articles


Abstract

The field of Web Archiving exists in a fluid, fragmented, and heterogeneous state. Part of the problem is that this field is relatively new and its literature is scattered across a wide range of journal and conference venues. This makes the state of Web Archiving as a discipline particularly difficult to ascertain. This paper presents an approach to building a collection of articles about the subject. We begin with a small dataset of articles taken from a Web Archiving Bibliography and then proceed to expand it by crawling the Web and collecting additional documents. The crawled documents are then classified using machine learning classification techniques. We show that by extracting the documents' titles and abstracts and representing them using the "bag of words" approach, we are able to accurately identify documents from the Web crawler as documents that are about Web Archiving. We also discuss our results in the context of Web Archiving as an emerging field.
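
The classification step described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not name a specific classifier, so a small multinomial Naive Bayes is assumed here, and the class name `NaiveBayes`, the labels `web_archiving` and `other`, and the toy training strings are all hypothetical.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count word occurrences; word order is discarded."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed classifier)."""

    def fit(self, texts, labels):
        self.doc_counts = Counter(labels)   # label -> number of training documents
        self.word_counts = {}               # label -> Counter of word frequencies
        for text, label in zip(texts, labels):
            self.word_counts.setdefault(label, Counter()).update(bag_of_words(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, text):
        total_docs = sum(self.doc_counts.values())
        bag = bag_of_words(text)
        scores = {}
        for label, counts in self.word_counts.items():
            total_words = sum(counts.values())
            # Log prior plus smoothed log likelihood of each known word.
            score = math.log(self.doc_counts[label] / total_docs)
            for word, n in bag.items():
                if word in self.vocab:  # words never seen in training are ignored
                    p = (counts[word] + 1) / (total_words + len(self.vocab))
                    score += n * math.log(p)
            scores[label] = score
        return max(scores, key=scores.get)


# Made-up "title + abstract" strings for illustration; not data from the paper.
train_texts = [
    "web archiving crawler preserves archived web pages",
    "preservation of web archive collections and crawling",
    "corn grain yield and soil micronutrients",
    "mammals parasites biodiversity field collections",
]
train_labels = ["web_archiving", "web_archiving", "other", "other"]

model = NaiveBayes().fit(train_texts, train_labels)
prediction = model.predict("evaluating a crawler for web archive preservation")
print(prediction)
```

In a realistic setting the training texts would be the titles and abstracts from the seed bibliography plus labeled negative examples, and a library classifier with proper evaluation would replace this toy model.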

Bibliographic details

Source: Annual Meeting of the Association for Information Science and Technology | 2014 | 5 pages
Authors: Brenda Reyes Ayala; Cornelia Caragea
Original format: PDF
Chinese Library Classification: G201-53
Date indexed: 2022-08-21 09:42:20

Similar literature

1. Archive Web: collaboratively extending and exploring web archive collections-How would you like to work with your collections? [J] . Zeon Trevor Fernando, Ivana Marenzi, Wolfgang Nejdl . International Journal on Digital Libraries . 2018, No. 1
2. Historians' Use of Digital Archival Collections: The Web, Historical Scholarship, and Archival Research [J] . Donghee Sinn, Nicholas Soares . Journal of the American Society for Information Science and Technology . 2014, No. 9
3. Web 2.0 Tools and Strategies for Archives and Local History Collections Web 2.0 for Librarians and Information Professionals [J] . Karine Burger . Archivaria . 2011, No. 72
4. Towards Building a Collection of Web Archiving Research Articles [C] . Brenda Reyes Ayala, Cornelia Caragea . Proceedings of the 77th ASIS&T annual meeting, Connecting collections, cultures, and communities . 2014
5. Bootstrapping Web Archive Collections from Micro-Collections in Social Media [D] . Nwala, Alexander C. . 2020
6. Building an integrated infrastructure for exploring biodiversity: field collections and archives of mammals and parasites [O] . Kurt E Galbreath, Eric P Hoberg, Joseph A Cook
7. Formation of corn grain yield and quality depending on micronutrients topdressing under conditions of Left bank Forest Steppe [O] . YE. Krestʹyaninov, L. Yermakova, T. Antal . 2019