首页> 外文学位 >Exploring automatic citation classification.
【24h】

Exploring automatic citation classification.

机译:探索自动引用分类。

获取原文
获取原文并翻译 | 示例

摘要

Currently, citation indexes used by digital libraries are very limited. They only provide raw citation counts and link scientific articles through their citations. There are more than one type of citations, but citation indexes treat all citations equally.;Many classification schemes currently exist. However, manual annotation of all existing digital documents is infeasible because of the sheer magnitude of the digital content, which brings about the need for automating the annotating process, but not much research has been done in the area. One of the reasons preventing researchers from researching automated citation classification is the lack on annotated corpora that they can use.;This thesis explores automated citation classification. We make several contributions to the field of citation classification. We present a new citation scheme that is easier to work with than most. Also, we present a document acquisition and citation annotation tool that helps with the development of annotated citation corpora. And finally, we present some experiments with automating citation classification.;One way to improve citation indexes is to determine the types of citations in scientific articles (background, support, perfunctory reference, etc.). This will enable researchers to query citation indexes more efficiently by locating articles grouped by citation types. For example, it can enable a researcher to locate all background material needed to understand a specific article by locating all "background" citations.
机译:当前,数字图书馆使用的引用索引非常有限。他们仅提供原始的引文计数,并通过引文链接科学文章。引文类型不止一种,但引文索引对所有引文均等对待。;目前存在许多分类方案。然而,由于数字内容的巨大规模,对所有现有数字文档进行人工注释是不可行的,这带来了对注释过程进行自动化的需求,但是该领域的研究还很少。阻碍研究人员研究自动引文分类的原因之一是缺乏可使用的带注释的语料库。我们对引用分类领域做出了一些贡献。我们提出了一种新的引用方案,该方案比大多数方法更容易使用。此外,我们还提供了一个文档获取和引证注释工具,可帮助开发带注释的引证语料库。最后,我们提出了一些自动进行引文分类的实验。改进引文索引的一种方法是确定科学文章中的引文类型(背景,支持,敷衍引用等)。通过查找按引文类型分组的文章,这将使研究人员能够更有效地查询引文索引。例如,它可以使研究人员通过查找所有“背景”引文来查找理解特定文章所需的所有背景材料。

著录项

  • 作者

    Radoulov, Radoslav.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Mathematics.;Computer Science.
  • 学位 M.Math.
  • 年度 2008
  • 页码 102 p.
  • 总页数 102
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号