首页> 外文会议>International Conference on Theory and Practice of Digital Libraries >Improved Bibliographic Reference Parsing Based on Repeated Patterns
【24h】

Improved Bibliographic Reference Parsing Based on Repeated Patterns

机译:基于重复模式改进的书目参考解析

获取原文

摘要

Parsing details like author names and titles out of bibliographic references of scientific publications is an important issue. However, most existing techniques are tailored to the highly standardized reference styles used in the last two to three decades. Their performance tends to degrade when faced with the wider variety of reference styles used in older, historic publications. Thus, existing techniques are of limited use when creating comprehensive bibliographies covering both historic and contemporary scientific publications. This paper presents RefParse, a generic approach to bibliographic reference parsing that is independent of any specific reference style. Its core feature is an inference mechanism that exploits the regularities inherent in any list of references to deduce its format. Our evaluation shows that RefParse outperforms existing parsers both for contemporary and for historic reference lists.
机译:在科学出版物的书目中解析了作者姓名和标题的详细信息是一个重要问题。然而,大多数现有技术都是针对过去两到三十年中使用的高度标准化的参考风格量身定制的技术。当面对更广泛的历史性出版物的主要参考风格面对时,它们的性能往往会降低。因此,在创建涵盖历史和当代科学出版物的综合书目时,现有技术是有限的。本文提出了refparse,一种与任何特定参考样式无关的书目参考解析的通用方法。其核心功能是推断机制,它利用任何引用列表中固有的常规的推断机制,以推断其格式。我们的评估表明,Refparse优于现代的解析器,也以当代和历史参考列表。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号