Conference: International Conference on Theory and Practice of Digital Libraries

Improved Bibliographic Reference Parsing Based on Repeated Patterns

Abstract

Parsing details like author names and titles out of bibliographic references of scientific publications is an important issue. However, most existing techniques are tailored to the highly standardized reference styles used in the last two to three decades. Their performance tends to degrade when faced with the wider variety of reference styles used in older, historic publications. Thus, existing techniques are of limited use when creating comprehensive bibliographies covering both historic and contemporary scientific publications. This paper presents RefParse, a generic approach to bibliographic reference parsing that is independent of any specific reference style. Its core feature is an inference mechanism that exploits the regularities inherent in any list of references to deduce its format. Our evaluation shows that RefParse outperforms existing parsers both for contemporary and for historic reference lists.
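The abstract does not describe how the inference mechanism works, so the following is only a toy sketch of the general idea of using list-level regularities. It is not RefParse's algorithm. The two candidate layouts, their regular expressions, and the function names `infer_list_pattern` and `parse_references` are all assumptions made for illustration. Each reference votes for every layout it matches, and the layout with the most votes across the whole list is then applied to every entry, which resolves entries that are ambiguous on their own.

```python
import re
from collections import Counter

# Hypothetical candidate layouts (assumptions for illustration only,
# not taken from the paper). Each regex names the fields it captures.
CANDIDATE_PATTERNS = {
    "authors (year). title.": re.compile(
        r"^(?P<authors>.+?)\s+\((?P<year>\d{4})\)\.?\s+(?P<title>[^.]+)\."
    ),
    "authors: title. ... year": re.compile(
        r"^(?P<authors>.+?):\s+(?P<title>[^.]+)\.\s.*\b(?P<year>\d{4})\.?$"
    ),
}


def infer_list_pattern(references):
    """Return the name of the layout matched by the most references."""
    votes = Counter()
    for ref in references:
        for name, pattern in CANDIDATE_PATTERNS.items():
            if pattern.match(ref):
                votes[name] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]


def parse_references(references):
    """Parse every reference with the single layout inferred for the list."""
    name = infer_list_pattern(references)
    if name is None:
        return [None] * len(references)
    pattern = CANDIDATE_PATTERNS[name]
    parsed = []
    for ref in references:
        match = pattern.match(ref)
        parsed.append(match.groupdict() if match else None)
    return parsed


if __name__ == "__main__":
    # Made-up references; the third matches both layouts when seen alone.
    refs = [
        "Smith, J.: A study of ants. Journal of Insects 12, 1-10, 1999.",
        "Miller, K.: Early floras of Europe. Botanical Notes 4, 1887.",
        "Doe, A.: The war (1914) and its aftermath. History Review 3, 2001.",
    ]
    print(infer_list_pattern(refs))
    for fields in parse_references(refs):
        print(fields)
```

In this made-up list, the third entry also fits the parenthesized-year layout if read in isolation, which would wrongly take 1914 as the publication year. Voting over the whole list selects the colon-separated layout, so the entry is parsed consistently with its neighbours.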