首页> 外文会议>Digital Libraries: Universal and Ubiquitous Access to Information >Consolidation of References to Persons in Bibliographic Databases
【24h】

Consolidation of References to Persons in Bibliographic Databases

机译:书目数据库中对人的引用的合并

获取原文
获取原文并翻译 | 示例

摘要

Entity resolution is the process of determining if, in a specific context, two or more references correspond to the same entity. In this work, we address this problem in the context of references to persons as they are found in bibliographic data, specifically in the case of consolidating multiple datasets. Or solution follows the extraction, transformation and loading (ETL) process, typical in data warehouses. It computes the similarities of the attribute values for the references, and employs a decision tree to decide when the references match. We describe the characteristics of these references within bibliographic datasets, and how we explored those characteristics by developing new similarity metrics to improve the quality of the consolidation process. We evaluated our work by designing an experiment with data from four national libraries. The results show that the proposed similarity metrics contribute significantly to the consolidation process.
机译:实体解析是确定在特定上下文中两个或更多引用是否对应于同一实体的过程。在这项工作中,我们将参考书目数据中提到的人来解决这个问题,特别是在合并多个数据集的情况下。或者解决方案遵循数据仓库中典型的提取,转换和加载(ETL)过程。它计算引用的属​​性值的相似度,并使用决策树来确定引用何时匹配。我们在书目数据集中描述了这些参考的特征,以及我们如何通过开发新的相似性指标来探索这些特征以提高合并过程的质量。我们通过设计来自四个国家图书馆的数据进行的实验来评估我们的工作。结果表明,提出的相似性度量标准对合并过程做出了重要贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号