首页> 外文会议>International Conference on Theory and Practice of Digital Libraries >Metadata Enrichment of Multi-disciplinary Digital Library: A Semantic-Based Approach
【24h】

Metadata Enrichment of Multi-disciplinary Digital Library: A Semantic-Based Approach

机译:多学科数字图书馆的元数据富集:基于语义的方法

获取原文

摘要

In the scientific digital libraries, some papers from different research communities can be described by community-dependent keywords even if they share a semantically similar topic. Articles that are not tagged with enough keyword variations are poorly indexed in any information retrieval system which limits potentially fruitful exchanges between scientific disciplines. In this paper, we introduce a novel experimentally designed pipeline for multi-label semantic-based tagging developed for open-access metadata digital libraries. The approach starts by learning from a standard scientific categorization and a sample of topic tagged articles to find semantically relevant articles and enrich its metadata accordingly. Our proposed pipeline aims to enable researchers reaching articles from various disciplines that tend to use different terminologies. It allows retrieving semantically relevant articles given a limited known variation of search terms. In addition to achieving an accuracy that is higher than an expanded query based method using a topic synonym set extracted from a semantic network, our experiments also show a higher computational scalability versus other comparable techniques. We created a new benchmark extracted from the open-access metadata of a scientific digital library and published it along with the experiment code to allow further research in the topic.
机译:在科学的数字图书馆中,即使他们共享语义上的主题,也可以通过社区相关的关键词来描述来自不同研究社区的一些论文。在任何信息检索系统中索引不足标记的文章在任何信息检索系统中都是不佳的,这限制了科学学科之间的潜在富有成效的交流。在本文中,我们介绍了一种用于开发开放式元数据库开发的多标签语义标记的新型实验设计的管道。该方法从标准的科学分类和主题样本开始时开始,标记文章的样本,以找到语义相关文章并相应地丰富其元数据。我们拟议的管道旨在使研究人员能够从各种学科中到达往往使用不同术语的文章。它允许给定针对有限的搜索术语变异的语义相关文章。除了使用从语义网络中提取的主题同义词的基于基于查询的方法的准确性来实现,我们的实验还显示了更高的计算可伸缩性与其他可比技术。我们创建了一种从科学数字图书馆的开放访问元数据提取的新基准,并与实验代码一起发布,以便进一步研究该主题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号