首页> 外文会议>2014 IEEE/ACM Joint Conference on Digital Libraries >Disambiguating publication venue titles using association rules
【24h】

Disambiguating publication venue titles using association rules

机译:使用关联规则消除发布场所标题的歧义

获取原文
获取原文并翻译 | 示例

摘要

Research agencies in several countries evaluate the impact of scientific publications of researcher groups to define their investments, and one of the main used metrics is the quality of the publication venues where their works were published. Several bibliometric indexes have been formulated by measuring the quality of a publication venue. However, given a set of citations extracted, for example, from curricula vitae of a researcher group, to effectively use bibliometric indexes to evaluate their quality it is necessary to identify correctly the publication venue title of each citation. This task is not easy, since there are not unique identifiers for publication venues. Frequently, citations contain abbreviated forms and acronyms, publication venues share similar titles, sometimes they change their titles, divide or merge, creating new ones. Traditional digital libraries deal with this problem by creating Authority Files. In this work, we present a twofold contribution: (i) the creation of a Computer Science publication venue authority file and (ii) the proposal of a method that uses association rules to disambiguate publication venue titles originated from citations. The disambiguator is a supervised learning method that uses the authority file to train a classifier, whose generated model is a set of association rules to identify publication venues. Experiments show that our method obtains better results than three state of art baselines.
机译:多个国家/地区的研究机构评估研究人员团体的科学出版物的影响以定义其投资,主要使用的指标之一是其作品发表的出版场所的质量。通过测量出版场所的质量,已经制定了若干文献索引。但是,给定一组引文,例如从研究人员的履历中提取的引文,为了有效地使用文献索引来评估其质量,有必要正确地识别每个引文的出版地名称。由于没有发布场所的唯一标识符,因此此任务并不容易。通常,引文包含缩写形式和首字母缩写词,出版场所共享相似的标题,有时它们会更改标题,划分或合并以创建新的标题。传统的数字图书馆通过创建授权文件来解决此问题。在这项工作中,我们提出了两个方面的贡献:(i)创建计算机科学出版场所授权文件,以及(ii)建议使用关联规则来消除源自引用的出版场所名称的方法。歧义消除器是一种监督学习方法,使用授权文件来训练分类器,分类器的生成模型是一组关联规则以标识发布场所。实验表明,我们的方法比三种最先进的基准可获得更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号