Conference: International Conference on Theory and Practice of Digital Libraries

Venue Classification of Research Papers in Scholarly Digital Libraries

Abstract

Open-access scholarly digital libraries periodically crawl a list of URLs to obtain collections of freely available research papers. The metadata of the crawled papers, e.g., title, authors, and references, are automatically extracted before the papers are indexed in a digital library. The venue of publication is another important aspect of a scientific paper, one that reflects its authoritativeness. However, the venue is not always readily available for a paper. Instead, it needs to be extracted from the reference lists of other papers that cite the target paper. We explore a supervised learning approach to automatically classifying the venue of a research paper using only information available from the content of the paper, and show experimentally on a dataset of approximately 44,000 papers that this approach outperforms several baselines and prior work.
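
The abstract does not say which features or classifier the authors use, so the following is only a minimal sketch of the general idea of supervised venue classification from paper content. It assumes bag-of-words features and a multinomial Naive Bayes model with add-one smoothing. The `VenueClassifier` class, the `tokenize` helper, the venue labels, and the toy training texts are all hypothetical and are not taken from the paper or its dataset.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it on any non-letter character."""
    word = []
    tokens = []
    for ch in text.lower():
        if ch.isalpha():
            word.append(ch)
        elif word:
            tokens.append("".join(word))
            word = []
    if word:
        tokens.append("".join(word))
    return tokens


class VenueClassifier:
    """Multinomial Naive Bayes over word counts (illustrative assumption,
    not the method reported in the paper)."""

    def fit(self, texts, venues):
        self.venue_counts = Counter(venues)       # papers per venue
        self.word_counts = defaultdict(Counter)   # word counts per venue
        self.vocab = set()
        for text, venue in zip(texts, venues):
            tokens = tokenize(text)
            self.word_counts[venue].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        # Words never seen in training carry no evidence, so drop them.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.venue_counts.values())
        best_venue, best_score = None, -math.inf
        for venue, n_docs in self.venue_counts.items():
            counts = self.word_counts[venue]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus add-one smoothed log likelihood of each token.
            score = math.log(n_docs / total_docs)
            for t in tokens:
                score += math.log((counts[t] + 1) / denom)
            if score > best_score:
                best_venue, best_score = venue, score
        return best_venue


# Toy training data invented for illustration only.
train_texts = [
    "neural networks for image recognition and deep learning",
    "convolutional models improve object detection accuracy",
    "query optimization in relational database systems",
    "transaction processing and indexing for large databases",
]
train_venues = ["VISION", "VISION", "DATABASES", "DATABASES"]

clf = VenueClassifier().fit(train_texts, train_venues)
predicted = clf.predict("indexing strategies for database query processing")
print(predicted)
```

In a realistic setting the training texts would be the full content of papers with known venues, and the classifier would be compared against baselines on held-out papers, as the abstract describes.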
