首页> 外文会议>International conference on computational linguistics >WikiRcf: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages
【24h】

WikiRcf: Wikilinks as a route to recommending appropriate references for scientific Wikipedia pages

机译:WikiRcf:Wikilinks作为向科学Wikipedia页面推荐适当参考的途径

获取原文

摘要

The exponential increase in the usage of Wikipedia as a key source of scientific knowledge among the researchers is making it absolutely necessary to metamorphose this knowledge repository into an integral and self-contained source of information for direct utilization. Unfortunately, the references which support the content of each Wikipedia entity page, are far from complete. Why are the reference section ill-formed for most Wikipedia pages? Is this section edited as frequently as the other sections of a page? Can there be appropriate surrogates that can automatically enhance the reference section? In this paper, we propose a novel two step approach - WikiRef - that (i) leverages the wikilinks present in a scientific Wikipedia target page and. thereby, (ii) recommends highly relevant references to be included in that target page appropriately and automatically borrowed from the reference section of the wikilinks. In the first step, we build a classifier to ascertain whether a wikilink is a potential source of reference or not. In the following step, we recommend references to the target page from the reference section of the wikilinks that are classified as potential sources of references in the first step. We perform an extensive evaluation of our approach on datasets from two different domains - Computer Science and Physics. For Computer Science we achieve a notably good performance with a precision@I of 0.44 for reference recommendation as opposed to 0.38 obtained from the most competitive baseline. For the Physics dataset, we obtain a similar performance boost of 10% with respect to the most competitive baseline.
机译:Wikipedia作为研究人员主要科学知识来源的使用呈指数级增长,因此绝对有必要将该知识库转变为一个完整且自包含的信息源,以直接使用。不幸的是,支持每个Wikipedia实体页面内容的参考文献还远远不够完整。为什么大多数Wikipedia页面的参考部分格式错误?该部分的编辑频率与页面其他部分的编辑频率一样吗?是否可以使用适当的替代方法来自动增强参考部分?在本文中,我们提出了一种新颖的两步方法-WikiRef-(i)利用科学Wikipedia目标页面中存在的Wikilink,并且。因此,(ii)建议将高度相关的参考适当地包括在该目标页面中,并自动从Wikilink的参考部分中借用。第一步,我们建立一个分类器,以确定Wikilink是否是潜在的参考来源。在接下来的步骤中,我们建议从Wikilink的参考部分中对目标页面的参考,在第一步中,这些参考链接被归类为潜在的参考来源。我们对来自两个不同领域的数据集-计算机科学和物理学进行了广泛的评估。对于计算机科学,我们以0.44的I精度提供了显着良好的性能,作为参考推荐,而不是从最具竞争力的基准获得的0.38。对于物理数据集,相对于最具竞争力的基准,我们获得了类似的10%的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号