首页> 外文会议>Workshop on Scholarly Document Processing >UniHD@CL-SciSumm 2020: Citation Extraction as Search
【24h】

UniHD@CL-SciSumm 2020: Citation Extraction as Search

机译:UNIHD @ CL-SCISUMM 2020:引文提取为搜索

获取原文

摘要

This work presents the entry by the team from Heidelberg University in the CL-SciSumm 2020 shared task at the Scholarly Document Processing workshop at EMNLP 2020. As in its previous iterations, the task is to highlight relevant parts in a reference paper, depending on a citance text excerpt from a citing paper. We participated in tasks 1A (cited text span identification) and 1B (citation context classification). Contrary to most previous works, we frame Task 1A as a search relevance problem, and introduce a 2-step re-ranking approach, which consists of a preselection based on BM25 in addition to positional document features, and a top-k re-ranking with BERT. For Task 1B, we follow previous submissions in applying methods that deal well with low resources and imbalanced classes.
机译:这项工作介绍了Heidelberg University的Cl-Scisumm 2020在EMNLP 2020的学术文档处理研讨会上的分享任务。如前所述,任务是在参考文件中突出相关部件,具体取决于a从引用纸张摘录的CITANCE TEXT。我们参与了任务1A(引用文本跨度标识)和1B(引文上下文分类)。与大多数以前的作品相反,我们将任务1A框架作为搜索相关问题,并引入了一个2步重写方法,其除了位置文档功能之外还包括基于BM25的预选,以及顶级k重新排名用伯特。对于任务1B,我们遵循以前的提交,以应用低资源和不平衡类的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号