International Conference on Computational Linguistics

A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology

Abstract

We describe a large coreference annotation task performed on a corpus of 266 papers from the ACL Anthology, a publicly available electronic collection of scientific papers in the domain of computational linguistics and language technology. The annotation mainly comprises noun phrase coreference over the full textual content of each paper in the Anthology subset. Each paper was annotated carefully in at least two passes (an initial annotation and a secondary correction phase). The purpose of this paper is to summarize the comprehensive annotation schema and to release the corpus publicly along with the paper. The corpus is far larger than the ACE coreference corpora. It can be used to train coreference resolution systems in the computational linguistics and language technology domain for applications such as semantic search, taxonomy extraction, question answering, citation analysis, and scientific discourse analysis.
