首页> 外文会议>International conference on computational linguistics >A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology
【24h】

A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology

机译:来自ACL选集的学术论文的一个完全循环撰写的语料库

获取原文

摘要

We describe a large coreference annotation task performed on a corpus of 266 papers from the ACL Anthology, a publicly, electronically available collection of scientific papers in the domain of computational linguistics and language technology. The annotation comprises mainly noun phrase coreference of the full textual content of each paper in the Anthology subset. It has been performed carefully and at least twice for each paper (initial annotation and secondary correction phase). The purpose of this paper is to summarize the comprehensive annotation schema and release the corpus publicly, along with this paper. The corpus is by far larger than the ACE coreference corpora. It can be used to train coreference resolution systems in the Computational Linguistics and Language Technology domain for semantic search, taxonomy extraction, question answering, citation analysis, scientific discourse analysis, etc.
机译:我们描述了在来自ACL选集的266篇论文的语料库中,在计算语言学和语言技术领域的科学论文中的266篇论文的语料库上执行了大型Coreference注释任务。注释主要包括在选集子集中的每篇论文的全文内容的名词短语芯参考。它已经仔细执行,每份纸张至少进行两次(初始注释和次要校正阶段)。本文的目的是总结全面的注释架构并公开释放语料库。语料库远远大于ACE Coreference Corpora。它可用于培训计算语言学和语言技术领域的COSTEREDS解析系统进行语义搜索,分类学提取,问题应答,引文分析,科学话语分析等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号