首页> 外文会议>Annual meeting of the Association for Computational Linguistics >NNE: A Dataset for Nested Named Entity Recognition in English Newswire
【24h】

NNE: A Dataset for Nested Named Entity Recognition in English Newswire

机译:NNE:英语新闻界中嵌套命名实体识别的数据集

获取原文

摘要

Named entity recognition (NER) is widely used in natural language processing applications and downstream tasks. However, most NER tools target flat annotation from popular datasets, eschewing the semantic information available in nested entity mentions. We describe NNE—a fine-grained, nested named entity dataset over the full Wall Street Journal portion of the Penn Treebank (PTB). Our annotation comprises 279,795 mentions of 114 entity types with up to 6 layers of nesting. We hope the public release of this large dataset for English newswire will encourage development of new techniques for nested NER.
机译:命名实体识别(ner)广泛用于自然语言处理应用程序和下游任务。但是,大多数ner工具目标来自流行数据集的平整注释,避免了嵌套实体提到的语义信息。我们描述了NNE-A细粒度,嵌套的命名实体数据集Penn TreeBank(PTB)的完整墙街道期刊。我们的注释包括114个实体类型的279,795个提升,最多可嵌套6层。我们希望公众发布这款大型数据集的英语新闻网将鼓励开发嵌套网的新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号