Towards Coreference for Literary Text: Analyzing Domain-Specific Phenomena

Abstract

Coreference resolution is the task of grouping together references to the same discourse entity. Resolving coreference in literary texts could benefit a number of Digital Humanities (DH) tasks, such as analyzing the depiction of characters and/or their relations. Domain-dependent training data has been shown to improve coreference resolution for many domains, e.g. the biomedical domain, because the properties of such text differ significantly from those of news text or dialogue, on which automatic systems are typically trained. This also holds for literary texts. We therefore analyze the specific properties of coreference-related phenomena in a number of texts and give directions for the adaptation of annotation guidelines. As some of the adaptations have a profound impact, we also present a new annotation tool for coreference, with a focus on enabling annotation of long texts with many discourse entities.
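
To make the task definition concrete, the following is a minimal sketch of how pairwise coreference links could be merged into entity clusters. It is an illustration only, not the method or the annotation tool described in the paper; the function name `cluster_mentions`, the `(token_index, text)` mention format, and the example mentions are all hypothetical.

```python
from collections import defaultdict


def cluster_mentions(mentions, links):
    """Group mentions into entities, given pairwise coreference links.

    mentions: list of unique, hashable mentions, here (token_index, text).
    links: pairs of mentions that were judged to corefer.
    Both formats are assumptions made for this illustration.
    """
    # Union-find: every mention starts out as its own entity.
    parent = {m: m for m in mentions}

    def find(m):
        # Walk up to the representative, shortening the path on the way.
        while parent[m] != m:
            parent[m] = parent[parent[m]]
            m = parent[m]
        return m

    # Each link merges the two entities its mentions belong to.
    for a, b in links:
        parent[find(a)] = find(b)

    # Collect mentions by representative, keeping document order.
    clusters = defaultdict(list)
    for m in mentions:
        clusters[find(m)].append(m)
    return sorted(clusters.values(), key=lambda c: mentions.index(c[0]))


# Hypothetical mentions from a literary passage.
mentions = [(0, "Elizabeth"), (3, "Mr. Darcy"), (7, "she"),
            (10, "him"), (14, "her sister")]
links = [((0, "Elizabeth"), (7, "she")),
         ((3, "Mr. Darcy"), (10, "him"))]

entities = cluster_mentions(mentions, links)
for entity in entities:
    print([text for _, text in entity])
```

Mentions without any link, such as `"her sister"` here, end up as singleton entities. Whether singletons are annotated at all is a guideline decision, and this sketch takes no position on it.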
