首页> 外文会议>9th International conference on language resources and evaluation >Computational Narratology: Extracting Tense Clusters from Narrative Texts
【24h】

Computational Narratology: Extracting Tense Clusters from Narrative Texts

机译:计算叙述:从叙述文本中提取紧张簇

获取原文

摘要

Computational Narratology is an emerging field within the Digital Humanities. In this paper, we tackle the problem of extracting temporal information as a basis for event extraction and ordering, as well as further investigations of complex phenomena in narrative texts. While most existing systems focus on news texts and extract explicit temporal information exclusively, we show that this approach is not feasible for narratives. Based on tense information of verbs, we define temporal clusters as an annotation task and validate the annotation schema by showing that the task can be performed with high inter-annotator agreement. To alleviate and reduce the manual annotation effort, we propose a rule-based approach to robustly extract temporal clusters using a multi-layered and dynamic NLP pipeline that combines off-the-shelf components in a heuristic setting. Comparing our results against human judgements, our system is capable of predicting the tense of verbs and sentences with very high reliability: for the most prevalent tense in our corpus, more than 95% of all verbs are annotated correctly.
机译:计算叙述是数字人文中的新兴领域。在本文中,我们解决了提取时间信息作为事件提取和排序的基础的问题,以及在叙事文本中进一步调查复杂现象。虽然大多数现有系统专注于新闻文本并专门提取明确的时间信息,但我们表明这种方法对于叙事来说是不可行的。基于动词的时态信息,我们将时间群定义为注释任务,并通过显示可以使用高annotator协议执行任务来验证注释架构。为了缓解和减少人工标注的努力,我们提出了一个基于规则的方法使用一个多层次,动态NLP管道稳健提取时间群,在启发式设置都包括了现成的,现成的组件。我们的研究结果对人的判断相比,我们的系统能够预测的动词时态和句子具有很高的可靠性:在我们的语料库中最普遍紧张,所有动词的95%以上的正确注释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号