【24h】

Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction

机译:Doc2EDAG:中国金融事件提取的端到端文档级框架

获取原文

摘要

Most existing event extraction (EE) methods merely extract event arguments within the sentence scope. However, such sentence-level EE methods struggle to handle soaring amounts of documents from emerging applications, such as finance, legislation, health, etc., where event arguments always scatter across different sentences, and even multiple such event mentions frequently co-exist in the same document. To address these challenges, we propose a novel end-to-end model, Doc2EDAG, which can generate an entity-based directed acyclic graph to fulfill the document-level EE (DEE) effectively. Moreover, we reformal-ize a DEE task with the no-trigger-words design to ease document-level event labeling. To demonstrate the effectiveness of Doc2EDAG, we build a large-scale real-world dataset consisting of Chinese financial announcements with the challenges mentioned above. Extensive experiments with comprehensive analyses illustrate the superiority of Doc2EDAG over state-of-the-art methods.
机译:大多数现有的事件提取(EE)方法仅提取句子范围内的事件参数。但是,此类句子级的EE方法难以处理来自新兴应用程序(如财务,立法,卫生等)的大量文档,其中事件参数始终散布在不同的句子中,甚至多个此类事件引用经常并存于不同的句子中。相同的文件。为了解决这些挑战,我们提出了一种新颖的端到端模型Doc2EDAG,该模型可以生成基于实体的有向无环图,以有效地实现文档级EE(DEE)。此外,我们使用无触发字设计重新设计了DEE任务,以简化文档级事件标记。为了证明Doc2EDAG的有效性,我们构建了一个大规模的现实世界数据集,其中包含上述挑战的中国金融公告。进行全面分析的大量实验表明Doc2EDAG优于最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号