首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2011 >Template-Based Information Extraction without the Templates
【24h】

Template-Based Information Extraction without the Templates

机译:没有模板的基于模板的信息提取

获取原文

摘要

Standard algorithms for template-based information extraction (IE) require predefined template schemas, and often labeled data, to learn to extract their slot fillers (e.g., an embassy is the Target of a Bombing template). This paper describes an approach to template-based IE that removes this requirement and performs extraction without knowing the template structure in advance. Our algorithm instead learns the template structure automatically from raw text, inducing template schemas as sets of linked events (e.g., bombings include detonate, set off, and destroy events) associated with semantic roles. We also solve the standard IE task, using the induced syntactic patterns to extract role fillers from specific documents. We evaluate on the MUC-4 terrorism dataset and show that we induce template structure very similar to hand-created gold structure, and we extract role fillers with an Fl score of .40, approaching the performance of algorithms that require full knowledge of the templates.
机译:用于基于模板的信息提取(IE)的标准算法要求使用预定义的模板架构(通常带有标记的数据)来学习提取其插槽填充符(例如,使馆是轰炸模板的目标)。本文介绍了一种基于模板的IE的方法,该方法消除了这一要求并执行提取,而无需事先知道模板的结构。相反,我们的算法会从原始文本中自动学习模板结构,将模板方案归纳为与语义角色相关联的链接事件集(例如,爆炸包括引爆,引爆和销毁事件)。我们还解决了标准IE任务,使用诱导句法模式从特定文档中提取角色填充符。我们对MUC-4恐怖主义数据集进行了评估,结果表明我们诱导出的模板结构与手工创建的黄金结构非常相似,并且我们提取了Fl得分为.40的角色填充物,从而接近了需要完全了解模板的算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号