首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Template-Based Information Extraction without the Templates
【24h】

Template-Based Information Extraction without the Templates

机译:基于模板的信息提取而无需模板

获取原文

摘要

Standard algorithms for template-based information extraction (IE) require predefined template schemas, and often labeled data, to learn to extract their slot fillers (e.g., an embassy is the Target of a Bombing template). This paper describes an approach to template-based IE that removes this requirement and performs extraction without knowing the template structure in advance. Our algorithm instead learns the template structure automatically from raw text, inducing template schemas as sets of linked events (e.g., bombings include detonate, set off, and destroy events) associated with semantic roles. We also solve the standard IE task, using the induced syntactic patterns to extract role fillers from specific documents. We evaluate on the MUC-4 terrorism dataset and show that we induce template structure very similar to hand-created gold structure, and we extract role fillers with an Fl score of .40, approaching the performance of algorithms that require full knowledge of the templates.
机译:基于模板的信息提取(IE)的标准算法需要预定义的模板模式和通常标记的数据,以学习提取其插槽填充物(例如,大使馆是轰炸模板的目标)。本文介绍了基于模板的方法,即去除该要求并在不提前了解模板结构的情况下执行提取。我们的算法改为自动从原始文本中自动学习模板结构,将模板模式作为链接事件组(例如,爆炸包括引导,删除和销毁事件)与语义角色相关联。我们还使用诱导的句法模式来解决标准IE任务,从特定文件中提取角色填充物。我们在Muc-4恐怖主义数据集上评估,并显示我们诱导模板结构与手工创造的金色结构非常相似,并且我们提取具有FL得分的角色填充物,即40,接近需要全面了解模板的算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号