【24h】

Joint Bootstrapping Machines for High Confidence Relation Extraction

机译:联合自举机,用于高置信度关系提取

获取原文

摘要

Semi-supervised bootstrapping techniques for relationship extraction from text iteratively expand a set of initial seed instances. Due to the lack of labeled data, a key challenge in bootstrapping is semantic drift: if a false positive instance is added during an iteration, then all following iterations are contaminated. We introduce BREX, a new bootstrapping method that protects against such contamination by highly effective confidence assessment. This is achieved by using entity and template seeds jointly (as opposed to just one as in previous work), by expanding entities and templates in parallel and in a mutually constraining fashion in each iteration and by introducing higher-quality similarity measures for templates. Experimental results show that BREX achieves an F_1 that is 0.13 (0.87 vs. 0.74) better than the state of the art for four relationships.
机译:用于从文本中提取关系的半监督自举技术迭代地扩展了一组初始种子实例。由于缺少标记数据,引导过程中的一个关键挑战是语义漂移:如果在迭代过程中添加了误报实例,则随后的所有迭代都将受到污染。我们引入了BREX,这是一种新的自举方法,可通过高效的置信度评估来防止此类污染。这是通过联合使用实体和模板种子(而不是以前的工作中的一个),通过在每次迭代中并行且以相互约束的方式扩展实体和模板以及为模板引入更高质量的相似性度量来实现的。实验结果表明,对于四个关系,BREX的F_1优于现有技术的F_1,为0.13(0.87对0.74)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号