首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >A Discriminative Hierarchical Model for Fast Coreference at Large Scale
【24h】

A Discriminative Hierarchical Model for Fast Coreference at Large Scale

机译:大规模快速共指的判别层次模型

获取原文

摘要

Methods that measure compatibility between mention pairs are currently the dominant approach to coreference. However, they suffer from a number of drawbacks including difficulties scaling to large numbers of mentions and limited representational power. As these drawbacks become increasingly restrictive, the need to replace the pairwise approaches with a more expressive, highly scalable alternative is becoming urgent. In this paper we propose a novel discriminative hierarchical model that recursively partitions entities into trees of latent sub-entities. These trees succinctly summarize the mentions providing a highly compact, information-rich structure for reasoning about entities and coreference uncertainty at massive scales. We demonstrate that the hierarchical model is several orders of magnitude faster than pairwise, allowing us to perform coreference on six million author mentions in under four hours on a single CPU.
机译:衡量提及对之间兼容性的方法目前是主要的共指方法。但是,它们具有许多缺点,包括难以扩展到大量提及内容和有限的表示能力。随着这些缺点变得越来越严格,用更富有表现力,高度可扩展的替代方法来替换成对方法变得迫在眉睫。在本文中,我们提出了一种新颖的区分层次模型,该模型将实体递归地划分为潜在子实体的树。这些树简要地总结了所提到的内容,从而提供了一个高度紧凑,信息丰富的结构,用于大规模地推理实体和共指不确定性。我们证明了分层模型比成对模型快几个数量级,从而使我们可以在一个CPU上不到四个小时的时间内对600万个作者提及的内容进行共引用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号