首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Learning Soft Linear Constraints with Application to Citation Field Extraction
【24h】

Learning Soft Linear Constraints with Application to Citation Field Extraction

机译:学习软线性约束及其在引文场提取中的应用

获取原文

摘要

Accurately segmenting a citation string into fields for authors, titles, etc. is a challenging task because the output typically obeys various global constraints. Previous work has shown that modeling soft constraints, where the model is encouraged, but not require to obey the constraints, can substantially improve segmentation performance. On the other hand, for imposing hard constraints, dual decomposition is a popular technique for efficient prediction given existing algorithms for unconstrained inference. We extend dual decomposition to perform prediction subject to soft constraints. Moreover, with a technique for performing inference given soft constraints, it is easy to automatically generate large families of constraints and learn their costs with a simple convex optimization problem during training. This allows us to obtain substantial gains in accuracy on a new, challenging citation extraction dataset.
机译:将输出正确地分为作者,标题等字段是一项艰巨的任务,因为输出通常会遵循各种全局约束。先前的工作表明,在对软约束进行建模(鼓励模型但不要求遵守约束)的情况下,可以大大提高细分效果。另一方面,为了强加硬约束,对偶分解是给定现有算法无约束推断的一种有效预测的流行技术。我们扩展对偶分解以执行受软约束约束的预测。此外,通过一种用于执行给定软约束的推理的技术,可以很容易地自动生成大量约束,并在训练过程中通过简单的凸优化问题来了解其成本。这使我们可以在新的,具有挑战性的引文提取数据集上获得大量的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号