首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Learning Soft Linear Constraints with Application to Citation Field Extraction
【24h】

Learning Soft Linear Constraints with Application to Citation Field Extraction

机译:学习软线性约束与应用引导字段提取

获取原文

摘要

Accurately segmenting a citation string into fields for authors, titles, etc. is a challenging task because the output typically obeys various global constraints. Previous work has shown that modeling soft constraints, where the model is encouraged, but not require to obey the constraints, can substantially improve segmentation performance. On the other hand, for imposing hard constraints, dual decomposition is a popular technique for efficient prediction given existing algorithms for unconstrained inference. We extend dual decomposition to perform prediction subject to soft constraints. Moreover, with a technique for performing inference given soft constraints, it is easy to automatically generate large families of constraints and learn their costs with a simple convex optimization problem during training. This allows us to obtain substantial gains in accuracy on a new, challenging citation extraction dataset.
机译:准确地将引文字符串分割为作者,标题等的字段是一个具有挑战性的任务,因为输出通常遵守各种全局约束。以前的工作表明,建模软限制,鼓励模型,但不需要遵守约束,可以大大提高分割性能。另一方面,为了施加硬度限制,双分解是给定针对未约束推理的现有算法的有效预测的流行技术。我们扩展了双分解,以执行对软限制的预测。此外,利用用于对赋予软限制的推理进行推理的技术,很容易自动生成大型约束,并在训练期间通过简单的凸优化问题来学习其成本。这使我们可以在新的挑战引用提取数据集上准确地获得大幅提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号