首页> 外文期刊>JMLR: Workshop and Conference Proceedings >End-to-End Learning for Structured Prediction Energy Networks
【24h】

End-to-End Learning for Structured Prediction Energy Networks

机译:结构化预测能源网络的端到端学习

获取原文
           

摘要

Structured Prediction Energy Networks (SPENs) are a simple, yet expressive family of structured prediction models (Belanger and McCallum, 2016). An energy function over candidate structured outputs is given by a deep network, and predictions are formed by gradient-based optimization. This paper presents end-to-end learning for SPENs, where the energy function is discriminatively trained by back-propagating through gradient-based prediction. In our experience, the approach is substantially more accurate than the structured SVM method of Belanger and McCallum (2016), as it allows us to use more sophisticated non-convex energies. We provide a collection of techniques for improving the speed, accuracy, and memory requirements of end-to-end SPENs, and demonstrate the power of our method on 7-Scenes image denoising and CoNLL-2005 semantic role labeling tasks. In both, inexact minimization of non-convex SPEN energies is superior to baseline methods that use simplistic energy functions that can be minimized exactly.
机译:结构化预测能源网络(SPEN)是结构化预测模型的简单而富于表现力的系列(Belanger and McCallum,2016)。候选结构化输出上的能量函数由深度网络给出,而预测则通过基于梯度的优化来形成。本文介绍了SPEN的端到端学习,其中通过基于梯度的预测进行反向传播来区别地训练能量函数。根据我们的经验,该方法比Belanger和McCallum(2016)的结构化SVM方法准确得多,因为它允许我们使用更复杂的非凸能量。我们提供了一系列提高端到端SPEN的速度,准确性和内存要求的技术,并展示了我们的方法在7种场景图像去噪和CoNLL-2005语义角色标记任务中的强大功能。在这两种方法中,非凸型SPEN能量的不精确最小化均优于使用简单化能量函数(可以精确地最小化)的基线方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号