【24h】

Proximal Alternating Direction Network: A Globally Converged Deep Unrolling Framework

机译:近端交替方向网络:全球融合的深度展开框架

获取原文

摘要

Deep learning models have gained great success in many real-world applications. However, most existing networks are typically designed in heuristic manners, thus lack of rigorous mathematical principles and derivations. Several recent studies build deep structures by unrolling a particular optimization model that involves task information. Unfortunately, due to the dynamic nature of network parameters, their resultant deep propagation networks do not possess the nice convergence property as the original optimization scheme does. This paper provides a novel proximal unrolling framework to establish deep models by integrating experimentally verified network architectures and rich cues of the tasks. More importantly, we prove in theory that 1) the propagation generated by our unrolled deep model globally converges to a critical-point of a given variational energy, and 2) the proposed framework is still able to learn priors from training data to generate a convergent propagation even when task information is only partially available. Indeed, these theoretical results are the best we can ask for, unless stronger assumptions are enforced. Extensive experiments on various real-world applications verify the theoretical convergence and demonstrate the effectiveness of designed deep models.
机译:深入学习模式在许多现实世界应用中取得了巨大的成功。然而,大多数现有网络通常以启发式方式设计,因此缺乏严格的数学原理和衍生。几项研究通过展开涉及任务信息的特定优化模型来构建深层结构。遗憾的是,由于网络参数的动态性质,随着原始优化方案的形成,它们所产生的深度传播网络没有良好的收敛性。本文提供了一种新颖的近端展开框架,通过集成实验验证的网络架构和富裕的任务提示来建立深层模型。更重要的是,我们在理论上证明了1)由我们展开的深度模型产生的传播全局会聚到给定的变分能的关键点,并且2)所提出的框架仍然能够从训练数据学习前沿以产生收敛的前提即使任务信息仅为部分可用,传播也是传播。实际上,除非强制执行了更强的假设,否则这些理论结果是我们可以要求的最好的。对各种现实世界应用的广泛实验验证了理论收敛性,并展示了设计深层模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号