Proximal Alternating Direction Network: A Globally Converged Deep Unrolling Framework

机译：近端交替方向网络：全球融合的深度展开框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep learning models have gained great success in many real-world applications. However, most existing networks are typically designed in heuristic manners, thus lack of rigorous mathematical principles and derivations. Several recent studies build deep structures by unrolling a particular optimization model that involves task information. Unfortunately, due to the dynamic nature of network parameters, their resultant deep propagation networks do not possess the nice convergence property as the original optimization scheme does. This paper provides a novel proximal unrolling framework to establish deep models by integrating experimentally verified network architectures and rich cues of the tasks. More importantly, we prove in theory that 1) the propagation generated by our unrolled deep model globally converges to a critical-point of a given variational energy, and 2) the proposed framework is still able to learn priors from training data to generate a convergent propagation even when task information is only partially available. Indeed, these theoretical results are the best we can ask for, unless stronger assumptions are enforced. Extensive experiments on various real-world applications verify the theoretical convergence and demonstrate the effectiveness of designed deep models.

机译：深入学习模式在许多现实世界应用中取得了巨大的成功。然而，大多数现有网络通常以启发式方式设计，因此缺乏严格的数学原理和衍生。几项研究通过展开涉及任务信息的特定优化模型来构建深层结构。遗憾的是，由于网络参数的动态性质，随着原始优化方案的形成，它们所产生的深度传播网络没有良好的收敛性。本文提供了一种新颖的近端展开框架，通过集成实验验证的网络架构和富裕的任务提示来建立深层模型。更重要的是，我们在理论上证明了1）由我们展开的深度模型产生的传播全局会聚到给定的变分能的关键点，并且2）所提出的框架仍然能够从训练数据学习前沿以产生收敛的前提即使任务信息仅为部分可用，传播也是传播。实际上，除非强制执行了更强的假设，否则这些理论结果是我们可以要求的最好的。对各种现实世界应用的广泛实验验证了理论收敛性，并展示了设计深层模型的有效性。

著录项

来源
《AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence 》|2018年|628-1461p|共8页
会议地点
作者
Risheng Liu; Xin Fan; Shichao Cheng; Xiangyu Wang; Zhongxuan Luo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Deep Proximal Unrolling: Algorithmic Framework, Convergence Analysis and Applications [J] . Liu Risheng, Cheng Shichao, Ma Long, IEEE Transactions on Image Processing . 2019 ,第10期

机译：深度近端展开：算法框架，收敛性分析和应用
2. The convergence rate of the proximal alternating direction method of multipliers with indefinite proximal regularization [J] . Min Sun, Jing Liu Journal of inequalities and applications . 2017 ,第1期

机译：具有不确定的近端正则化的乘法器的近端交替方向方法的收敛速度
3. The Proximal Alternating Direction Method of Multipliers in the Nonconvex Setting: Convergence Analysis and Rates [J] . Mathematics of operations research . 2020 ,第2期

机译：非耦合设置中乘法器的近端交替方向方法：收敛分析和速率
4. Proximal Alternating Direction Network: A Globally Converged Deep Unrolling Framework [C] . Risheng Liu, Xin Fan, Shichao Cheng, AAAI Conference on Artificial Intelligence;Innovative Applications of Artificial Intelligence Conference;Symposium on Educational Advances in Artificial Intelligence . 2018

机译：近端交替方向网络：全球融合的深度展开框架
5. Learning Bayesian Networks via the Alternating Direction Method of Multipliers [D] . Li, Jinchao 2015

机译：通过乘数的交替方向方法学习贝叶斯网络
6. The convergence rate of the proximal alternating direction method of multipliers with indefinite proximal regularization [O] . Min Sun, Jing Liu -1

机译：具有不确定的近端正则化的乘法器的近端交替方向方法的收敛速度
7. The convergence rate of the proximal alternating direction method of multipliers with indefinite proximal regularization [O] . Min Sun, Jing Liu 2017

机译：具有不确定的近端正则化的乘法器的近端交替方向方法的收敛速度
8. Global and Linear Convergence of the Generalized Alternating Direction Method of Multipliers. [R] . Deng, W., Yin, W. 2012

机译：乘子广义交替方向的全局和线性收敛性。

Proximal Alternating Direction Network: A Globally Converged Deep Unrolling Framework

摘要

著录项

相似文献

相关主题

期刊订阅