首页> 外文会议>International Conference on Machine Learning >Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

【24h】

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

机译：概率函数下降：对GAN的统一视角，变分推理和加强学习

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The goal of this paper is to provide a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures. In particular, we show that generative adversarial networks, variational inference, and actor-critic methods in reinforcement learning can all be seen through the lens of our framework. We then discuss a generic optimization algorithm for our formulation, called probability functional descent (PFD), and show how this algorithm recovers existing methods developed independently in the settings mentioned earlier.

机译：本文的目标是通过将它们的概念绘制为在概率测量空间空间中定义的功能最小化，提供机器学习中的广泛问题的统一性。特别是，我们表明，通过我们框架的镜头可以看到增强学习中的生成的对抗网络，变分推论和演员 - 批评方法。然后，我们讨论我们配方的通用优化算法，称为概率函数下降（PFD），并展示该算法如何在前面提到的设置中独立开发的现有方法。

著录项

来源
《International Conference on Machine Learning》|2019年|1420-2201p|共17页
会议地点
作者
Casey Chu; Jose Blanchet; Peter Glynn;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP181-53;
关键词

相似文献

外文文献
中文文献
专利

1. Advanced NOMA Receivers From a Unified Variational Inference Perspective [J] . Meng Xiangming, Zhang Lei, Wang Chao, IEEE Journal on Selected Areas in Communications . 2021,第4期

机译：来自统一变分推理的先进的NOMA接收器视角
2. Perspectives of probabilistic inferences: Reinforcement learning and an adaptive network compared [J] . Rieskamp J Journal of experimental psychology. Learning, memory, and cognition . 2006,第6期

机译：概率推断的观点：强化学习和自适应网络的比较
3. Swarm robots reinforcement learning convergence accuracy-based learning classifier systems with gradient descent (XCS-GD) [J] . Jie Shao, Haixia Lin, Kaibian Zhang Neural computing & applications . 2014,第2期

机译：群体机器人强化学习基于梯度下降的基于学习精度的学习分类器系统（XCS-GD）
4. Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning [C] . Casey Chu, Jose Blanchet, Peter Glynn International Conference on Machine Learning . 2019

机译：概率函数下降：对GAN的统一视角，变分推理和加强学习
5. Functional Characterization of Striatal Afferent Projections in the Context of Reinforcement Learning [D] . Parker, Nathan Francis. 2019

机译：强化学习背景下纹状体传入预测的功能性
6. Probability learning as a function of momentary reinforcement probability [O] . Ben A. Williams 1972

机译：概率学习与瞬时强化概率的关系
7. RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion [O] . Muhammad Sarmad, Hyunjoo Jenny Lee, Young Min Kim 2019

机译：RL-GaN网：加强学习代理控制GAN网络，用于实时点云形状完成
8. Reinforcement Learning Through Gradient Descent [R] . Baird, L. C. 1999

机译：通过梯度下降强化学习

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅