首页> 外文会议>International Conference on Machine Learning >Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning
【24h】

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

机译:概率函数下降:对GAN的统一视角,变分推理和加强学习

获取原文

摘要

The goal of this paper is to provide a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures. In particular, we show that generative adversarial networks, variational inference, and actor-critic methods in reinforcement learning can all be seen through the lens of our framework. We then discuss a generic optimization algorithm for our formulation, called probability functional descent (PFD), and show how this algorithm recovers existing methods developed independently in the settings mentioned earlier.
机译:本文的目标是通过将它们的概念绘制为在概率测量空间空间中定义的功能最小化,提供机器学习中的广泛问题的统一性。特别是,我们表明,通过我们框架的镜头可以看到增强学习中的生成的对抗网络,变分推论和演员 - 批评方法。然后,我们讨论我们配方的通用优化算法,称为概率函数下降(PFD),并展示该算法如何在前面提到的设置中独立开发的现有方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号