Journal: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

Reinforcement Learning for Continuous Stochastic Actions: An Approximation of Probability Density Function by Orthogonal Wave Function Expansion



Abstract

A function approximation based on an orthonormal wave function expansion in a complex space is derived. Although a probability density function (PDF) cannot always be expanded in an orthogonal series in a real space because a PDF is a positive real function, the function approximation can approximate an arbitrary PDF with high accuracy. It is applied to an actor-critic method of reinforcement learning to derive an optimal policy expressed by an arbitrary PDF in a continuous-action continuous-state environment. A chaos control problem and a PDF approximation problem are solved using the actor-critic method with the function approximation, and it is shown that the function approximation can approximate a PDF well and that the actor-critic method with the function approximation exhibits high performance.
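The key obstacle the abstract names is that a PDF is a nonnegative function, so it cannot in general be expanded directly in an orthogonal series. A minimal sketch of one way around this (an illustration of the idea, not the paper's actual algorithm, and using real coefficients rather than the complex space the paper works in): expand the square root of the PDF in an orthonormal basis, here Hermite functions chosen for illustration, and square the truncated series, which is nonnegative by construction. The function names `hermite_fn` and `approx_pdf` are hypothetical.

```python
import math

def hermite_fn(n, x):
    # Orthonormal Hermite function phi_n(x) = H_n(x) exp(-x^2/2) / sqrt(2^n n! sqrt(pi)),
    # built with the physicists' Hermite recurrence H_k = 2x H_{k-1} - 2(k-1) H_{k-2}.
    h0, h1 = 1.0, 2.0 * x
    if n == 0:
        h = h0
    elif n == 1:
        h = h1
    else:
        for k in range(2, n + 1):
            h0, h1 = h1, 2.0 * x * h1 - 2.0 * (k - 1) * h0
        h = h1
    norm = math.sqrt(2.0 ** n * math.factorial(n) * math.sqrt(math.pi))
    return h * math.exp(-x * x / 2.0) / norm

def approx_pdf(p, order, xs):
    # Expand sqrt(p) in the orthonormal basis; inner products are computed
    # by a simple Riemann sum over the grid xs (assumed uniform).
    dx = xs[1] - xs[0]
    sq = [math.sqrt(p(x)) for x in xs]
    coeffs = []
    for n in range(order + 1):
        c = sum(s * hermite_fn(n, x) for s, x in zip(sq, xs)) * dx
        coeffs.append(c)

    def p_hat(x):
        # Squaring the truncated series guarantees p_hat(x) >= 0 everywhere,
        # sidestepping the positivity problem of a direct series for p.
        return sum(c * hermite_fn(n, x) for n, c in enumerate(coeffs)) ** 2

    return p_hat
```

For example, expanding the standard normal density to order 10 on a grid over [-8, 8] reproduces it closely, and the reconstruction stays nonnegative even in the tails where a direct truncated series for the PDF itself could dip below zero.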

