首页>
外国专利>
Value function representation method of reinforcement learning and apparatus using this
Value function representation method of reinforcement learning and apparatus using this
展开▼
机译:强化学习的价值函数表示方法及装置
展开▼
页面导航
摘要
著录项
相似文献
摘要
Reinforcement learning is one of the intellectual operations applied to autonomously moving robots etc. It is a system having excellent sides, for example, enabling operation in unknown environments. However, it has the basic problem called the “incomplete perception problem”. A variety of solution has been proposed, but none has been decisive. The systems also become complex. A simple and effective method of solution has been desired.;A complex value function defining a state-action value by a complex number is introduced. Time series information is introduced into a phase part of the complex number value. Due to this, the time series information is introduced into the value function without using a complex algorithm, so the incomplete perception problem is effectively solved by simple loading of the method.
展开▼