首页> 外国专利> A method for converting continuous systems into Markov decision processes.

A method for converting continuous systems into Markov decision processes.

机译:一种将连续系统转换为马尔可夫决策过程的方法。

摘要

A continuous dynamical system is converted to a Markov decision process (MDP) with discrete states. A predetermined number of continuous states of the continuous system is selected, wherein each continuous state corresponds to one discrete state of the MDP. Delaunay triangulation is applied to the continuous states to produce a set of triangles, wherein vertices of each triangle represent the continuous states. For each discrete state, a next discrete state y = f ( x, a ) is determined, wherein x represents the continuous state corresponding to the discrete state, a is a control action, and f is a non-linear transition function for the continuous state. A particular triangle containing the next discrete state y is identified, and the next discrete state y is expressed as probabilities of transitioning to the discrete states corresponding to the continuous states x represented by the vertices of the particular triangle.
机译:连续的动力学系统被转换为具有离散状态的马尔可夫决策过程(MDP)。选择连续系统的预定数量的连续状态,其中每个连续状态对应于MDP的一个离散状态。将Delaunay三角剖分应用于连续状态以生成一组三角形,其中每个三角形的顶点表示连续状态。对于每个离散状态,确定下一个离散状态y = f(x,a),其中x表示与离散状态相对应的连续状态,a是控制动作,f是该连续状态的非线性转换函数州。标识包含下一个离散状态y的特定三角形,并将下一个离散状态y表示为转变为与该特定三角形的顶点表示的连续状态x对应的离散状态的概率。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号