首页> 外国专利> A method for converting continuous systems into Markov decision processes.

A method for converting continuous systems into Markov decision processes.

机译：一种将连续系统转换为马尔可夫决策过程的方法。

页面导航

摘要
著录项
相似文献

摘要

A continuous dynamical system is converted to a Markov decision process (MDP) with discrete states. A predetermined number of continuous states of the continuous system is selected, wherein each continuous state corresponds to one discrete state of the MDP. Delaunay triangulation is applied to the continuous states to produce a set of triangles, wherein vertices of each triangle represent the continuous states. For each discrete state, a next discrete state y = f ( x, a ) is determined, wherein x represents the continuous state corresponding to the discrete state, a is a control action, and f is a non-linear transition function for the continuous state. A particular triangle containing the next discrete state y is identified, and the next discrete state y is expressed as probabilities of transitioning to the discrete states corresponding to the continuous states x represented by the vertices of the particular triangle.

机译：连续的动力学系统被转换为具有离散状态的马尔可夫决策过程（MDP）。选择连续系统的预定数量的连续状态，其中每个连续状态对应于MDP的一个离散状态。将Delaunay三角剖分应用于连续状态以生成一组三角形，其中每个三角形的顶点表示连续状态。对于每个离散状态，确定下一个离散状态y = f（x，a），其中x表示与离散状态相对应的连续状态，a是控制动作，f是该连续状态的非线性转换函数州。标识包含下一个离散状态y的特定三角形，并将下一个离散状态y表示为转变为与该特定三角形的顶点表示的连续状态x对应的离散状态的概率。

著录项

公开/公告号JP5562216B2

专利类型
公开/公告日2014-07-30

原文格式PDF
申请/专利权人ミツビシ・エレクトリック・リサーチ・ラボラトリーズ・インコーポレイテッド;
展开▼

申请/专利号JP20100257949
发明设计人ダニエル・エヌ・ニコヴスキ;
展开▼

申请日2010-11-18
分类号G05B13/02;D06F33/02;
国家 JP
入库时间 2022-08-21 16:13:34

相似文献

专利
外文文献
中文文献