USING CYCLIC MARKOV DECISION PROCESS TO DETERMINE OPTIMUM POLICY
Abstract
A method for determining an optimum policy using a Markov decision process in which T subspaces, each containing at least one state, have a cyclic structure. The method includes: identifying, with a processor, subspaces that are part of a state space; selecting a t-th subspace (t is a natural number, t ≤ T) from among the identified subspaces; computing the probability, and the expected value of the cost, of going from one or more states in the selected t-th subspace to one or more states in the t-th subspace in the following cycle; and recursively computing a value and an expected value of a cost, based on the computed probability and expected value of the cost, sequentially, starting from the (t−1)-th subspace.
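The steps in the abstract can be illustrated with a minimal sketch. It is not the patented procedure: it only evaluates one fixed policy rather than searching for the optimum, and it assumes a discount factor `gamma`, that transitions lead only from subspace k to subspace (k+1) mod T, and that the selected subspace is index 0. The names `evaluate_cyclic_policy`, `P`, `c`, and `gamma` are hypothetical and do not come from the source.

```python
def mat_vec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * x for a, x in zip(row, v)) for row in m]


def mat_mul(a, b):
    """Multiply matrix a by matrix b (both lists of rows)."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def solve(a, b):
    """Solve a x = b for a small dense system (Gauss-Jordan, partial pivoting)."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def evaluate_cyclic_policy(P, c, gamma):
    """Evaluate a fixed policy on a cyclic MDP (illustrative assumptions only).

    P[k]  : transition matrix from subspace k to subspace (k + 1) % T
    c[k]  : immediate cost of each state in subspace k
    gamma : assumed discount factor, 0 <= gamma < 1
    Returns v, where v[k][i] is the expected discounted cost from state i
    of subspace k.
    """
    T = len(P)
    n0 = len(c[0])

    # Step 1: for the selected subspace (index 0), accumulate the discounted
    # probability of returning to it one cycle later and the expected cost
    # incurred along the way.
    reach = [[1.0 if i == j else 0.0 for j in range(n0)] for i in range(n0)]
    cycle_cost = [0.0] * n0
    for k in range(T):
        cycle_cost = [x + y for x, y in zip(cycle_cost, mat_vec(reach, c[k]))]
        reach = [[gamma * x for x in row] for row in mat_mul(reach, P[k])]

    # Step 2: the value on the selected subspace satisfies
    # v0 = cycle_cost + reach * v0, a linear system in v0 alone.
    a = [[(1.0 if i == j else 0.0) - reach[i][j] for j in range(n0)]
         for i in range(n0)]
    v = [None] * T
    v[0] = solve(a, cycle_cost)

    # Step 3: recurse backwards through the cycle, starting from the
    # subspace just before the selected one.
    for k in range(T - 1, 0, -1):
        nxt = mat_vec(P[k], v[(k + 1) % T])
        v[k] = [ci + gamma * x for ci, x in zip(c[k], nxt)]
    return v
```

In this sketch the linear system is solved only for the states of the selected subspace, not for the whole state space; the remaining subspaces then need just one matrix-vector product each.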