首页> 外文会议>Chinese Control Conference >Finite convergence of value iteration algorithm for discounted infinite horizon optimal control of stochastic logical systems

【24h】

Finite convergence of value iteration algorithm for discounted infinite horizon optimal control of stochastic logical systems

机译：随机逻辑系统的无穷大折扣最优控制的值迭代有限收敛算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper investigates the discounted infinite horizon optimal control problem for the stochastic multi-valued logical dynamical systems with finite states. After giving the equivalent descriptions of the stochastic logical dynamical system in terms of Markov decision process, the infinite horizon optimization problem is presented in an algebraic form. Based on the semi-tensor product of matrices and the increasing-dimension technique, it is proved that the optimal stationary policy is obtained by a finite horizon value iteration process, and an exact horizon length estimation for the finite horizon approach is derived. As an application, the optimization problem of Human-machine game is investigated.

机译：本文研究了具有有限状态的随机多值逻辑动力系统的无穷无限最优水平折扣控制问题。在根据马尔可夫决策过程给出了随机逻辑动力学系统的等效描述之后，无限代数优化问题以代数形式提出。基于矩阵的半张量积和增维技术，证明了通过有限水平值迭代过程获得了最优平稳策略，并推导了有限水平方法的精确水平长度估计。作为一种应用，研究了人机游戏的优化问题。

著录项

来源
《Chinese Control Conference》|2016年|216-222|共7页
会议地点
作者
Yuhu Wu; Ximing Sun; Wei Wang; Tielong Shen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Optimal control; Aerospace electronics; Cost function; Markov processes; Convergence; Man-machine systems;

机译：最优控制;航空电子;成本函数;马尔可夫过程;收敛;人机系统;

相似文献

外文文献
中文文献
专利

1. An algebraic expression of finite horizon optimal control algorithm for stochastic logical dynamical systems [J] . Wu Yuhu, Shen Tielong Systems and Control Letters . 2015,第Null期

机译：随机逻辑动力系统有限水平最优控制算法的代数表达式
2. Neural approximations in discounted infinite-horizon stochastic optimal control problems [J] . Giorgio Gnecco, Marcello Sanguineti Engineering Applications of Artificial Intelligence . 2018,第SEPa期

机译：无限水平对折随机最优控制问题的神经网络近似
3. Indefinite Mean-Field Stochastic Linear-Quadratic Optimal Control: From Finite Horizon to Infinite Horizon [J] . Yuan-Hua Ni, Xun Li, Ji-Feng Zhang IEEE Transactions on Automatic Control . 2016,第11期

机译：不确定平均场随机线性二次最优控制：从有限水平到无限水平
4. Finite convergence of value iteration algorithm for discounted infinite horizon optimal control of stochastic logical systems [C] . Yuhu Wu, Ximing Sun, Wei Wang, Chinese Control Conference . 2016

机译：有限折叠价值迭代算法折扣无限地平线的随机逻辑系统最优控制
5. On the convergence of model -free policy iteration algorithms for reinforcement learning: Stochastic approximation under discontinuous mean dynamics. [D] . Williams, John Kevin. 2000

机译：关于用于增强学习的无模型策略迭代算法的收敛：不连续平均动力学下的随机逼近。
6. Movement duration Fitts’s law and an infinite-horizon optimal feedback control model for biological motor systems [O] . Ning Qian, Yu Jiang, Zhong-Ping Jiang, -1

机译：运动持续时间FITTS法和生物电机系统的无限范围最佳反馈控制模型
7. Indefinite mean-field stochastic linear-quadratic optimal control : from finite horizon to infinite horizon [O] . Ni YH, Li X, Zhang JF 2016

机译：不确定平均场随机线性-二次最优控制：从有限水平到无限水平
8. Existence of Optimal Stochastic Controls (I). Convergence of the Finite Difference Approximations of a Discounted Problem for a Diffusion. [R] . kushner, h. j. 1974

机译：最优随机控制的存在性（I）。扩散问题折扣问题有限差分逼近的收敛性。

Finite convergence of value iteration algorithm for discounted infinite horizon optimal control of stochastic logical systems

摘要

著录项

相似文献

相关主题

期刊订阅