In a controlled Markov set-chain with finite state and action spaces, we find a policy, called average-optimal, that maximizes the Cesàro sums of the per-period rewards over all stationary policies under a partial order. Under uniformly scrambling conditions, the dynamic programming operator for our model is proved to be a contraction in a span seminorm. By analysing the behavior of the expected total reward over the T-horizon as T approaches ∞ via the fixed point of this span-contraction operator, we give a constructive proof of the existence of an average-optimal policy.
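The span-contraction phenomenon behind the proof can be illustrated numerically. The sketch below is not the paper's set-chain model; it is a minimal sketch, assuming an ordinary finite MDP whose transition matrices are scrambling (here, strictly positive), under which the Bellman operator contracts in the span seminorm sp(v) = max v − min v. The transition kernel `P` and reward `r` are hypothetical data chosen only for illustration.

```python
import numpy as np

def span(v):
    # Span seminorm: sp(v) = max_i v_i - min_i v_i
    return float(np.max(v) - np.min(v))

# Hypothetical 2-state, 2-action MDP; all rows strictly positive,
# hence every pair of rows is scrambling.
P = np.array([[[0.7, 0.3], [0.4, 0.6]],   # P[a, s, s']
              [[0.2, 0.8], [0.5, 0.5]]])
r = np.array([[1.0, 0.0],                 # r[a, s]
              [0.5, 2.0]])

def T(v):
    # Dynamic programming operator: (Tv)(s) = max_a [ r(a,s) + sum_s' P(a,s,s') v(s') ]
    return np.max(r + P @ v, axis=0)

# One application of T shrinks the span distance between two value vectors.
v, w = np.array([1.0, 0.0]), np.array([0.0, 2.0])
print(span(T(v) - T(w)) <= span(v - w))  # → True
```

Iterating the relative values v ← T(v) − (Tv)(s₀) then converges geometrically to a fixed point, from which the optimal average reward (gain) and an average-optimal stationary policy can be read off; this mirrors the constructive argument in the abstract.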