The vanishing discount approach to average reward optimality: the strongly and the weakly continuous cases

aacute; eacute; ndez-Lerma; On; s Prieto-Rumeau; simo Hern; Tom

首页> 外文期刊>Morfismos >The vanishing discount approach to average reward optimality: the strongly and the weakly continuous cases

【24h】

The vanishing discount approach to average reward optimality: the strongly and the weakly continuous cases

机译：消失的折衷方法可实现平均奖励最优：强和弱连续案例

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

著录项
相似文献
相关主题

著录项

来源
《Morfismos》 |2008年第2期|共页
作者
aacute; eacute; ndez-Lerma; On; s Prieto-Rumeau; simo Hern; Tom;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类数学;
关键词

相似文献

外文文献
中文文献
专利

1. THE VANISHING DISCOUNT APPROACH FOR THE AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES [J] . Costa OLV, Dufour F Journal of Applied Probability . 2009,第4期

机译：分段确定性马尔可夫过程的平均连续控制的消失折扣法
2. Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: The fixed-point approach revisited [J] . Vega-Amaya Oscar Journal of Mathematical Analysis and Applications . 2018,第1期

机译：Markov决策过程的平均成本优化方程解决方案与弱连续内核：重新发现的定点方法
3. Contraction conditions for average and alpha-discount optimality in countable state Markov games with unbounded rewards [J] . Altman E, Hordijk A, Spieksma FM Mathematics of operations research . 1997,第3期

机译：具有无穷奖励的可数状态Markov游戏中平均和alpha折扣最优的收缩条件
4. Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning [C] . Sridhar Mahadevan Machine learning . 1996

机译：敏感性折扣最优：统一折扣和平均奖励强化学习
5. The Effects of Values Activation on Temptation Coping and Confidence: Testing Delayed Reward Discounting and Religiosity/Spirituality as Moderators [D] . Varma, Malini 2020

机译：价值激活对诱惑应对和自信心的影响：以主持人身份测试延迟奖励折扣和宗教/灵性
6. Perceived food palatability blood glucose level and future discounting: Lack of evidence for blood glucose level’s impact on reward discounting [O] . Rafał Muda, Przemysław Sawicki, Michał Ginszt 2021

机译：感知食物适口性血糖水平和未来折扣：缺乏血糖水平对奖励折扣的影响的证据
7. The Vanishing Discount Approach for the Average Continuous Control of Piecewise Deterministic Markov Processes [O] . O. L. V. Costa, F. Dufour 2009

机译：分段确定性马尔可夫流程平均连续控制的消失折扣方法
8. Contraction Conditions for Average and alpha-Discount Optimality in CountableState Markov Games with Unbounded Rewards [R] . Altman, E., Hordijk, A., Spieksma, F. M. 1994

机译：具有无界奖励的Countablestate markov游戏中平均和alpha折扣最优性的收缩条件

The vanishing discount approach to average reward optimality: the strongly and the weakly continuous cases

著录项

相似文献

相关主题

期刊订阅