Discrete-time control with non-constant discount factor

Jasso-Fuentes Hector; Menaldi Jose-Luis; Prieto-Rumeau Tomas

首页> 外文期刊>Mathematical methods of operations research >Discrete-time control with non-constant discount factor

【24h】

Discrete-time control with non-constant discount factor

机译：Discrete-time control with non-constant discount factor

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

This paper deals with discrete-time Markov decision processes (MDPs) with Borel state and action spaces, and total expected discounted cost optimality criterion. We assume that the discount factor is not constant: it may depend on the state and action; moreover, it can even take the extreme values zero or one. We propose sufficient conditions on the data of the model ensuring the existence of optimal control policies and allowing the characterization of the optimal value function as a solution to the dynamic programming equation. As a particular case of these MDPs with varying discount factor, we study MDPs with stopping, as well as the corresponding optimal stopping times and contact set. We show applications to switching MDPs models and, in particular, we study a pollution accumulation problem.

著录项

来源
《Mathematical methods of operations research》 |2020年第2期|377-399|共23页
作者
Jasso-Fuentes Hector; Menaldi Jose-Luis; Prieto-Rumeau Tomas;
展开▼
作者单位

CINVESTAV IPN, Math Dept, A Postal 14-740, Mexico City 07000, DF, Mexico;

Wayne State Univ, Dept Math, Detroit, MI 48202 USA;

UNED, Dept Stat & Operat Res, Calle Senda Rey 9, Madrid 28040, Spain;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类运筹学;
关键词
Markov decision processes; Dynamic programming; Optimal stopping problems;

Discrete-time control with non-constant discount factor

摘要

著录项

相关主题

期刊订阅