Rationally inattentive Markov decision processes over a finite horizon

机译：有限范围内的注意力不集中的马尔可夫决策过程

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The framework of Rationally Inattentive Markov Decision Processes (RIMDPs) is an extension of Partially Observable Markov Decision Processes (POMDP) to the case when the observation kernel that governs the information gathering process is also selected by the decision maker. At each time, an observation kernel is chosen subject to a constraint on the Shannon conditional mutual information between the history of states and the current observation given the history of past observations. This set-up naturally arises in the context of networked control systems, artificial intelligence, and economic decision-making by boundedly rational agents. We show that, under certain structural assumptions on the information pattern and on the optimal policy, Bellman's Principle of Optimality can be used to derive a general dynamic programming recursion for this problem that reduces to solving a sequence of conditional rate-distortion problems.

机译：理性不专心的马尔可夫决策过程（RIMDP）的框架是部分可观察的马尔可夫决策过程（POMDP）的扩展，适用于决策者还选择管理信息收集过程的观察内核的情况。每次都选择观察核，但要考虑到状态历史和当前观察之间在给定过去观察历史的情况下对香农条件互信息的约束。这种设置自然是在网络控制系统，人工智能以及由有限理性主体进行的经济决策的背景下产生的。我们表明，在信息模式和最优策略的某些结构假设下，Bellman的最优性原理可用于得出该问题的一般动态规划递归，该递归可简化为解决一系列条件率失真问题。

著录项

来源
《Asilomar Conference on Signals, Systems and Computers》|2017年|621-627|共7页
会议地点
作者
Ehsan Shafieepoorfard; Maxim Raginsky;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Kernel; Markov processes; Mutual information; History; Economics; Decision making; Process control;

机译：内核;马尔可夫过程;共同信息;历史;经济学;决策制定;过程控制;

相似文献

外文文献
中文文献
专利

1. Optimal decisions for continuous time Markov decision processes overn finite planning horizons [J] . Buchholz Peter, Dohndorf Iryna, Scheftelowitsch Dimitri Computers & operations research . 2017,第jana期

机译：有限规划范围内连续时间马尔可夫决策过程的最优决策
2. Reachability and Safety Objectives in Markov Decision Processes on Long but Finite Horizons [J] . Journal of Optimization Theory and Applications . 2020,第3期

机译：马尔可夫决策过程中的可达性和安全目标长但有限的视野
3. Numerical analysis of continuous time Markov decision processes over finite horizons [J] . Peter Buchholz, Ingo Schulz Computers & operations research . 2011,第3期

机译：有限时间范围内连续时间马尔可夫决策过程的数值分析
4. Rationally inattentive Markov decision processes over a finite horizon [C] . Ehsan Shafieepoorfard, Maxim Raginsky Asilomar Conference on Signals, Systems, and Computers . 2017

机译：合理地无限期的马尔可夫决策过程在一个有限的地平线上
5. Finite memory policies for partially observable Markov decision processes. [D] . Lusena, Christopher David. 2001

机译：用于部分可观察的马尔可夫决策过程的有限内存策略。
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O] . Rajesh P. N. Rao 2010

机译：不确定性下的决策：基于部分可观察的马尔可夫决策过程的神经模型
7. Rationally inattentive control of Markov processes [O] . Shafieepoorfard, Ehsan, Raginsky, Maxim, Meyn, Sean P. 2016

机译：对马尔可夫过程的理性疏忽控制

Rationally inattentive Markov decision processes over a finite horizon

摘要

著录项

相似文献

相关主题

期刊订阅