Econometrica

Incomplete learning from endogenous data in dynamic allocation

Abstract

This paper studies the problem of learning from endogenous data by an economic agent who chooses actions sequentially from a finite set {a_1, ..., a_k}, such that the reward R(a_j) of action a_j has a probability distribution depending on an unknown parameter θ_j with prior distribution Π^(j). The agent's objective is to maximize the total discounted reward ∫…∫ E_{θ_1,…,θ_k}{ Σ_{t=0}^∞ β^t R(X_{t+1}) } dΠ^(1)(θ_1) … dΠ^(k)(θ_k), where 0 < β < 1 is a discount factor and X_t denotes the action chosen by the agent at time t. The optimal solution to this problem, commonly called the "discounted multi-armed bandit problem," was shown by Gittins and Jones (1974) and Gittins (1979) to be the "index rule," which chooses at each stage the action with the largest "dynamic allocation index" (DAI). The theory of multi-armed bandits has been applied to decision making in labor markets (cf. Jovanovic (1979), Mortensen (1985)), to general search problems involving nondurable goods (cf. Banks and Sundaram (1992)), and to pricing under demand uncertainty (cf. Rothschild (1974)).
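As an illustration of the index rule (a sketch, not the paper's own method), the DAI of a single Bernoulli arm with a Beta posterior can be approximated by Gittins' calibration idea: find the known per-period reward λ of a "standard" arm at which the agent is indifferent between retiring on λ forever (value λ/(1−β)) and continuing to sample the uncertain arm. The truncation horizon and the boundary approximation below are my own simplifying assumptions.

```python
def gittins_index_beta(a, b, beta=0.9, horizon=40, tol=1e-4):
    """Approximate the Gittins index (DAI) of a Bernoulli arm whose
    success probability has a Beta(a, b) posterior.

    Calibration: bisect on the retirement reward lam; the index is the
    lam at which playing the arm and retiring have equal value. The
    Beta-Bernoulli state space is truncated after `horizon` further
    observations (an approximation; raise `horizon` for accuracy).
    """
    def retire_value(lam):
        return lam / (1.0 - beta)

    def value(lam):
        # Backward induction over posterior states (s successes,
        # f failures observed so far), from the truncation boundary.
        V = {}
        for n in range(horizon, -1, -1):
            for s in range(n + 1):
                f = n - s
                p = (a + s) / (a + s + b + f)  # posterior mean
                if n == horizon:
                    # Boundary approximation: play forever at the mean.
                    cont = p / (1.0 - beta)
                else:
                    cont = (p * (1.0 + beta * V[(s + 1, f)])
                            + (1.0 - p) * beta * V[(s, f + 1)])
                V[(s, f)] = max(retire_value(lam), cont)
        return V[(0, 0)]

    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        lam = 0.5 * (lo + hi)
        if value(lam) > retire_value(lam) + 1e-12:
            lo = lam  # playing still beats retiring: index exceeds lam
        else:
            hi = lam
    return 0.5 * (lo + hi)
```

With k such arms, the index rule plays at each stage the arm whose current posterior state (a_j, b_j) has the largest index; because the exploration value is positive, the index exceeds the posterior mean whenever the arm is still uncertain.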
