Online Learning with Constraints

机译：有约束的在线学习

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study online learning where the objective of the decision maker is to maximize her average long-term reward given that some average constraints are satisfied along the sample path. We define the reward-in-hindsight as the highest reward the decision maker could have achieved, while satisfying the constraints, had she known Nature's choices in advance. We show that in general the reward-in-hindsight is not attainable. The convex hull of the reward-in-hindsight function is, however, attainable. For the important case of a single constraint the convex hull turns out to be the highest attainable function. We further provide an explicit strategy that attains this convex hull using a calibrated forecasting rule.

机译：我们研究在线学习，决策者的目标是最大化她的平均长期奖励，因为在样本路径上满足一些平均约束。我们将见解奖励定义为决策者在满足约束条件的前提下，如果她事先知道Nature的选择，则可以获得的最高奖励。我们表明，总的来说，事后发现是无法实现的。但是，可以实现事后奖励功能的凸包。对于单个约束的重要情况，凸包被证明是可获得的最高功能。我们进一步提供了使用校准的预测规则来达到该凸包的显式策略。

著录项

来源
《Annual Conference on Learning Theory(COLT 2006); 20060622-25; Pittsburgh,PA(US)》|2006年|P.529-543|共15页
会议地点 PittsburghPA(US)
作者
Shie Mannor; John N. Tsitsiklis;
展开▼
作者单位

Department of Electrical and Computer Engingeering McGill University, Quebec H3A-2A7;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. OL4EL: Online Learning for Edge-Cloud Collaborative Learning on Heterogeneous Edges with Resource Constraints [J] . Han Qing, Yang Shusen, Ren Xuebin, IEEE Communications Magazine . 2020,第5期

机译：OL4EL：在资源限制的异构边缘上的边缘云协同学习在线学习
2. Online barrier-actor-critic learning for H_∞ control with full-state constraints and input saturation [J] . Yang Yongliang, Ding Da-Wei, Xiong Haoyi, Journal of the Franklin Institute . 2020,第6期

机译：在线障碍演员 - 以全状态约束和输入饱和度控制H_∞控制
3. Multi-agent graphical games with input constraints: an online learning solution [J] . Tianxiang WANG, Bingchang WANG, Yong LIANG 控制理论与应用（英文版） . 2020,第002期

机译：输入受限的多主体图形游戏：在线学习解决方案
4. Online Adaptive Learning in Energy Trading Stackelberg Games with Time-Coupling Constraints [C] . Styliani I. Kampezidou, Justin Romberg, Kyriakos G. Vamvoudakis, Annual American Control Conference . 2021

机译：在线自适应学习在能源交易Stackelberg游戏中的时间耦合约束
5. Quasi-experimental study: Generational (Y, X, & Boomer) reactions and learning effectiveness of asynchronous, mobile-based online learning versus asynchronous, computer-based online learning. [D] . Smith III, E. R. 2014

机译：准实验研究：异步，基于移动的在线学习与异步，基于计算机的在线学习的世代（Y，X和Boomer）反应和学习效果。
6. Coronavirus Disease 2019 (COVID-19) Learning Online: A Flipped Classroom Based on Micro-Learning Combined with Case-Based Learning in Undergraduate Medical Students [O] . Qiaohui Qian, Yuzhong Yan, Fei Xue, 2021

机译：Coronavirus疾病2019（Covid-19）在线学习：基于微观学习的翻转教室结合了基于案例的学习本科医学生
7. Learning with Online Constraints: Shifting Concepts and Active Learning [O] . Monteleoni Claire E. 2006

机译：在线约束学习：转变观念和积极学习

Online Learning with Constraints

摘要

著录项

相似文献

相关主题

期刊订阅