International Conference on Autonomous Agents and Multiagent Systems

Boosted and Reward-regularized Classification for Apprenticeship Learning



Abstract

This paper deals with the problem of learning from demonstrations, in which an agent called the apprentice tries to learn a behavior from demonstrations given by another agent, called the expert. To address this problem, we place ourselves in the Markov Decision Process (MDP) framework, which is well suited to sequential decision-making problems. One way to tackle the problem is to reduce it to classification, but doing so ignores the MDP structure. Other methods do take the MDP structure into account, but they need to solve MDPs, which is a difficult task, and/or require a choice of features, which is problem-dependent. The main contribution of this paper is to extend a large-margin approach, a classification method, by adding a regularization term that takes the MDP structure into account. The derived algorithm, called Reward-regularized Classification for Apprenticeship Learning (RCAL), does not need to solve MDPs. Its major advantage, however, is that it can be boosted, which avoids the choice of features, a drawback of parametric approaches. A state-of-the-art experiment (Highway) and generic experiments (structured Garnets) are conducted to compare the performance of RCAL with algorithms from the literature.
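To make the idea concrete, here is a hedged sketch of what such a reward-regularized large-margin criterion could look like; the exact criterion, notation, and sampling scheme below are assumptions for illustration, not taken from the paper. Given expert state-action pairs (s_i, a_i^E), a score function q, a margin function l, and sampled transitions (s, a, s'), one could minimize

\min_{q}\ \frac{1}{N}\sum_{i=1}^{N}\Big[\max_{a\in A}\big(q(s_i,a)+l(s_i,a)\big)-q\big(s_i,a_i^E\big)\Big]\;+\;\lambda\sum_{(s,a,s')}\Big|\,q(s,a)-\gamma\max_{a'\in A} q(s',a')\,\Big|

where the first term is a standard large-margin classification loss on the expert data, and the second term penalizes the magnitude of the reward implied by q through the Bellman equation, which is how the MDP structure enters without solving an MDP. Minimizing such a criterion by functional gradient descent over weak learners would give the boosted, feature-free variant described in the abstract.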
