Speeding Up Incremental Learning Using Data Efficient Guided Exploration

机译：使用数据高效导向探索加快增量学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

To cope with varying conditions, motor primitives (MPs) must support generalization over task parameters to avoid learning separate primitives for each situation. In this regard, deterministic and probabilistic models have been proposed for generalizing MPs to new task parameters, thus providing limited generalization. Although generalization of MPs using probabilistic models has been studied, it is not clear how such generalizable models can be learned efficiently. Reinforcement learning can be more efficient when the exploration process is tuned with data uncertainty, thus reducing unnecessary exploration in a data-efficient way. We propose an empirical Bayes method to predict uncertainty and utilize it for guiding the exploration process of an incremental learning framework. The online incremental learning framework uses a single human demonstration for constructing a database of MPs. The main ingredients of the proposed framework are a global parametric model (GPDMP) for generalizing MPs for new situations, a model-free policy search agent for optimizing the failed predicted MPs, model selection for controlling the complexity of GPDMP, and empirical Bayes for extracting the uncertainty of MPs prediction. Experiments with a ball-in-a-cup task demonstrate that the global GPDMP model generalizes significantly better than linear models and Locally Weighted Regression especially in terms of extrapolation capability. Furthermore, the model selection has successfully identified the required complexity of GPDMP even with few training samples while satisfying the Occam Razor's prinicple. Above all, the uncertainty predicted by the proposed empirical Bayes approach successfully guided the exploration process of the model-free policy search. The experiments indicated statistically significant improvement of learning speed over covariance matrix adaptation (CMA) with a significance of p = 0.002.

机译：为了应对不同的条件，电机基元（MPS）必须支持任务参数的泛化，以避免为每种情况学习单独的基元。在这方面，已经提出了确定性和概率模型，用于将MP概括为新的任务参数，从而提供有限的泛化。尽管研究了使用概率模型的MPS的泛化，但目前尚不清楚如何有效地学习这些更广泛的模型。当探索过程随数据不确定性调整勘探过程时，强化学习可能更有效，从而以数据有效的方式降低了不必要的探索。我们提出了一种经验贝叶斯方法来预测不确定性并利用它来指导增量学习框架的勘探过程。在线增量学习框架使用单个人类演示来构建MPS数据库。所提出的框架的主要成分是全局参数模型（GPDMP），用于概括新情况的MPS，用于优化失败的预测MPS，用于控制GPDMP复杂性的无模型策略搜索代理，以及用于提取的经验贝叶MPS预测的不确定性。带有球形杯前任务的实验表明，全球GPDMP模型概括了比线性模型和局部加权回归概括为外推能力。此外，即使在满足欧洲潮流的春天的情况下，模型选择也成功地确定了GPDMP的所需复杂性。最重要的是，所提出的经验贝叶斯方法预测的不确定性成功引导了无模式策略搜索的探索过程。实验表明，具有协方差矩阵适应（CMA）的学习速度的统计显着改善，其具有P = 0.002的显着性。

著录项

来源
《IEEE International Conference on Robotics and Automation》|2018年|4921-5941p|共8页
会议地点
作者
Murtaza Hazara; Ville Kyrki;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP242-53;
关键词

相似文献

外文文献
中文文献
专利

1. AN ACTIVE EXPLORATION METHOD FOR DATA EFFICIENT REINFORCEMENT LEARNING [J] . DONGFANG ZHAO, JIAFENG LIU, Rui WU, International Journal of Applied Mathematics and Computer Science . 2019,第2期

机译：数据有效加固学习的主动探索方法
2. An Active Exploration Method for Data Efficient Reinforcement Learning [J] . Dongfang Zhao, Jiafeng Liu, Rui Wu, International journal of applied mathematics and computer science . 2019,第2期

机译：数据有效增强学习的积极探索方法
3. Instructional support for learning with agent-based simulations: A tale of vicarious and guided exploration learning approaches [J] . Dubovi Ilana, Lee Victor R. Computers & education . 2019,第DECa期

机译：通过基于主体的模拟为学习提供教学支持：替代和指导性探索学习方法的故事
4. Speeding Up Incremental Learning Using Data Efficient Guided Exploration [C] . Murtaza Hazara, Ville Kyrki IEEE International Conference on Robotics and Automation . 2018

机译：使用数据高效导向探索加快增量学习
5. Efficient Incremental Model Learning on Data Streams [D] . Chen, Xilun. 2019

机译：高效增量模型在数据流上学习
6. Incremental Learning With Selective Memory (ILSM): Towards Fast Prostate Localization for Image Guided Radiotherapy [O] . Yaozong Gao, Yiqiang Zhan, Dinggang Shen -1

机译：选择性记忆增量学习（ILSM）：影像引导放射治疗的快速前列腺定位
7. Speeding Up Incremental Learning Using Data Efficient Guided Exploration [O] . Murtaza Hazara, Ville Kyrki 2018

机译：使用数据高效导向探索加快增量学习
8. Efficient Incremental Induction of Decision Lists. Can Incremental LearningOutperform Non-Incremental Learning [R] . Shen, W. M. 1996

机译：决策列表的有效增量归纳。增量学习可以改变非增量学习

Speeding Up Incremental Learning Using Data Efficient Guided Exploration

摘要

著录项

相似文献

相关主题

期刊订阅