IEEE International Conference on Services Computing

BSE-MAML: Model Agnostic Meta-Reinforcement Learning via Bayesian Structured Exploration



Abstract

Deep reinforcement learning (RL) is playing an increasingly important role in web services such as news recommendation, vulnerability detection, and personalized services. Exploration is a key component of RL, determining whether these RL-based applications can eventually find effective solutions. In this paper, we propose a novel gradient-based fast adaptation approach for model-agnostic meta-reinforcement learning via Bayesian structured exploration (BSE-MAML). BSE-MAML can effectively learn exploration strategies from prior experience by updating the policy with an embedded latent space via a Bayesian mechanism. The coherent stochasticity injected through the latent space is more efficient than random noise and produces exploration strategies that perform well in novel environments. We have conducted extensive experiments to evaluate BSE-MAML. Experimental results show that BSE-MAML achieves better exploration performance in realistic environments with sparse rewards, compared to state-of-the-art meta-RL algorithms, RL methods that do not learn exploration strategies, and task-agnostic exploration approaches.
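The abstract's central idea, injecting "coherent stochasticity" by sampling one latent exploration variable per episode rather than i.i.d. noise at every step, can be illustrated with a toy sketch. This is not the paper's implementation: the linear policy, the 1-D dynamics, and all names here are illustrative assumptions, intended only to show why a per-episode latent yields temporally structured behavior while per-step noise yields uncorrelated dithering.

```python
import numpy as np

rng = np.random.default_rng(0)
theta = np.array([0.5])  # toy policy parameters, held fixed for this sketch

def rollout(per_episode_latent, steps=200):
    """Collect actions from a linear policy in a toy 1-D environment.

    If per_episode_latent is True, the stochastic component z is drawn
    once per episode (coherent, latent-space-style exploration); otherwise
    fresh noise is drawn at every step (conventional action noise).
    """
    state = np.zeros(1)
    z = rng.standard_normal(1)  # per-episode latent, sampled once
    actions = []
    for _ in range(steps):
        noise = z if per_episode_latent else rng.standard_normal(1)
        action = theta @ state + noise
        actions.append(action.item())
        state = 0.9 * state + 0.1 * action  # toy dynamics
    return np.array(actions)

def lag1_autocorr(x):
    """Lag-1 autocorrelation: a simple measure of temporal coherence."""
    x = x - x.mean()
    return (x[:-1] @ x[1:]) / (x @ x)

coherent = lag1_autocorr(rollout(per_episode_latent=True))
iid = lag1_autocorr(rollout(per_episode_latent=False))
print(f"coherent latent: {coherent:.2f}, i.i.d. noise: {iid:.2f}")
```

The per-episode latent drives the agent consistently in one direction for the whole episode (high autocorrelation), which is the kind of structured exploration that helps in sparse-reward settings; per-step noise mostly cancels itself out. In BSE-MAML the latent is additionally treated as a Bayesian posterior updated by the gradient-based adaptation step, which this sketch does not attempt to reproduce.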
