首页> 外文会议>SICE-ICASE International Joint Conference >A Policy-Improving System with a Mixture of Bayesian Networks Adapting Agents to Continuously Changing Environments
【24h】

A Policy-Improving System with a Mixture of Bayesian Networks Adapting Agents to Continuously Changing Environments

机译:一种政策改进系统,具有贝叶斯网络混合适应代理以不断变化的环境

获取原文

摘要

A variety of adaptive learning systems which adapt themselves to complicated environments has been studied and developed in the broad field of AI researches. For example, many reinforcement learning (RL) methods have been proposed to adapt agents to the environments. At the same time, Bayesian network (BN), one of the stochastic models, has attracted increasing attention due to its noise robustness, reasoning power, etc. We have proposed a system improving RL agents' policies with a mixture model of BNs, and have evaluated the adapting performance of our system. Each structure of BN can be regarded as a stochastic knowledge representation in the policy acquired through RL. It has been confirmed that the agent with our system could improve their policies by the information derived from the mixture, and then could adequately adapt to dynamically-switched environments. In this research, we propose a method to appropriately normalize mixing parameters of the mixture for the use in common adaptive learning systems, and evaluate the fundamental performance of our system in continuously-changing environment.
机译:研究了各种适应性学习系统,该系统适应了复杂的环境,并在广泛的AI研究领域中进行了研究。例如,已经提出了许多增强学习(RL)方法以使代理适应环境。与此同时,贝叶斯网络(BN)是一种随机模型之一,由于其噪音鲁棒性,推理权力等引起了越来越多的关注。我们提出了一种用BNS的混合模型改善RL代理商的政策,以及已经评估了我们系统的适应性能。 BN的每个结构可以被视为通过RL获取的政策中的随机知识表示。已经证实,具有我们系统的代理可以通过来自混合的信息来改善其策略,然后可以充分适应动态交换环境。在这项研究中,我们提出了一种方法来适当地将混合物的混合参数正常化用于共同的自适应学习系统的用途,并评估我们系统在不断变化的环境中的基本性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号