首页> 外文会议>International Conference on Modern Engineering Solutions for the Industry >A novel framework of representation policy iteration based on Fuzzy C-means clustering method
【24h】

A novel framework of representation policy iteration based on Fuzzy C-means clustering method

机译:基于模糊C均值聚类方法的一种新颖的表示策略迭代框架

获取原文

摘要

To resolve the subset sampling problem in the representation policy iteration (RPI) method in Reinforcement Learning, a sub-sampling method - Fuzzy C-means clustering method is proposed in this paper. Fuzzy C-means method is used to select the representative points from the collected samples in the proposed method. Based on the proposed method, an integrated RPI algorithm is introduced. Illustrative experiments on Mountain Car problem were accomplished, to show the excellent performance of the introduced RPI algorithm. In addition, the method using the previous trajectory-based sub-sampling method is also presented, to demonstrate the advantages of the proposed method.
机译:为了解决强化学习中的表示策略迭代(RPI)方法中的子集采样问题,本文提出了一种子采样方法 - 模糊C-MEARY聚类方法。 模糊C-均值方法用于以所提出的方法从收集的样本中选择代表点。 基于所提出的方法,介绍了一种集成的RPI算法。 完成了山地车问题的说明性实验,表明了引入的RPI算法的优异性能。 另外,还提出了使用先前轨迹的子采样方法的方法,以证明所提出的方法的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号