首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Continuous Learning of Action and State Spaces (CLASS)
【24h】

Continuous Learning of Action and State Spaces (CLASS)

机译:持续学习行动和国家空间(类)

获取原文
       

摘要

We present a novel approach to state space discretization for constructivist and reinforcement learning. Constructivist learning and reinforcement learning often operate on a predefined set of states and transitions (state space). AI researchers design algorithms to reach particular goal states in this state space (for example, visualized in the form of goal cells that a robot should reach in a grid). When the size and the dimensionality of the state space increases, however, finding goal states becomes intractable. It is nonetheless assumed that these algorithms can have useful applications in the physical world provided that there is a way to construct a discrete state space of reasonable size and dimensionality. Yet, the manner in which the state space is discretized is the source of many problems for both constructivist and reinforcement learning approaches. The problems can roughly be divided into two categories: (1) wiring too much domain information into the solution, and (2) requiring massive storage to represent the state space (such as Q-tables. The problems relate to (1) the non generality arising from wiring domain information into the solution, and (2) non scalability of the approach to useful domains involving high dimensional state spaces. Another important limitation is that high dimensional state spaces require a massive number of learning trials. We present a new approach that builds upon ideas from place cells and cognitive maps.
机译:我们提出了一种新的建设主义和加强学习空间离散化方法。建构主义学习和加强学习经常在预定义的状态和转换(状态空间)上运行。 AI研究人员设计算法在该状态空间中达到特定目标状态(例如,以机器人应该达到网格的目标单元格形式可视化)。然而,当状态空间的尺寸和维度增加时,找到目标状态变得棘手。尽管如此,这些算法可以在物理世界中具有有用的应用,但是有一种方法可以构建合理尺寸和维度的离散状态空间。然而,州空间被离散化的方式是构造主义和增强学习方法的许多问题的来源。这些问题可以大致分为两类:(1)将太多的域信息接线到解决方案中,并且(2)需要大量存储来表示状态空间(例如Q表。问题与(1)有关从接线信息引起的一般性,进入解决方案,以及(2)对涉及高维状态空间的有用域的方法的不可扩大性。另一个重要限制是高维状态空间需要大量的学习试验。我们提出了一种新方法这在从地区细胞和认知地图的想法上建立。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号