Learning to balance a pole on a movable cart through RL: what can be gained using Adaptive NN?

Abstract

The work of Barto, Sutton and Williams on the ACE/ASE model for Reinforcement Learning is put in perspective here. In their work, a state-control (input-output) map that allows a pole hinged on a moving cart to be balanced for as long as possible is learned when the only information provided by the environment is the system failure. This work has given rise to a large body of research in the fields of machine learning and artificial intelligence. Its relevance lies in the fact that it can be applied to the control of systems that are only partially known. A critical issue is the exploration of the state space, which may require an impractical amount of memory and learning time. Adaptive networks, which have been studied in recent years, offer a natural solution for implementing the learning system, since they allow an adaptive partitioning of the state space according to the task difficulty experienced in the different regions.
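To make the learning scheme in the abstract concrete, the following is a minimal sketch of an ASE/ACE pair on a simulated cart-pole, where the only feedback is a failure signal. It is not the method of this paper: it uses a fixed 162-box partition of the state space (the kind of static partitioning that adaptive networks are meant to replace), and the function names, box thresholds, physical constants and learning parameters are all assumptions chosen for illustration.

```python
import math
import random

N_BOXES = 162  # assumed fixed partition: 3 (x) * 3 (x_dot) * 6 (theta) * 3 (theta_dot)
DEG = math.pi / 180.0


def box_index(state):
    """Map a continuous state to a box index in [0, 161], or -1 on failure."""
    x, x_dot, theta, theta_dot = state
    if abs(x) > 2.4 or abs(theta) > 12 * DEG:
        return -1  # cart off the track or pole fallen: system failure
    ix = 0 if x < -0.8 else (1 if x < 0.8 else 2)
    ixd = 0 if x_dot < -0.5 else (1 if x_dot < 0.5 else 2)
    bounds = [-6 * DEG, -1 * DEG, 0.0, 1 * DEG, 6 * DEG]
    it = sum(1 for b in bounds if theta >= b)  # 0..5
    itd = 0 if theta_dot < -50 * DEG else (1 if theta_dot < 50 * DEG else 2)
    return ix + 3 * ixd + 9 * it + 54 * itd


def cart_pole_step(state, push_right, dt=0.02):
    """One Euler step of frictionless cart-pole dynamics (assumed constants)."""
    g, m_cart, m_pole, half_len, force_mag = 9.8, 1.0, 0.1, 0.5, 10.0
    x, x_dot, theta, theta_dot = state
    force = force_mag if push_right else -force_mag
    total = m_cart + m_pole
    sin_t, cos_t = math.sin(theta), math.cos(theta)
    temp = (force + m_pole * half_len * theta_dot ** 2 * sin_t) / total
    theta_acc = (g * sin_t - cos_t * temp) / (
        half_len * (4.0 / 3.0 - m_pole * cos_t ** 2 / total))
    x_acc = temp - m_pole * half_len * theta_acc * cos_t / total
    return (x + dt * x_dot, x_dot + dt * x_acc,
            theta + dt * theta_dot, theta_dot + dt * theta_acc)


def train(episodes=50, max_steps=10000, seed=0,
          alpha=1000.0, beta=0.5, gamma=0.95, delta=0.9, lam=0.8):
    """Run ASE/ACE learning; returns (steps balanced per episode, w, v)."""
    rng = random.Random(seed)
    w = [0.0] * N_BOXES  # ASE weights: which way to push in each box
    v = [0.0] * N_BOXES  # ACE weights: predicted reinforcement per box
    lengths = []
    for _ in range(episodes):
        e = [0.0] * N_BOXES     # ASE eligibility traces
        xbar = [0.0] * N_BOXES  # ACE traces of recently visited boxes
        state = (0.0, 0.0, 0.0, 0.0)
        box = box_index(state)
        steps = 0
        while steps < max_steps:
            # ASE: noisy threshold on the weight of the active box
            push_right = w[box] + rng.gauss(0.0, 0.01) >= 0.0
            e[box] += (1.0 - delta) * (1.0 if push_right else -1.0)
            xbar[box] += 1.0 - lam
            p_old = v[box]
            state = cart_pole_step(state, push_right)
            steps += 1
            box = box_index(state)
            failed = box < 0
            r = -1.0 if failed else 0.0          # failure is the only signal
            p_new = 0.0 if failed else v[box]
            r_hat = r + gamma * p_new - p_old    # internal reinforcement (ACE)
            for i in range(N_BOXES):
                w[i] += alpha * r_hat * e[i]
                v[i] += beta * r_hat * xbar[i]
                e[i] *= delta
                xbar[i] *= lam
            if failed:
                break
        lengths.append(steps)
    return lengths, w, v
```

Calling train(episodes=20) returns the number of steps balanced in each episode together with the learned weights. Both weight tables grow with the number of boxes, which illustrates the memory and learning-time concern that motivates an adaptive partitioning.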

Bibliographic details

  • Source
    Neural nets Wirn Vietri-98 | 1998 | pp. 179-184 | 6 pages
  • Conference location: Vietri sul Mare (IT)
  • Author affiliations

    Laboratory of Human Motion Study and Virtual Reality Istituto Neuroscienze e Bioimmagini, CNR, Via f.lli Cervi 83, 20090 Segrate (Milano), Italy;

    Laboratory of Human Motion Study and Virtual Reality Istituto Neuroscienze e Bioimmagini, CNR, Via f.lli Cervi 83, 20090 Segrate (Milano), Italy;

    Laboratory of Human Motion Study and Virtual Reality Istituto Neuroscienze e Bioimmagini, CNR, Via f.lli Cervi 83, 20090 Segrate (Milano), Italy;

  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Automation system theory
