首页> 外文会议>IEEE Congress on Evolutionary Computation >Revisiting the Acrobot #x2018;height#x2019; task: An example of efficient evolutionary policy search under an episodic goal seeking task
【24h】

Revisiting the Acrobot #x2018;height#x2019; task: An example of efficient evolutionary policy search under an episodic goal seeking task

机译:重新审视Acrobot‘身高’ 任务:在展示任务下的高效进化政策搜索示例

获取原文

摘要

Evolutionary methods for addressing the temporal sequence learning problem generally fall into policy search as opposed to value function optimization approaches. Various recent results have made the claim that the policy search approach is at best inefficient at solving episodic ‘goal seeking’ tasks i.e., tasks under which the reward is limited to describing properties associated with a successful outcome have no qualification for degrees of failure. This work demonstrates that such a conclusion is due to a lack of diversity in the training scenarios. We therefore return to the Acrobot ‘height’ task domain originally used to demonstrate complete failure in evolutionary policy search. This time a very simple stochastic sampling heuristic for defining a population of training configurations is introduced. Benchmarking two recent evolutionary policy search algorithms — Neural Evolution of Augmented Topologies (NEAT) and Symbiotic Bid-Based (SBB) Genetic Programming — under this condition demonstrates solutions as effective as those returned by advanced value function methods. Moreover this is achieved while remaining within the evaluation limit imposed by the original study.
机译:用于解决时间序列学习问题的进化方法通常属于策略搜索,而不是价值函数优化方法。各种最近的结果已经提出了策略搜索方法在求解情节&#x2018处于最佳低效;目标寻求’奖励的任务仅限于描述与成功结果相关的属性没有资格对失败程度。这项工作表明,这样的结论是由于培训方案缺乏多样性。因此,我们返回神奇‘高度’任务域最初用于在进化政策搜索中展示完全失败。这次引入了用于定义培训配置群体的一个非常简单的随机采样启发式。基准测试两个最近的进化政策搜索算法 - 增强拓扑(整洁)和基于共生投标(SBB)遗传编程的神经演进 - 根据这种情况,表明解决方案如先进的价值函数方法返回的解决方案。此外,这是在剩余的原始研究所施加的评估限制的同时实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号