【24h】

Predictive Projections

机译:预测预测

获取原文

摘要

This paper addresses the problem of learning control policies in very high dimensional state spaces. We propose a linear dimensionality reduction algorithm that discovers predictive projections: projections in which accurate predictions of future states can be made using simple nearest neighbor style learning. The goal of this work is to extend the reach of existing reinforcement learning algorithms to domains where they would otherwise be inapplicable without extensive engineering of features. The approach is demonstrated on a synthetic pendulum balancing domain, as well as on a robot domain requiring visually guided control.
机译:本文解决了非常高维状态空间中学习控制策略的问题。我们提出了一种发现预测投影的线性维度减少算法:可以使用简单的最近邻风格学习来制作未来状态的准确预测的投影。这项工作的目标是将现有的加强学习算法的范围扩展到域名,否则在没有广泛的特征工程的情况下可以不适用。该方法在合成摆平衡域上证明,以及需要视觉引导控制的机器人域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号