Towards Feature Selection in Actor-Critic Algorithms

Abstract

Choosing features for the critic in actor-critic algorithms with function approximation is known to be a challenge. Too few critic features can lead to degeneracy of the actor gradient, and too many features may lead to slower convergence of the learner. In this paper, the authors show that a well-studied class of actor policies satisfies the known requirements for convergence when the actor features are selected carefully. They demonstrate that two popular representations for value methods -- the barycentric interpolators and the graph Laplacian proto-value functions -- can be used to represent the actor so as to satisfy these conditions. A consequence of this work is a generalization of the proto-value function methods to the continuous-action actor-critic domain. Finally, they analyze the performance of this approach using a simulation of a torque-limited inverted pendulum.
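
To make the idea of a barycentric interpolator concrete, the following is a minimal sketch of how such features might be computed for a one-dimensional state. It is not taken from the report: the uniform grid, the clamping at the boundaries, and the names barycentric_features and interpolate are all assumptions made for illustration. The feature vector holds the interpolation weights of the two grid knots surrounding the state, so the weights are non-negative and sum to one.

```python
from bisect import bisect_right


def barycentric_features(x, grid):
    """Return interpolation weights of scalar x over a sorted 1-D grid.

    Hypothetical helper for illustration only. `grid` must be a sorted
    list of at least two distinct knot positions.
    """
    if len(grid) < 2:
        raise ValueError("grid needs at least two knots")
    # Assumption: states outside the grid are clamped to its boundary.
    x = min(max(x, grid[0]), grid[-1])
    # Index of the left knot of the cell that contains x.
    i = min(bisect_right(grid, x) - 1, len(grid) - 2)
    left, right = grid[i], grid[i + 1]
    w_right = (x - left) / (right - left)
    features = [0.0] * len(grid)
    features[i] = 1.0 - w_right   # weight on the left knot
    features[i + 1] = w_right     # weight on the right knot
    return features


def interpolate(x, grid, values):
    """Linear function of the features: a weighted sum of knot values."""
    phi = barycentric_features(x, grid)
    return sum(p * v for p, v in zip(phi, values))


if __name__ == "__main__":
    grid = [-1.0, 0.0, 1.0]
    phi = barycentric_features(0.25, grid)
    print(phi)
    # Using the knot positions as values reproduces the state itself.
    print(interpolate(0.25, grid, grid))
```

In a sketch like this, a linear function of the features (for example, the mean of an actor's action distribution) is just a weighted sum of per-knot parameters, which is what interpolate computes. How the report actually constructs the actor from these features is not stated in the abstract.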
