首页> 外文会议>11th European Symposium on Artificial Neural Networks (ESANN '2003); Apr 23-25, 2003; Bruges, Belgium >Characterization of the absolutely expedient learning algorithms for stochastic automata in a non-discrete space of actions
【24h】

Characterization of the absolutely expedient learning algorithms for stochastic automata in a non-discrete space of actions

机译:非离散动作空间中随机自动机绝对权宜的学习算法的特征

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This work presents a learning algorithm to reach the optimum action of an arbitrary set of actions contained in R~m. An initial and arbitrary probability' measure on R~m allow us to select an action and the probability is sequentially updated by a stochastic automaton using the response of the environment to the selected action. We prove that the corresponding random sequence of probability measures converges in law to a probability measure degenerate on the optimum action, with probability as close to one as we desire.
机译:这项工作提出了一种学习算法,以达到R_m中包含的任意一组动作的最佳动作。在Rm上的初始概率和任意概率测量使我们能够选择一个动作,并且该概率由随机自动机使用环境对所选动作的响应进行顺序更新。我们证明了概率测度的相应随机序列在法律上收敛于在最佳动作上退化的概率测度,概率接近我们期望的概率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号