首页> 外文会议>International Conference on Algorithmic Learning Theory >Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning
【24h】

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

机译:简单语法的概率概括及其在加固学习中的应用

获取原文

摘要

Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms to infer probabilistic languages, one must take into account not only equivalences between languages but also probabilistic generalities of grammars. The probabilistic generality of a grammar G is the class of the probabilistic languages generated by probabilistic grammars constructed on G. We introduce a subclass of simple grammars (SGs), referred as to unifiable simple grammars (USGs), which is a superclass of an efficiently learnable class, right-unique simple grammars (RSGs). We show that the class of RSGs is unifiable within the class of USGs, whereas SGs and RSGs are not unifiable within the class of SGs and RSGs, respectively. We also introduce simple context-free decision processes, which are a natural extension of finite Markov decision processes and intuitively may be thought of a Markov decision process with stacks. We propose a reinforcement learning method on simple context-free decision processes, as an application of the learning and unification algorithm for RSGs from positive data.
机译:最近,已经发现从正数据有效地学习无背景语法的一些非规则的子类。为了使用这些高效的算法来推断概率语言,不仅必须考虑语言之间的等效性,而且必须考虑到语法的概率总体。语法G的概率普遍性是由G构建的概率语法产生的概率语言的类。我们介绍了简单语法(SGS)的子类,提到了统一的简单语法(USG),这是一个有效的超类学习课程,右独特的简单语法(RSG)。我们表明,在USG的课程中,RSG的类是统一的,而SGS和RSG分别在SGS和RSG的类别中也不是统一的。我们还介绍了简单的无背景决策过程,这些过程是有限马尔可夫决策过程的自然延伸,并直观地可能被认为是带有堆栈的马尔可夫决策过程。我们提出了一种关于简单的无背景决策过程的加强学习方法,作为从正数据的RSG的学习和统一算法的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号