Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

Abstract

Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms to infer probabilistic languages, one must take into account not only equivalences between languages but also probabilistic generalities of grammars. The probabilistic generality of a grammar G is the class of the probabilistic languages generated by probabilistic grammars constructed on G. We introduce a subclass of simple grammars (SGs), referred to as unifiable simple grammars (USGs), which is a superclass of an efficiently learnable class, right-unique simple grammars (RSGs). We show that the class of RSGs is unifiable within the class of USGs, whereas SGs and RSGs are not unifiable within the class of SGs and RSGs, respectively. We also introduce simple context-free decision processes, which are a natural extension of finite Markov decision processes and intuitively may be thought of as Markov decision processes with stacks. We propose a reinforcement learning method on simple context-free decision processes, as an application of the learning and unification algorithm for RSGs from positive data.
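
The abstract does not spell out its definitions, so the following is only an illustrative sketch, not the authors' construction. It assumes the standard notion of a simple grammar (Greibach normal form with at most one rule A -> a alpha for each nonterminal A and terminal a) and treats a probabilistic grammar on G as an assignment of probabilities to the rules of G that sums to 1 for each nonterminal. The grammar `RULES`, its probabilities, and the helper names `is_probabilistic`, `string_probability`, and `sample` are all hypothetical. The stack-driven loop is also a rough picture of the "Markov decision process with stacks" intuition.

```python
import random

# Hypothetical simple grammar for { a^n b^n : n >= 1 }:
#   S -> a A        A -> a A B | b        B -> b
# Rules are keyed by (nonterminal, terminal), so each pair has at most one
# right-hand side, which is the "simple" condition assumed in the lead-in.
# Each entry is (right-hand-side nonterminals, rule probability).
RULES = {
    "S": {"a": (("A",), 1.0)},
    "A": {"a": (("A", "B"), 0.3), "b": ((), 0.7)},
    "B": {"b": ((), 1.0)},
}


def is_probabilistic(rules, tol=1e-9):
    """Check that rule probabilities sum to 1 for every nonterminal."""
    return all(
        abs(sum(p for _, p in by_terminal.values()) - 1.0) <= tol
        for by_terminal in rules.values()
    )


def string_probability(rules, start, word):
    """Probability of `word`; parsing is deterministic for a simple grammar."""
    stack = [start]
    prob = 1.0
    for ch in word:
        if not stack:
            return 0.0  # input continues after the derivation has finished
        top = stack.pop()
        rule = rules[top].get(ch)
        if rule is None:
            return 0.0  # no rule for this (nonterminal, terminal) pair
        rhs, p = rule
        prob *= p
        stack.extend(reversed(rhs))  # leftmost nonterminal ends up on top
    return prob if not stack else 0.0


def sample(rules, start, rng):
    """Sample one string by a leftmost derivation driven by a stack."""
    stack = [start]
    out = []
    while stack:
        top = stack.pop()
        terminals = list(rules[top])
        weights = [rules[top][t][1] for t in terminals]
        ch = rng.choices(terminals, weights=weights)[0]
        out.append(ch)
        stack.extend(reversed(rules[top][ch][0]))
    return "".join(out)
```

Under these assumed probabilities, the string a^n b^n receives probability 0.3^(n-1) * 0.7, so changing the rule probabilities while keeping `RULES` fixed yields different probabilistic languages over the same underlying grammar.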

Bibliographic details

Source
International Conference on Algorithmic Learning Theory | 2006 | 15 pages
Authors
Takeshi Shibata; Ryo Yoshinaka; Takashi Chikayama
Original format: PDF
Chinese Library Classification: Artificial intelligence theory

Similar literature

1. Learning Probabilistic Hierarchical Task Networks as Probabilistic Context-Free Grammars to Capture User Preferences [J] . Nan Li, William Cushing, Subbarao Kambhampati, ACM transactions on intelligent systems . 2014, Issue 2
2. Architectural planning with shape grammars and reinforcement learning: Habitability and energy efficiency [J] . Lawrence Mandow, Jose-Luis Perez-de-la-Cruz, Ana Belen Rodriguez-Gavilan, Engineering Applications of Artificial Intelligence . 2020, Issue Nova
3. Alignment in vision-based syntactic language games for teams of robots using stochastic regular grammars and reinforcement learning: The fully autonomous case and the human supervised case [J] . Maravall Dario, Mario Mingo Jack, De Lope Javier, Robotics and Autonomous Systems . 2015, Issue Pta2
4. Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning [C] . Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama, Algorithmic Learning Theory; Lecture Notes in Artificial Intelligence; 4264 . 2006
5. Classicism vs. eliminative connectionism: Learning simple artificial grammars with backpropagation networks [D] . Vilcu, Marius . 2005
6. Learning simple and complex artificial grammars in the presence of a semantic reference field: effects on performance and awareness [O] . Esther Van den Bos, Fenna H. Poletiek
7. A Generalization of Linear Indexed Grammars Equivalent to Simple Context-Free Tree Grammars [O] . Makoto Kanazawa . 2014
