首页> 外文期刊>Fundamenta Informaticae >Grammatical Inference of PCFGs Applied to Language Modelling and Unsupervised Parsing
【24h】

Grammatical Inference of PCFGs Applied to Language Modelling and Unsupervised Parsing

机译:PCFG的语法推断在语言建模和无监督分析中的应用

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Recently, different theoretical learning results have been found for a variety of context-free grammar subclasses through the use of distributional learning [1]. However, these results are still not extended to probabilistic grammars. In this work, we give a practical algorithm, with some proven properties, that learns a subclass of probabilistic grammars from positive data. A minimum satisfiability solver is used to direct the search towards small grammars. Experiments on well-known context-free languages and artificial natural language grammars give positive results. Moreover, our analysis shows that the type of grammars induced by our algorithm are, in theory, capable of modelling context-free features of natural language syntax. One of our experiments shows that our algorithm can potentially outperform the state-of-the-art in unsupervised parsing on the WSJ10 corpus.
机译:最近,通过使用分布学习[1],已经发现了针对各种无上下文语法子类的不同理论学习结果。但是,这些结果仍未扩展到概率语法。在这项工作中,我们给出了一种具有一些经过验证的特性的实用算法,该算法可以从正数数据中学习概率语法的子类。最小可满足性求解器用于将搜索定向到小语法。对著名的无上下文语言和人工自然语言语法的实验给出了积极的结果。此外,我们的分析表明,从理论上讲,我们的算法产生的语法类型能够对自然语言语法的无上下文特征进行建模。我们的一项实验表明,在WSJ10语料库上的无监督解析中,我们的算法有可能优于最新技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号