首页> 外文会议>Annual meeting of the Association for Computational Linguistics;Meeting of the Association for Computational Linguistics >Phrase-based Statistical Language Generation using Graphical Models and Active Learning
【24h】

Phrase-based Statistical Language Generation using Graphical Models and Active Learning

机译:使用图形模型和主动学习的基于短语的统计语言生成

获取原文

摘要

Most previous work on trainable language generation has focused on two paradigms: (a) using a statistical model to rank a set of generated utterances, or (b) using statistics to inform the generation decision process. Both approaches rely on the existence of a handcrafted generator, which limits their scalability to new domains. This paper presents Bagel, a statistical language generator which uses dynamic Bayesian networks to learn from semantically-aligned data produced by 42 untrained annotators. A human evaluation shows that Bagel can generate natural and informative utterances from unseen inputs in the information presentation domain. Additionally, generation performance on sparse datasets is improved significantly by using certainty-based active learning, yielding ratings close to the human gold standard with a fraction of the data.
机译:以前有关可训练语言生成的大多数工作都集中在两个范式上:(a)使用统计模型对一组生成的话语进行排名,或(b)使用统计信息为生成决策过程提供信息。两种方法都依赖于手工生成器的存在,这限制了它们在新领域的可扩展性。本文介绍了Bagel,这是一种统计语言生成器,它使用动态贝叶斯网络从42个未经训练的注释者产生的语义对齐数据中学习。人工评估表明,百吉饼可以从信息呈现领域中看不见的输入中产生自然而有益的话语。此外,通过使用基于确定性的主动学习,可显着提高稀疏数据集的生成性能,并以一部分数据获得接近人类金本位的评级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号