首页> 外文会议>International Conference on Natural Language Processing and Chinese Computing >Large-Scale Simple Question Generation by Template-Based Seq2seq Learning
【24h】

Large-Scale Simple Question Generation by Template-Based Seq2seq Learning

机译:由基于模板的SEQ2SEQ学习的大规模简单问题

获取原文

摘要

Numerous machine learning tasks achieved substantial advances with the help of large-scale supervised learning corpora over past decade. However, there's no large-scale question-answer corpora available for Chinese question answering over knowledge bases. In this paper, we present a 28M Chinese Q&A corpora based on the Chinese knowledge base provided by NLPCC2017 KBQA challenge. We propose a novel neural network architecture which combines template-based method and seq2seq learning to generate highly fluent and diverse questions. Both automatic and human evaluation results show that our model achieves outstanding performance (76.8 BLEU and 43.1 ROUGE). We also propose a new statistical metric called DIVERSE to measure the linguistic diversity of generated questions and prove that our model can generate much more diverse questions compared with other baselines.
机译:众多机器学习任务在过去十年中的大规模监督学习集团的帮助下取得了重大进展。但是,没有大规模的质疑答案对中国问题提供的关于知识库的问题。在本文中,我们在NLPCC2017 KBQA挑战提供的中国知识库,展示了一家28米的中国Q&A Cotora。我们提出了一种新的神经网络架构,将基于模板的方法和SEQ2Seq学习结合起来,从而产生高流利和多样化的问题。自动和人类评估结果都表明,我们的模型实现了出色的性能(76.8 BLEU和43.1胭脂)。我们还提出了一种称为多样化的新统计指标,以衡量所产生的问题的语言多样性,并证明我们的模型可以与其他基线相比产生更多样化的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号