首页> 外文会议>Natural language understanding and intelligent applications >Large-Scale Simple Question Generation by Template-Based Seq2seq Learning
【24h】

Large-Scale Simple Question Generation by Template-Based Seq2seq Learning

机译:通过基于模板的Seq2seq学习大规模生成简单问题

获取原文
获取原文并翻译 | 示例

摘要

Numerous machine learning tasks achieved substantial advances with the help of large-scale supervised learning corpora over past decade. However, there's no large-scale question-answer corpora available for Chinese question answering over knowledge bases. In this paper, we present a 28M Chinese Q&A corpora based on the Chinese knowledge base provided by NLPCC2017 KBQA challenge. We propose a novel neural network architecture which combines template-based method and seq2seq learning to generate highly fluent and diverse questions. Both automatic and human evaluation results show that our model achieves outstanding performance (76.8 BLEU and 43.1 ROUGE). We also propose a new statistical metric called DIVERSE to measure the linguistic diversity of generated questions and prove that our model can generate much more diverse questions compared with other baselines.
机译:在过去的十年中,借助大型监督学习语料库,许多机器学习任务取得了长足的进步。但是,没有大规模的问答库可用于基于知识库的中文问答。本文基于NLPCC2017 KBQA挑战赛提供的中文知识库,提出了一个2800万个中文问答集。我们提出了一种新颖的神经网络架构,该架构结合了基于模板的方法和seq2seq学习以生成高度流利且多样化的问题。自动和人工评估结果均表明,我们的模型具有出色的性能(76.8 BLEU和43.1 ROUGE)。我们还提出了一种称为DIVERSE的新统计量度,以测量所生成问题的语言多样性,并证明与其他基准相比,我们的模型可以生成更多样化的问题。

著录项

  • 来源
  • 会议地点 Dalian(CN)
  • 作者单位

    Key Laboratory of Computational Linguistics, Ministry of Education, School of Electronics Engineering and Computer Science, Peking University, No. 5 Yiheyuan Road, Haidian District, Beijing 100871, China;

    Key Laboratory of Computational Linguistics, Ministry of Education, School of Electronics Engineering and Computer Science, Peking University, No. 5 Yiheyuan Road, Haidian District, Beijing 100871, China;

    Key Laboratory of Computational Linguistics, Ministry of Education, School of Electronics Engineering and Computer Science, Peking University, No. 5 Yiheyuan Road, Haidian District, Beijing 100871, China;

    Key Laboratory of Computational Linguistics, Ministry of Education, School of Electronics Engineering and Computer Science, Peking University, No. 5 Yiheyuan Road, Haidian District, Beijing 100871, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Question generation; Template-based seq2seq Linguistic diversity;

    机译:问题产生;基于模板的seq2seq语言多样性;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号