首页> 外文会议>International natural language generation conference >Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)
【24h】

Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)

机译:自然语言生成的自动质量估计:随机(共同评分和排名)

获取原文

摘要

We present a recurrent neural network based system for automatic quality estimation of natural language generation (NLG) outputs, which jointly learns to assign numerical ratings to individual outputs and to provide pair-wise rankings of two different outputs. The latter is trained using pairwise hinge loss over scores from two copies of the rating network. We use learning to rank and synthetic data to improve the quality of ratings assigned by our system: we synthesise training pairs of distorted system outputs and train the system to rank the less distorted one higher. This leads to a 12% increase in correlation with human ratings over the previous benchmark. We also establish the state of the art on the dataset of relative rankings from the E2E NLG Challenge (Dusek et al., 2019), where synthetic data lead to a 4% accuracy increase over the base model.
机译:我们提出了一种基于递归神经网络的系统,用于自然语言生成(NLG)输出的自动质量评估,该系统共同学习为各个输出分配数值等级并提供两个不同输出的成对排名。后者是使用成对铰链损失对两个评级网络副本的分数进行训练的。我们使用学习对数据进行排名和综合数据来提高系统分配的评级质量:我们综合训练对失真的系统输出的训练对,并训练系统将失真程度较小的系统排序为更高的等级。这导致与人类评级的相关性比以前的基准提高了12%。我们还在E2E NLG挑战赛(Dusek等人,2019)的相对排名数据集上建立了最先进的技术,其中合成数据导致基础模型的准确性提高了4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号