首页> 外文会议>International natural language generation conference >Modeling Confidence in Sequence-to-Sequence Models
【24h】

Modeling Confidence in Sequence-to-Sequence Models

机译:序列到序列模型中的置信度建模

获取原文

摘要

Recently, significant improvements have been achieved in various natural language processing tasks using neural sequence-to-sequence models. While aiming for the best generation quality is important, ultimately it is also necessary to develop models that can assess the quality of their output. In this work, we propose to use the similarity between training and test conditions as a measure for models' confidence. We investigate methods solely using the similarity as well as methods combining it with the posterior probability. While traditionally only target tokens are annotated with confidence measures, we also investigate methods to annotate source tokens with confidence. By learning an internal alignment model, we can significantly improve confidence projection over using state-of-the-art external alignment tools. We evaluate the proposed methods on downstream confidence estimation for machine translation (MT). We show improvements on segment-level confidence estimation as well as on confidence estimation for source tokens. In addition, we show that the same methods can also be applied to other tasks using sequence-to-sequence models. On the automatic speech recognition (ASR) task, we are able to find 60% of the errors by looking at 20% of the data.
机译:最近,使用神经序列到序列模型在各种自然语言处理任务中已经取得了显着的进步。尽管追求最佳发电质量很重要,但最终也有必要开发可评估其输出质量的模型。在这项工作中,我们建议使用训练条件和测试条件之间的相似性来衡量模型的置信度。我们研究仅使用相似性的方法以及将其与后验概率结合的方法。传统上,只有目标令牌使用置信度量度,但我们也研究了置信度注释源令牌的方法。通过学习内部对准模型,与使用最新的外部对准工具相比,我们可以显着改善置信度预测。我们评估针对机器翻译(MT)的下游置信度估计的拟议方法。我们展示了对段级别的置信度估计以及对源令牌的置信度估计的改进。此外,我们证明了相同的方法也可以使用序列到序列模型应用于其他任务。在自动语音识别(ASR)任务上,通过查看20%的数据,我们能够找到60%的错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号