Modeling Confidence in Sequence-to-Sequence Models

Abstract

Recently, significant improvements have been achieved in various natural language processing tasks using neural sequence-to-sequence models. While aiming for the best generation quality is important, ultimately it is also necessary to develop models that can assess the quality of their output. In this work, we propose to use the similarity between training and test conditions as a measure for models' confidence. We investigate methods solely using the similarity as well as methods combining it with the posterior probability. While traditionally only target tokens are annotated with confidence measures, we also investigate methods to annotate source tokens with confidence. By learning an internal alignment model, we can significantly improve confidence projection over using state-of-the-art external alignment tools. We evaluate the proposed methods on downstream confidence estimation for machine translation (MT). We show improvements on segment-level confidence estimation as well as on confidence estimation for source tokens. In addition, we show that the same methods can also be applied to other tasks using sequence-to-sequence models. On the automatic speech recognition (ASR) task, we are able to find 60% of the errors by looking at 20% of the data.
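
The closing ASR figure (60% of the errors found in 20% of the data) can be read as a ranking measure: sort the outputs by confidence, inspect the least confident share, and count how many of the known errors fall inside it. The sketch below is one plausible way to compute such a number, not the authors' evaluation code. The function name `errors_found_at_budget`, the `budget` parameter, and the toy data are assumptions made for illustration.

```python
def errors_found_at_budget(confidences, is_error, budget=0.2):
    """Share of all errors that lie in the lowest-confidence `budget` fraction.

    Hypothetical helper for illustration; not taken from the paper.
    confidences: one confidence score per item (higher = more confident).
    is_error:    one boolean per item, True if the item is wrong.
    budget:      fraction of items a reviewer is willing to inspect.
    """
    if len(confidences) != len(is_error):
        raise ValueError("confidences and is_error must have the same length")
    n = len(confidences)
    total_errors = sum(1 for e in is_error if e)
    if n == 0 or total_errors == 0:
        return 0.0
    k = int(n * budget)  # number of items inspected
    # Inspect the least confident items first.
    order = sorted(range(n), key=lambda i: confidences[i])
    found = sum(1 for i in order[:k] if is_error[i])
    return found / total_errors


# Toy data (made up): 10 items, 3 of which are errors.
toy_conf = [0.9, 0.1, 0.8, 0.2, 0.95, 0.7, 0.85, 0.6, 0.99, 0.3]
toy_err = [False, True, False, True, False, False, False, True, False, False]
share = errors_found_at_budget(toy_conf, toy_err, budget=0.2)
```

On this toy data the two least confident items are both errors, so two of the three errors are found within a 20% inspection budget. A confidence measure that carried no information would be expected to find about 20% of the errors at the same budget.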

Bibliographic record

Source
International Natural Language Generation Conference | 2019 | pp. 575-583 | 9 pages
Authors
Jan Niehues; Ngoc-Quan Pham
Original format
PDF

Similar literature

1. A systematic review on sequence-to-sequence learning with neural network and its models [J]. Hana Yousuf, Michael Lahzi, Said A. Salloum. International Journal of Electrical and Computer Engineering. 2021, No. 3
2. Improved Customer Lifetime Value Prediction With Sequence-To-Sequence Learning and Feature-Based Models [J]. Bauer Josef, Jannach Dietmar. ACM Transactions on Knowledge Discovery from Data. 2021, No. 5
3. Prosodic Features Control by Symbols as Input of Sequence-to-Sequence Acoustic Modeling for Neural TTS [J]. Kiyoshi KURIHARA, Nobumasa SEIYAMA, Tadashi KUMANO. IEICE Transactions on Information and Systems. 2021, No. 2
4. Modeling Confidence in Sequence-to-Sequence Models [C]. Jan Niehues, Ngoc-Quan Pham. International Natural Language Generation Conference. 2019
5. Using Water Quality Models in Management - A Multiple Model Assessment, Analysis of Confidence, and Evaluation of Climate Change Impacts [D]. Irby, Isaac David. 2017
6. Detecting insertion substitution and deletion errors in radiology reports using neural sequence-to-sequence models [O]. John Zech, Jessica Forde, Joseph J. Titano. 2019
7. Characterizing the Impact of Using Features Extracted from Pre-trained Models on the Quality of Video Captioning Sequence-to-Sequence Models [O]. Menatallh Hammad, May Hammad, Mohamed Elshenawy. 2020
