首页> 外文会议>1st EMNLP workshop blackboxNLP: analyzing and interpreting neural networks for NLP 2018 >Analyzing Learned Representations of a Deep ASR Performance Prediction Model
【24h】

Analyzing Learned Representations of a Deep ASR Performance Prediction Model

机译:分析深度ASR性能预测模型的学习表示

获取原文
获取原文并翻译 | 示例

摘要

This paper addresses a relatively new task: prediction of ASR performance on unseen broadcast programs. In a previous paper, we presented an ASR performance prediction system using CNNs that encode both text (ASR transcript) and speech, in order to predict word error rate. This work is dedicated to the analysis of speech signal embeddings and text em-beddings learnt by the CNN while training our prediction model. We try to better understand which information is captured by the deep model and its relation with different conditioning factors. It is shown that hidden layers convey a clear signal about speech style, accent and broadcast type. We then try to leverage these 3 types of information at training time through multi-task learning. Our experiments show that this allows to train slightly more efficient ASR performance prediction systems that - in addition - simultaneously tag the analyzed utterances according to their speech style, accent and broadcast program origin.
机译:本文解决了一个相对较新的任务:在看不见的广播节目上预测ASR性能。在先前的论文中,我们提出了一种使用CNN的ASR性能预测系统,该系统同时对文本(ASR笔录)和语音进行编码,以预测单词错误率。这项工作致力于分析CNN在训练我们的预测模型时学习到的语音信号嵌入和文本嵌入。我们试图更好地了解深度模型捕获的信息及其与不同条件因素的关系。结果表明,隐藏层传达了有关语音样式,口音和广播类型的清晰信号。然后,我们尝试通过多任务学习在训练时利用这三种类型的信息。我们的实验表明,这允许训练稍微更有效的ASR性能预测系统,此外,该系统还可以根据语音的语音风格,口音和广播节目的来源同时标记分析的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号