Analyzing Learned Representations of a Deep ASR Performance Prediction Model

机译：分析深度ASR性能预测模型的学习表示

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper addresses a relatively new task: prediction of ASR performance on unseen broadcast programs. In a previous paper, we presented an ASR performance prediction system using CNNs that encode both text (ASR transcript) and speech, in order to predict word error rate. This work is dedicated to the analysis of speech signal embeddings and text em-beddings learnt by the CNN while training our prediction model. We try to better understand which information is captured by the deep model and its relation with different conditioning factors. It is shown that hidden layers convey a clear signal about speech style, accent and broadcast type. We then try to leverage these 3 types of information at training time through multi-task learning. Our experiments show that this allows to train slightly more efficient ASR performance prediction systems that - in addition - simultaneously tag the analyzed utterances according to their speech style, accent and broadcast program origin.

机译：本文解决了一个相对较新的任务：在看不见的广播节目上预测ASR性能。在先前的论文中，我们提出了一种使用CNN的ASR性能预测系统，该系统同时对文本（ASR笔录）和语音进行编码，以预测单词错误率。这项工作致力于分析CNN在训练我们的预测模型时学习到的语音信号嵌入和文本嵌入。我们试图更好地了解深度模型捕获的信息及其与不同条件因素的关系。结果表明，隐藏层传达了有关语音样式，口音和广播类型的清晰信号。然后，我们尝试通过多任务学习在训练时利用这三种类型的信息。我们的实验表明，这允许训练稍微更有效的ASR性能预测系统，此外，该系统还可以根据语音的语音风格，口音和广播节目的来源同时标记分析的语音。

著录项

来源
《1st EMNLP workshop blackboxNLP: analyzing and interpreting neural networks for NLP 2018》|2018年|9-15|共7页
会议地点 Brussels(BE)
作者
Zied Elloumi; Laurent Besacier; Olivier Galibert; Benjamin Lecouteux;
展开▼
作者单位

Laboratoire national de metrologie et d'essais (LNE) , France,Univ. Grenoble Alpes, CNRS, Grenoble INP, LIG, F-38000 Grenoble, France;

Univ. Grenoble Alpes, CNRS, Grenoble INP, LIG, F-38000 Grenoble, France;

Laboratoire national de metrologie et d'essais (LNE) , France;

Univ. Grenoble Alpes, CNRS, Grenoble INP, LIG, F-38000 Grenoble, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Analyzing Learned Molecular Representations for Property Prediction [J] . Yang Kevin, Swanson Kyle, Jin Wengong, Journal of chemical information and modeling . 2019,第8期

机译：分析性能预测的学习分子表征
2. Understanding Deep Representations Learned in Modeling Users Likes [J] . Sharath Chandra Guntuku, Joey Tianyi Zhou, Sujoy Roy, IEEE Transactions on Image Processing . 2016,第8期

机译：了解在建模用户喜欢过程中学习到的深层表示形式
3. Model approach to grammatical evolution: deep-structured analyzing of model and representation [J] . He Pei, Deng Zelin, Gao Chongzhi, Soft computing: A fusion of foundations, methodologies and applications . 2017,第18期

机译：语法演化模型方法：模型与代表深构造分析
4. Analyzing Learned Representations of a Deep ASR Performance Prediction Model [C] . Zied Elloumi, Laurent Besacier, Olivier Galibert, Conference on empirical methods in natural language processing . 2018

机译：分析深度ASR性能预测模型的学习表征
5. Efficient Methods in Deep Learning Lifecycle: Representation, Prediction and Model Compression [D] . Sha, Long. 2021

机译：深度学习生命周期的有效方法：表示，预测和模型压缩
6. Analyzing Learned Molecular Representations for PropertyPrediction [O] . Kevin Yang, *, Kyle Swanson, -1

机译：分析学习到的分子表征的性质预测
7. Analyzing Learned Representations of a Deep ASR Performance Prediction Model [O] . Zied Elloumi, Laurent Besacier, Olivier Galibert, 2018

机译：分析深度ASR性能预测模型的学习表征
8. Modeling and Analyzing the Propagation from Environmental through Sonar Performance Prediction. [R] . Cox, H., Heaney, K. D. 2003

机译：通过声纳性能预测建立和分析环境传播。

Analyzing Learned Representations of a Deep ASR Performance Prediction Model

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅