International Conference on Statistical Language and Speech Processing

Merging of Native and Non-native Speech for Low-resource Accented ASR

Abstract

This paper presents our recent study on a low-resource automatic speech recognition (ASR) system for accented speech. We propose multi-accent Subspace Gaussian Mixture Models (SGMM) and accent-specific Deep Neural Networks (DNN) for improving non-native ASR performance. In the SGMM framework, we present an original language weighting strategy to merge the globally shared parameters of two models trained on native and non-native speech, respectively. In the DNN framework, a native deep neural network is fine-tuned to non-native speech. Over the non-native baseline, we achieved relative improvements of 15% for the multi-accent SGMM and 34% for the accent-specific DNN with speaker adaptation.
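
The abstract does not spell out the weighting formula, but the core idea of merging the globally shared parameters of a native and a non-native SGMM can be illustrated with a simple linear interpolation controlled by a language weight. The sketch below is only an illustration under that assumption: the function name merge_shared_params, the dictionary-of-arrays representation of the shared parameters, and the single weight lam are hypothetical and do not come from the paper.

```python
import numpy as np

# Illustrative sketch (not the paper's exact formulation): merge the globally
# shared parameters of a native and a non-native SGMM by linear interpolation
# with a language weight `lam` in [0, 1]. In an SGMM the shared parameters
# include, e.g., the phonetic-subspace projection matrices and the shared
# full covariances; here they are simply represented as named numpy arrays.

def merge_shared_params(native, non_native, lam=0.5):
    """Interpolate each shared parameter of two models.

    `native` and `non_native` map parameter names (e.g. "M_0", "M_1", ...)
    to numpy arrays of matching shape; `lam` is the weight given to the
    non-native model.
    """
    merged = {}
    for name, w_native in native.items():
        w_non_native = non_native[name]
        merged[name] = (1.0 - lam) * w_native + lam * w_non_native
    return merged

if __name__ == "__main__":
    # Toy example: 2 shared Gaussians, 40-dim features, 50-dim subspace.
    rng = np.random.default_rng(0)
    native = {f"M_{i}": rng.normal(size=(40, 50)) for i in range(2)}
    non_native = {f"M_{i}": rng.normal(size=(40, 50)) for i in range(2)}
    merged = merge_shared_params(native, non_native, lam=0.3)
    print(merged["M_0"].shape)  # (40, 50)
```

Since a convex combination of positive-definite matrices remains positive definite, the same interpolation could in principle also be applied to shared covariances; the paper's actual weighting strategy may of course differ from this sketch.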