首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition
【24h】

Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition

机译:耦合训练重音语音识别的序列序列模型

获取原文

摘要

Accented speech poses significant challenges for state-of-the-art automatic speech recognition (ASR) systems. Accent is a property of speech that lasts throughout an utterance in varying degrees of strength. This makes it hard to isolate the influence of accent on individual speech sounds. We propose coupled training for encoder-decoder ASR models that acts on pairs of utterances corresponding to the same text spoken by speakers with different accents. This training regime introduces an L2 loss between the attention-weighted representations corresponding to pairs of utterances with the same text, thus acting as a regularizer and encouraging representations from the encoder to be more accent-invariant. We focus on recognizing accented English samples from the Mozilla Common Voice corpus. We obtain significant error rate reductions on accented samples from a large set of diverse accents using coupled training. We also show consistent improvements in performance on heavily accented samples (as determined by a standalone accent classifier).
机译:强调的语言对最先进的自动语音识别(ASR)系统带来了重大挑战。口音是讲话的财产,持续在整个不同程度的力量中。这使得难以隔离口音对个体语音的影响。我们为编码器解码器ASR模型提出了耦合训练,其与对应于具有不同口音的扬声器所说的相同文本的对应的话语对。该培训制度在对应于具有相同文本的对应的注意力表示之间引入L2损耗,从而充当常规器并鼓励来自编码器的表示更加强调不变。我们专注于识别来自Mozilla常见语音语料库的重音英语样本。我们通过耦合训练从大量不同的重点获得重音样本的重大错误率缩短。我们还显示了在重音样本上的性能方面的一致性改进(由独立的口音分类器确定)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号