首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An End-to-End Speech Accent Recognition Method Based on Hybrid CTC/Attention Transformer ASR
【24h】

An End-to-End Speech Accent Recognition Method Based on Hybrid CTC/Attention Transformer ASR

机译:基于混合CTC /关注变压器ASR的端到端语音口音识别方法

获取原文

摘要

This paper proposes a novel accent recognition system in the framework of a transformer-based end-to-end speech recognition system. To incorporate the pronunciation and linguistic knowledge into the network, we first pre-train an ASR model in a hybrid CTC/attention manner. Then, focusing on accent recognition, we extend the output token list by inserting accent labels to the transcripts and finetune the network parameters with an accented speech dataset. Our work is evaluated on the Interspeech 2020 Accented English Speech Recognition Challenge. Experiments show that our method achieves an accuracy of 72.39% on the test set and 80.98% on the development set, outperforming the baseline system by a very large margin. Our submitted system ranked second in the accent recognition task in the challenge.
机译:本文提出了一种基于变压器的端到端语音识别系统框架中的新型重音识别系统。 为了将发音和语言知识纳入网络中,我们首先以混合CTC /注意方式预先列车模型。 然后,专注于重点识别,我们通过将重音标签插入成绩单并使用重音语音数据集来扩展输出令牌列表。 我们的作品在Interspeech 2020令人叹为观止的英语语音识别挑战上进行评估。 实验表明,我们的方法在测试集中实现了72.39%的准确性,在开发集中的80.98%,优于基线系统,通过非常大的边距。 我们提交的系统在挑战中排名第二是重点识别任务的第二名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号