Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge

Abstract

This paper describes the AISpeech-SJTU system for the accent identification track of the Interspeech-2020 Accented English Speech Recognition Challenge. In this track, only 160 hours of accented English data collected from 8 countries, plus the auxiliary Librispeech dataset, are provided for training. To build an accurate and robust accent identification system, we explore the whole system pipeline in detail. First, we introduce the ASR-based phone posteriorgram (PPG) feature to accent identification and verify its efficacy. Then, for the first time, a novel TTS-based approach is carefully designed to augment the very limited accent training data. Finally, we propose test-time augmentation and embedding fusion schemes to further improve system performance. Our final system ranked first in the challenge and outperforms all other participants by a large margin. The submitted system achieves 83.63% average accuracy on the challenge evaluation data, more than 10% absolute ahead of the other systems.
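
The abstract names test-time augmentation and embedding fusion but does not say which perturbations or fusion operator were used. The following is a minimal illustrative sketch, not the authors' implementation. It assumes speed perturbation as the test-time augmentation, posterior averaging to combine the augmented views, and concatenation of length-normalized vectors as the fusion. The names `speed_perturb`, `tta_predict`, `fuse_embeddings`, and `toy_score_fn` are hypothetical, and `toy_score_fn` is only a stand-in for a trained accent classifier.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def speed_perturb(wave, rate):
    """Change playback speed by linear-interpolation resampling.

    Assumption: a crude stand-in for a proper resampler; rate > 1
    shortens the signal, rate < 1 lengthens it.
    """
    n_out = max(1, int(round(len(wave) / rate)))
    positions = np.linspace(0, len(wave) - 1, n_out)
    return np.interp(positions, np.arange(len(wave)), wave)


def tta_predict(wave, score_fn, rates=(0.9, 1.0, 1.1)):
    """Average class posteriors over several perturbed copies of one utterance."""
    posteriors = [softmax(score_fn(speed_perturb(wave, r))) for r in rates]
    return np.mean(posteriors, axis=0)


def fuse_embeddings(embeddings):
    """Length-normalize each embedding, then concatenate them.

    Assumption: concatenation is one common fusion choice; the source
    does not state which operator was used.
    """
    normed = [e / (np.linalg.norm(e) + 1e-12) for e in embeddings]
    return np.concatenate(normed)


def toy_score_fn(wave, n_accents=8):
    """Hypothetical scorer: maps three waveform statistics to 8 accent logits."""
    stats = np.array([wave.mean(), wave.std(), np.abs(wave).max()])
    weights = np.random.default_rng(0).standard_normal((3, n_accents))
    return stats @ weights


if __name__ == "__main__":
    utterance = np.sin(np.linspace(0, 100, 16000))  # synthetic 1-second signal
    posterior = tta_predict(utterance, toy_score_fn)
    print("predicted accent index:", int(posterior.argmax()))
```

In a real system, `toy_score_fn` would be replaced by the trained classifier, and the fused vector from `fuse_embeddings` would feed a back-end classifier rather than being used directly.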