首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
【24h】

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

机译:突出的英语语音识别挑战2020:打开数据集,曲目,基线,结果和方法

获取原文

摘要

The variety of accents has posed a big challenge to speech recognition. The Accented English Speech Recognition Challenge (AESRC2020) is designed for providing a common testbed and promoting accent-related research. Two tracks are set in the challenge – English accent recognition (track 1) and accented English speech recognition (track 2). A set of 160 hours of accented English speech collected from 8 countries is released with labels as the training set. Another 20 hours of speech without labels is later released as the test set, including two unseen accents from another two countries used to test the model generalization ability in track 2. We also provide baseline systems for the participants. This paper first reviews the released dataset, track setups, baselines and then summarizes the challenge results and major techniques used in the submissions.
机译:各种口音对语音识别构成了很大的挑战。 强调的英语语音识别挑战(AESRC2020)旨在提供共同的测试平台和促进相关的重点研究。 在挑战中设定了两条轨道 - 英语口音识别(轨道1)和重音的英语语音识别(Track 2)。 从8个国家/地区收集的一套160小时的令人叹为观的英语演讲,以标签作为培训集发布。 后来没有标签的另外20小时的言论作为测试集,包括来自另外两个国家的两个看不见的口音,用于测试轨道2中的模型泛化能力。我们还为参与者提供基线系统。 本文首先审查了发布的数据集,跟踪设置,基准,然后总结了提交中使用的挑战结果和主要技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号