首页> 外文会议>Spoken Language Technology Workshop >IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
【24h】

IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines

机译:IEEE SLT 2021 Alpha-Mini语音挑战:打开数据集,曲目,规则和基线

获取原文

摘要

The IEEE Spoken Language Technology Workshop (SLT) 2021 Alpha-mini Speech Challenge (ASC) is intended to improve research on keyword spotting (KWS) and sound source location (SSL) on humanoid robots. Many publications report significant improvements in deep learning based KWS and SSL on open source datasets in recent years. For deep learning model training, it is necessary to expand the data coverage to improve the model robustness. Thus, simulating multi-channel noisy and reverberant data from single-channel speech, noise, echo and room impulsive response (RIR) is widely adopted. However, this approach may generate mismatch between simulated data and recorded data in real application scenarios, especially echo data. In this challenge, we open source a sizable speech, keyword, echo and noise corpus for promoting data-driven methods, particularly deep-learning approaches on KWS and SSL. We also choose Alpha-mini, a humanoid robot produced by UBTECH equipped with a built-in four-microphone array on its head, to record development and evaluation sets under the actual Alpha-mini robot application scenario, including environ-mental noise as well as echo and mechanical noise generated by the robot itself for model evaluation. Furthermore, we illustrate the rules, evaluation methods and baselines for re-searchers to quickly assess their achievements and optimize their models.
机译:IEEE口语技术研讨会(SLT)2021 alpha-mini语音挑战(ASC)旨在改进人类机器人的关键字点击(KWS)和声源位置(SSL)的研究。许多出版物近年来在开源数据集中报告基于深度学习的KWS和SSL的显着改进。对于深度学习模型培训,有必要扩展数据覆盖范围以提高模型鲁棒性。因此,广泛采用了从单通道语音,噪声,回波和室脉冲响应(RIR)的多通道噪声和混响数据。然而,这种方法可以在真实应用场景中的模拟数据和记录数据之间产生不匹配,特别是回声数据。在这一挑战中,我们开源了一个可信语音,关键字,回声和噪声语料库,用于推广数据驱动方法,特别是KWS和SSL上的深度学习方法。我们还选择Alpha-Mini,由Ubtech生产的人形机器人,其头部配备内置的四麦克风阵列,在实际的Alpha-Mini机器人应用场景下记录开发和评估集,包括环境噪音作为机器人本身产生的回声和机械噪声,用于模型评估。此外,我们说明了重新搜索者的规则,评估方法和基线,以便快速评估他们的成就并优化其模型。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号