首页> 外文会议>International Conference on Advanced Cloud and Big Data >Noise processing and multi-task learning for far-field dialect classification
【24h】

Noise processing and multi-task learning for far-field dialect classification

机译:远场方言分类的噪声处理和多任务学习

获取原文

摘要

Deep learning has made great achievements in the field of speech recognition. With the popularization of embedded devices such as Intelligent speaker and the demand for dialect interaction scenes, it poses great challenges to far-field speech recognition and dialect language recognition. In order to solve the dialect language recognition of embedded devices in far-field speech recognition, we propose a deep learning neural network model with multi-task learning. First, we apply the AQPA(audio qualitative pre-analysis) method on the raw data of ten local Chinese dialects to reduce the influencing factors of steady-state and non-steady-state signals. Then we define dialect recognition as the main task and dialect area as the auxiliary task, using the multi-task learning method to improve the accuracy of dialect classification. The experimental results show that our approach improves accuracy with an average of 20% when compared with the single-task model without noise reduction.
机译:深入学习在语音识别领域取得了巨大成就。 随着智能扬声器等嵌入式设备的推广以及对方言交互场景的需求,它对远场语音识别和方言语言识别产生了极大的挑战。 为了解决广播语音识别中嵌入式设备的方言语言识别,我们提出了一种具有多任务学习的深度学习神经网络模型。 首先,我们在10个本地中文方针的原始数据上应用AQPA(音频定性预分析)方法,以减少稳态和非稳态信号的影响因素。 然后我们将方言识别定义为主要任务和方言区域作为辅助任务,使用多任务学习方法来提高方言分类的准确性。 实验结果表明,与没有降噪的单任务模型相比,我们的方法平均提高了20%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号