Noise processing and multi-task learning for far-field dialect classification

Abstract

Deep learning has achieved great success in speech recognition. The spread of embedded devices such as smart speakers, together with growing demand for dialect interaction, poses major challenges for far-field speech recognition and dialect recognition. To address dialect recognition on embedded devices under far-field conditions, we propose a deep neural network model with multi-task learning. First, we apply the AQPA (audio qualitative pre-analysis) method to the raw data of ten local Chinese dialects to reduce the influence of steady-state and non-steady-state signals. We then define dialect recognition as the main task and dialect area as the auxiliary task, and use multi-task learning to improve the accuracy of dialect classification. Experimental results show that our approach improves accuracy by 20% on average compared with the single-task model without noise reduction.
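
The main-task/auxiliary-task setup described in the abstract can be illustrated with a combined loss. The following is a minimal sketch, not the authors' implementation: the function names, the auxiliary weight `aux_weight`, and the number of dialect areas (three here) are assumptions made for illustration, since the abstract gives only the number of dialects (ten).

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    """Negative log-probability assigned to the correct class index."""
    return -math.log(softmax(logits)[target])

def multitask_loss(dialect_logits, area_logits, dialect_label, area_label,
                   aux_weight=0.3):
    """Main-task loss plus a down-weighted auxiliary-task loss.

    aux_weight is a hypothetical hyperparameter; the abstract does not
    state how the two tasks are balanced.
    """
    main = cross_entropy(dialect_logits, dialect_label)  # dialect (main task)
    aux = cross_entropy(area_logits, area_label)         # dialect area (auxiliary)
    return main + aux_weight * aux

# Usage: one utterance, ten dialect classes, three assumed dialect areas.
dialect_logits = [0.0] * 10
area_logits = [0.0] * 3
loss = multitask_loss(dialect_logits, area_logits, dialect_label=4, area_label=1)
```

In a real model both sets of logits would come from two output heads on a shared encoder, so the gradient of the auxiliary term also shapes the features used by the main task.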

Bibliographic details

Source
International Conference on Advanced Cloud and Big Data | 2020 | pp. 143-148 | 6 pages
Authors
Hai Wang; Chenguang Qin; Kan Zhang; Ling Gao; Jie Ren
Original format
PDF
Keywords
Deep learning; Learning systems; Noise reduction; Neural networks; Speech recognition; Big Data; Steady-state

Similar literature

1. Multi-Task Learning for Classification with Dirichlet Process Priors [J]. Xue Ya, Liao Xuejun, Carin Lawrence. Journal of Machine Learning Research, 2007, January issue
2. Dermoscopic attributes classification using deep learning and multi-task learning [J]. Irek Saitov, Tatyana Polevaya, Andrey Filchenkov. Procedia Computer Science, 2020, No. 5
3. Automatic QC of denoise processing using a machine learning classification [J]. Maïza Bekara, Anthony Day. First Break, 2019, No. 9
4. Tooth recognition and classification using multi-task learning and post-processing in dental panoramic radiographs [C]. Takumi Morishita, Chisako Muramatsu, Xiangrong Zhou. Conference on Medical Imaging: Computer-Aided Diagnosis, 2021
5. Bayesian multi-task learning for clustering and classification with non-parametric priors [D]. An, Qi. 2008
6. Perception of Dialect Variation in Noise: Intelligibility and Classification [O]. Cynthia G. Clopper, Ann R. Bradlow
7. Transformation of discriminative single-task classification into generative multi-task classification in machine learning context [O]. Liu Han, Cocea Mihaela, Mohasseb Alaa. 2017
