A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition

机译：基于中智学的卷积神经网络模型用于噪声语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Convolutional neural networks are sensitive to unknown noisy condition in the test phase and so their performance degrades for the noisy data classification task including noisy speech recognition. In this research, a new convolutional neural network (CNN) model with data uncertainty handling; referred as NCNN (Neutrosophic Convolutional Neural Network); is proposed for classification task. Here, speech signals are used as input data and their noise is modeled as uncertainty. In this task, using speech spectrogram, a definition of uncertainty is proposed in neutrosophic (NS) domain. Uncertainty is computed for each Time-frequency point of speech spectrogram as like a pixel. Therefore, uncertainty matrix with the same size of spectrogram is created in NS domain. In the next step, a two parallel paths CNN classification model is proposed. Speech spectrogram is used as input of the first path and uncertainty matrix for the second path. The outputs of two paths are combined to compute the final output of the classifier. To show the effectiveness of the proposed method, it has been compared with conventional CNN on the isolated words of Aurora2 dataset. The proposed method achieves the average accuracy of 85.96 in noisy train data. It is more robust against noises with accuracies 90, 88 and 81 in test sets A, B and C, respectively. Results show that the proposed method outperforms conventional CNN with the improvement of 6, 5 and 2 percentage in test set A, test set B and test sets C, respectively. It means that the proposed method is more robust against noisy data and handle these data effectively.

机译：卷积神经网络在测试阶段对未知的嘈杂条件敏感，因此对于包括嘈杂语音识别在内的嘈杂数据分类任务，卷积神经网络的性能会下降。在这项研究中，一种新的具有数据不确定性处理的卷积神经网络（CNN）模型;称为NCNN（中性卷积神经网络）;建议用于分类任务。在这里，语音信号被用作输入数据，其噪声被建模为不确定性。在此任务中，使用语音频谱图，在中智（NS）域中提出了不确定性的定义。对于语音频谱图的每个时频点，像像素一样计算不确定度。因此，在NS域中创建了具有相同频谱图大小的不确定性矩阵。在下一步中，提出了两个并行路径的CNN分类模型。语音频谱图用作第一路径的输入和第二路径的不确定性矩阵。合并两个路径的输出以计算分类器的最终输出。为了显示该方法的有效性，已将其与常规CNN在Aurora2数据集的孤立单词上进行了比较。所提出的方法在嘈杂的列车数据中达到了85.96的平均准确度。它在测试集A，B和C中的精度分别为90、88和81时更加强大。结果表明，所提出的方法在测试集A，测试集B和测试集C上分别比传统的CNN分别提高了6、5和2个百分点。这意味着所提出的方法对噪声数据更鲁棒，并且可以有效地处理这些数据。

著录项

来源
《International Conference on Pattern Recognition and Image Analysis》|2019年|87-92|共6页
会议地点
作者
Elyas Rashno; Ahmad Akbari; Babak Nasersharif;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Noise measurement; Spectrogram; Uncertainty; Convolution; Data models; Task analysis; Feature extraction;

机译：噪声测量频谱图不确定度卷积数据模型任务分析特征提取;

相似文献

外文文献
中文文献
专利

1. 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition [J] . Mingyi Chen, Xuanji He, Jing Yang, IEEE signal processing letters . 2018,第10期

机译：具有注意力模型的3-D卷积递归神经网络用于语音情感识别
2. Speech Emotion Recognition based on Multi-Level Residual Convolutional Neural Networks [J] . Kai Zheng, ZhiGuang Xia, Yi Zhang, Engineering Letters . 2020,第2期

机译：基于多级残余卷积神经网络的语音情感识别
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition [C] . Elyas Rashno, Ahmad Akbari, Babak Nasersharif International Conference on Pattern Recognition and Image Analysis . 2019

机译：基于中性学噪声识别的卷积神经网络模型
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition. [D] . Belilovsky, Eugene. 2011

机译：用于与说话人无关的语音识别的卷积神经网络。
6. Cascaded Convolutional Neural Network Architecture for Speech Emotion Recognition in Noisy Conditions [O] . Youngja Nam, Chankyu Lee 2021

机译：级联卷积神经网络架构用于嘈杂的条件下的语音情感识别
7. A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition [O] . Elyas Rashno, Ahmad Akbari, Babak Nasersharif 2019

机译：基于中性学噪声识别的卷积神经网络模型

A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅