Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network

机译：颞言语归一化方法使用神经网络语音识别比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech signal is temporally and acoustically varies. Recognition of speech by static input Neural Network requires temporal normalization of the speech to be equal to the number of input nodes of the NN while maintaining the properties of the speech. This paper compares three methods for speech temporal normalization namely the linear, extended linear and zero padded normalizations on isolated speech using different sets of learning parameters on multi layer perceptron neural network with adaptive learning. Although, previous work shows that linear normalization able to give high accuracy up to 95% on similar problem, the result in this experiment shows the opposite. The experimental result shows that zero padded normalization outperformed the two linear normalization methods using all the parameter sets tested. The highest recognition rate using zero padded normalization is 99% while linear and extended linear normalizations give only 74% and 76% respectively. This paper end before conclusion by comparing data used from previous work using linear normalization which gave high accuracy and the data used in this experiment which perform poorer.

机译：语音信号在时间上且声学上变化。通过静态输入神经网络识别语音，需要对语音的时间归一化等于NN的输入节点的数量，同时保持语音的性质。本文比较了语音时间标准化的三种方法，即使用不同层次的学习参数与自适应学习的不同学习参数对隔离语音的线性，扩展线性和零填充训练。虽然以前的工作表明，线性标准化能够在类似问题上提供高达95％的高精度，但该实验的结果表明相反。实验结果表明，零填充标准化优于使用测试所有参数集的两个线性归一化方法。使用零填充标准化的最高识别率为99％，而线性和扩展线性训练分别仅提供74％和76％。本文结束前结束，通过比较了使用线性归一化从先前的工作中使用的数据进行了高精度和该实验中使用的数据，这些实验中使用的数据进行了高精度。

著录项

来源
《International Conference of Soft Computing and Pattern Recognition》|2009年||共6页
会议地点
作者
Md Sah Bin Hj Salam; Dzulkifli Mohamad; Sheikh Hussain Shaikh Salleh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301-53;
关键词
Temporal Normalization; Neural Network; Adaptive Learning; Speech Recognition;

机译：时间正常化;神经网络;自适应学习;语音识别;

相似文献

外文文献
中文文献
专利

1. Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks [J] . Isar Nejadgholi, Seyyed Ali Seyyedsalehi Neural computing & applications . 2009,第1期

机译：语音识别神经网络中输入模式对说话人变异性的非线性归一化
2. Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks [J] . Isar Nejadgholi, Seyyed Ali Seyyedsalehi Neural Computing & Applications . 2009,第1期

机译：语音识别神经网络中输入模式对说话人变异性的非线性归一化
3. Temporal Structure Normalization of Speech Feature for Robust Speech Recognition [J] . Xiao X., Chng E. S., Li H. IEEE signal processing letters . 2007,第7期

机译：语音特征的时态结构归一化，用于鲁棒语音识别
4. Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network [C] . Salam Md Sah Bin Hj, Mohamad Dzulkifli, Salleh Sheikh Hussain Shaikh Soft Computing and Pattern Recognition, 2009. SOCPAR '09 . 2009

机译：神经网络语音识别中的时间语音归一化方法比较
5. Duration normalization for robust recognition of spontaneous speech via missing feature methods. [D] . Nedel, Jon P. 2004

机译：持续时间归一化，可通过缺失特征方法对自发语音进行可靠识别。
6. Human EEG and Recurrent Neural Networks Exhibit Common Temporal Dynamics During Speech Recognition [O] . Saeedeh Hashemnia, Lukas Grasse, Shweta Soni, 2021

机译：人体EEG和经常性神经网络在语音识别期间表现出共同的时间动态
7. A comparative review of dynamic neural networks and hidden Markov model methods for mobile on-device speech recognition [O] . Mustafa Mohammed, Allen Tony, Appiah Kofi 2017

机译：动态神经网络与隐马尔可夫模型方法的移动设备语音识别比较研究

Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅