首页> 外文会议>International Conference of Soft Computing and Pattern Recognition >Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network
【24h】

Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network

机译:颞言语归一化方法使用神经网络语音识别比较

获取原文

摘要

Speech signal is temporally and acoustically varies. Recognition of speech by static input Neural Network requires temporal normalization of the speech to be equal to the number of input nodes of the NN while maintaining the properties of the speech. This paper compares three methods for speech temporal normalization namely the linear, extended linear and zero padded normalizations on isolated speech using different sets of learning parameters on multi layer perceptron neural network with adaptive learning. Although, previous work shows that linear normalization able to give high accuracy up to 95% on similar problem, the result in this experiment shows the opposite. The experimental result shows that zero padded normalization outperformed the two linear normalization methods using all the parameter sets tested. The highest recognition rate using zero padded normalization is 99% while linear and extended linear normalizations give only 74% and 76% respectively. This paper end before conclusion by comparing data used from previous work using linear normalization which gave high accuracy and the data used in this experiment which perform poorer.
机译:语音信号在时间上且声学上变化。通过静态输入神经网络识别语音,需要对语音的时间归一化等于NN的输入节点的数量,同时保持语音的性质。本文比较了语音时间标准化的三种方法,即使用不同层次的学习参数与自适应学习的不同学习参数对隔离语音的线性,扩展线性和零填充训练。虽然以前的工作表明,线性标准化能够在类似问题上提供高达95%的高精度,但该实验的结果表明相反。实验结果表明,零填充标准化优于使用测试所有参数集的两个线性归一化方法。使用零填充标准化的最高识别率为99%,而线性和扩展线性训练分别仅提供74%和76%。本文结束前结束,通过比较了使用线性归一化从先前的工作中使用的数据进行了高精度和该实验中使用的数据,这些实验中使用的数据进行了高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号