首页> 外文会议>Information Communication Technologies Conference >The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language
【24h】

The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language

机译:使用神经网络提高乌兹别克语爆炸音和清音的识别精度

获取原文

摘要

Currently, speech recognition systems are becoming more widespread, especially in those applications where speech dialogue is the most convenient mean of information control and exchange with technical facilities. Obtaining an effective voice control system is currently an important task, requiring the development of methods to obtain high recognition accuracy of voice commands. Under these conditions, along with reducing the noise effect on the quality of recognition, the task is to increase the accuracy in voice control system operation, to increase the likelihood of correct command recognition under stationary interference. Another requirement in the voice control system is the correct recognition speed, since the system must work in real time. In this paper, we propose an algorithm for voice control by technical facilities, based on Uzbek language. Some sounds in Uzbek language (explosive and unvoiced consonants) differ strongly from the sounds in other languages; when creating control algorithms, the recognition accuracy does not meet the requirements. Therefore, to ensure the necessary processing speed and maintain the required accuracy, it is proposed to introduce an additional normalization with a decrease in feature space. The algorithm is based on the principle of primary separation of speech signal spectral characteristics. Further, the signal spectrum is normalized and its resolution increases due to the use of low-frequency conversion and logarithms. The obtained cepstral coefficients are fed to the input of a previously learned neural network.
机译:当前,语音识别系统正在变得越来越普遍,尤其是在那些语音对话是信息控制和与技术设施交换的最便捷手段的那些应用中。目前,获得有效的语音控制系统是一项重要任务,需要开发获得语音命令的高识别精度的方法。在这种情况下,除了降低噪声对识别质量的影响外,任务是提高语音控制系统操作的准确性,以增加在固定干扰下正确命令识别的可能性。语音控制系统的另一个要求是正确的识别速度,因为该系统必须实时工作。在本文中,我们提出了一种基于乌兹别克语言的技术设施语音控制算法。乌兹别克语中的某些声音(爆炸和清音辅音)与其他语言中的声音有很大差异;创建控制算法时,识别精度不符合要求。因此,为了确保必要的处理速度并保持所需的精度,建议引入附加的归一化,其特征空间减小。该算法基于语音信号频谱特征的主要分离原理。此外,由于使用了低频转换和对数,信号频谱被归一化并且其分辨率提高。所获得的倒谱系数被馈送到先前学习的神经网络的输入。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号