The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language

机译：使用神经网络提高乌兹别克语爆炸音和清音的识别精度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Currently, speech recognition systems are becoming more widespread, especially in those applications where speech dialogue is the most convenient mean of information control and exchange with technical facilities. Obtaining an effective voice control system is currently an important task, requiring the development of methods to obtain high recognition accuracy of voice commands. Under these conditions, along with reducing the noise effect on the quality of recognition, the task is to increase the accuracy in voice control system operation, to increase the likelihood of correct command recognition under stationary interference. Another requirement in the voice control system is the correct recognition speed, since the system must work in real time. In this paper, we propose an algorithm for voice control by technical facilities, based on Uzbek language. Some sounds in Uzbek language (explosive and unvoiced consonants) differ strongly from the sounds in other languages; when creating control algorithms, the recognition accuracy does not meet the requirements. Therefore, to ensure the necessary processing speed and maintain the required accuracy, it is proposed to introduce an additional normalization with a decrease in feature space. The algorithm is based on the principle of primary separation of speech signal spectral characteristics. Further, the signal spectrum is normalized and its resolution increases due to the use of low-frequency conversion and logarithms. The obtained cepstral coefficients are fed to the input of a previously learned neural network.

机译：当前，语音识别系统正在变得越来越普遍，尤其是在那些语音对话是信息控制和与技术设施交换的最便捷手段的那些应用中。目前，获得有效的语音控制系统是一项重要任务，需要开发获得语音命令的高识别精度的方法。在这种情况下，除了降低噪声对识别质量的影响外，任务是提高语音控制系统操作的准确性，以增加在固定干扰下正确命令识别的可能性。语音控制系统的另一个要求是正确的识别速度，因为该系统必须实时工作。在本文中，我们提出了一种基于乌兹别克语言的技术设施语音控制算法。乌兹别克语中的某些声音（爆炸和清音辅音）与其他语言中的声音有很大差异;创建控制算法时，识别精度不符合要求。因此，为了确保必要的处理速度并保持所需的精度，建议引入附加的归一化，其特征空间减小。该算法基于语音信号频谱特征的主要分离原理。此外，由于使用了低频转换和对数，信号频谱被归一化并且其分辨率提高。所获得的倒谱系数被馈送到先前学习的神经网络的输入。

著录项

来源
《Information Communication Technologies Conference》|2020年|231-234|共4页
会议地点
作者
Muhammadjon Musaev; Ilyos Khujayorov; Mannon Ochilov;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
speech recognition; neural network; CNN; MFCC; phoneme; voice control; FFT; Uzbek phonemes;

机译：语音识别;神经网络; CNN; MFCC;音素;语音控制; FFT;乌兹别克音素;

相似文献

外文文献
中文文献
专利

1. A novel objective function for improved phoneme recognition using time-delay neural networks [J] . Hampshire J.B. II, Waibel A.H. IEEE Transactions on Neural Networks . 1990,第2期

机译：使用时延神经网络改善音素识别的新目标函数
2. Improving Recurrent Neural Networks for Offline Arabic Handwriting Recognition by Combining Different Language Models [J] . Jemni Sana Khamekhem, Kessentini Yousri, Kanoun Slim International Journal of Pattern Recognition and Artificial Intelligence . 2020,第12期

机译：通过组合不同的语言模型，改进反际阿拉伯语手写识别的反复性神经网络
3. MULTIPLE NEURAL NETWORK INTEGRATION USING A BINARY DECISION TREE TO IMPROVE THE ECG SIGNAL RECOGNITION ACCURACY [J] . Hoai Linh TRAN, Van Nam PHAM, Hoang Nam VUONG International Journal of Applied Mathematics and Computer Science . 2014,第3期

机译：使用二叉决策树的多个神经网络集成，以提高ECG信号识别精度
4. The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language [C] . Muhammadjon Musaev, Ilyos Khujayorov, Mannon Ochilov Information Communication Technologies Conference . 2020

机译：神经网络的使用提高乌兹别克语语言爆炸性和清音音素的识别准确性
5. Achieving Consistent Near-Optimal Pattern Recognition Accuracy Using Particle Swarm Optimization to Pre-Train Artificial Neural Networks. [D] . Nikelshpur, Dmitry O. 2014

机译：使用粒子群优化对训练前的人工神经网络实现一致的近似最佳模式识别精度。
6. Improving the Recognition Accuracy of Memristive Neural Networks via Homogenized Analog Type Conductance Quantization [O] . Qilai Chen, Tingting Han, Minghua Tang, 2020

机译：通过均质模拟电导量化提高忆阻神经网络的识别精度
7. Low Power Tiny Binary Neural Network with improved accuracy in Human Recognition Systems [O] . Antonio De Vita, Danilo Pau, Luigi Di Benedetto, 2020

机译：低功耗微小的二元神经网络，具有提高人力识别系统精度

The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language

摘要

著录项

相似文献

相关主题

期刊订阅