首页> 外文期刊>Malaysian Journal of Computer Science >Artificial Neural Network-Based Speech Recognition Using Dwt Analysis Applied On Isolated Words From Oriental Languages
【24h】

Artificial Neural Network-Based Speech Recognition Using Dwt Analysis Applied On Isolated Words From Oriental Languages

机译:基于Dwt分析的基于人工神经网络的语音识别在东方语言孤立词中的应用

获取原文
       

摘要

Speech recognition is an emerging research area having its focus on human computer interactions (HCI) and expert systems. Analyzing speech signals are often tricky for processing, due to the non-stationary nature of audio signals. The work in this paper presents a system for speaker independent speech recognition, which is tested on isolated words from three oriental languages, i.e., Urdu,Persian, and Pashto. The proposed approach combines discrete wavelet transform (DWT) and feed-forward artificial neural network (FFANN) for the purpose of speech recognition. DWT is used for feature extraction and the FFANN is utilized for the classification purpose. The task of isolated word recognition is accomplished with speech signal capturing, creating a code bank of speech samples, and then by applying pre-processing techniques.For classifying a wave sample, four layered FFANN model is used with resilient back-propagation (Rprop). The proposed system yields high accuracy for two and five classes.For db-8 level-5 DWT filter 98.40%, 95.73%, and 95.20% accuracy rate is achieved with 10, 15, and 20 classes, respectively. Haar level-5 DWT filter shows 97.20%, 94.40%, and 91% accuracy ratefor 10, 15, and 20 classes, respectively. The proposed system is also compared with a baseline method where it shows better performance. The proposed system can be utilized as a communication interface to computing and mobile devices for low literacy regions.
机译:语音识别是一个新兴的研究领域,其重点是人机交互(HCI)和专家系统。由于音频信号的非平稳性,分析语音信号通常很难处理。本文的工作提出了一种用于说话人独立语音识别的系统,该系统在来自三种东方语言(即乌尔都语,波斯语和普什图语)的孤立单词上进行了测试。所提出的方法结合了离散小波变换(DWT)和前馈人工神经网络(FFANN),用于语音识别。 DWT用于特征提取,而FFANN用于分类。孤立语音识别的任务是通过语音信号捕获,创建语音样本代码库,然后通过应用预处理技术来完成的。为了对波形样本进行分类,使用了四层FFANN模型和弹性反向传播(Rprop) 。所提出的系统可产生两级和五级的高精度。对于db-8级别5 DWT滤波器,分别达到10级,15级和20级时,准确率分别达到98.40%,95.73%和95.20%。 Haar 5级DWT滤波器在10、15和20类中的准确率分别为97.20%,94.40%和91%。拟议的系统还与基线方法进行了比较,该方法显示出更好的性能。所提出的系统可以用作针对低识字区域的计算和移动设备的通信接口。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号