首页> 外文会议>IEEE International Multitopic Conference >Voice controlled Urdu Interface using isolated and continuous speech recognizer
【24h】

Voice controlled Urdu Interface using isolated and continuous speech recognizer

机译:使用隔离和连续语音识别器的语音控制URDU接口

获取原文

摘要

We propose multilayer feed forward Neural Network for Urdu isolated words and Urdu continuous speech recognition. Our application not only recognizes the voice commands but also converts continuous speech into Urdu Fonts. Our work is based on converting analog signal to digital signals, removing silence and noise through filters, extracting 39 Mel Frequency Cepstral Coefficients(MFCCs) from the speech signal, applying k-mean algorithm and using them as an input to multi-layer perceptron neural networks. Urdu text stores in “Jameel Noori Nastaleeq” fonts. Isolated words gain accuracy of 95.6% in speaker dependent mode while 30–40% in speaker independent mode. Continuous speech also shows a good accuracy of 96.5% in speaker dependent mode and 30–40% for speaker independent mode. Current work is being extended to improve accuracy for speaker independent system by improving dataset and feature set.
机译:我们向Urdu孤立的单词和URDU连续语音识别提出多层馈线前锋神经网络。 我们的应用程序不仅识别了语音命令,还识别到urdu字体中的连续演奏。 我们的作品基于将模拟信号转换为数字信号,通过滤波器去除静音和噪声,从语音信号中提取39麦频谱系数(MFCC),应用K均值算法,并将它们用作多层Perceptron神经的输入 网络。 乌尔都语文本存储在“ jameel noori nastaleeq” 字体。 孤立的单词在扬声器依赖模式下获得95.6%的准确性,而扬声器独立模式下的30-40%。 扬声器依赖模式下,连续演讲也显示出96.5%的良好精度,扬声器独立模式为30-40%。 通过改进数据集和功能集,扩展了当前的工作以提高扬声器独立系统的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号