首页> 外文会议>2012 15th IEEE International Multitopic Conference >Voice controlled Urdu Interface using isolated and continuous speech recognizer
【24h】

Voice controlled Urdu Interface using isolated and continuous speech recognizer

机译:使用隔离和连续语音识别器的语音控制Urdu接口

获取原文
获取原文并翻译 | 示例

摘要

We propose multilayer feed forward Neural Network for Urdu isolated words and Urdu continuous speech recognition. Our application not only recognizes the voice commands but also converts continuous speech into Urdu Fonts. Our work is based on converting analog signal to digital signals, removing silence and noise through filters, extracting 39 Mel Frequency Cepstral Coefficients(MFCCs) from the speech signal, applying k-mean algorithm and using them as an input to multi-layer perceptron neural networks. Urdu text stores in “Jameel Noori Nastaleeq” fonts. Isolated words gain accuracy of 95.6% in speaker dependent mode while 30–40% in speaker independent mode. Continuous speech also shows a good accuracy of 96.5% in speaker dependent mode and 30–40% for speaker independent mode. Current work is being extended to improve accuracy for speaker independent system by improving dataset and feature set.
机译:我们提出了用于乌尔都语孤立词和乌尔都语连续语音识别的多层前馈神经网络。我们的应用程序不仅可以识别语音命令,还可以将连续语音转换为Urdu字体。我们的工作基于将模拟信号转换为数字信号,通过滤波器消除静音和噪声,从语音信号中提取39个Mel频率倒谱系数(MFCC),应用k均值算法并将其用作多层感知器神经的输入网络。乌尔都语文本存储使用“ Jameel Noori Nastaleeq”字体。在扬声器相关模式下,隔离的单词的准确度为95.6%,而在扬声器独立模式下为30–40%。连续语音在与说话者相关的模式下也显示出96.5%的良好准确性,而与说话者无关的模式下则具有30–40%的精度。当前的工作正在扩展,以通过改进数据集和功能集来提高独立于扬声器的系统的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号