首页> 外文会议>2012 International Conference on Communications, Devices and Intelligent Systems. >Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application
【24h】

Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application

机译:基于RNN的阿萨姆语语音识别的多特征提取,用于语音到文本的转换应用

获取原文
获取原文并翻译 | 示例

摘要

The current work proposes a prototype model for speech recognition in Assamese language using Linear Predictive Coding (LPC) and Mel frequency cepstral coefficient (MFCC). The speech recognition is a part of a speech to text conversion system. The LPC and MFCC features are extracted by two different Recurrent Neural Networks (RNN), which are used to recognize the vocal extract of Assamese language- a major language in the North Eastern part of India. In this work, decision block is designed by a combined framework of RNN block to extract the features. Using this combined architecture our system is able to generate 10% gain in the recognition rate than the case when individual architectures are used.
机译:当前的工作提出了使用线性预测编码(LPC)和梅尔频率倒谱系数(MFCC)的阿萨姆语语音识别原型模型。语音识别是语音到文本转换系统的一部分。 LPC和MFCC特征是通过两个不同的递归神经网络(RNN)提取的,用于识别阿萨姆语的语音提取物,阿萨姆语是印度东北部的主要语言。在这项工作中,决策块由RNN块的组合框架设计,以提取特征。使用这种组合架构,与使用单个架构的情况相比,我们的系统能够产生10%的识别率增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号