Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application



Abstract

This work proposes a prototype model for speech recognition in the Assamese language using Linear Predictive Coding (LPC) and Mel-frequency cepstral coefficients (MFCC). The speech recognizer is part of a speech-to-text conversion system. The LPC and MFCC features are extracted by two different Recurrent Neural Networks (RNNs), which are used to recognize vocal extracts of Assamese, a major language of the north-eastern part of India. The decision block is designed as a combined framework of RNN blocks to extract the features. With this combined architecture, the system achieves a 10% gain in recognition rate over the case where the individual architectures are used.
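The abstract does not give implementation details, so the following is only a minimal sketch of two ingredients it names: LPC analysis of a speech frame, and a decision block that combines two recognizers. It assumes the autocorrelation method with Levinson-Durbin recursion for LPC and a plain average of per-class scores for the combined decision; both choices, and the function names `lpc_coefficients` and `fuse_scores`, are illustrative assumptions and not the authors' method. MFCC extraction and the RNNs themselves are omitted.

```python
import numpy as np


def lpc_coefficients(frame, order):
    """Estimate LPC coefficients of one frame (assumed: autocorrelation method).

    Returns the prediction-error filter a[0..order] with a[0] == 1, so that
    the frame is modelled as x[n] + a[1]*x[n-1] + ... + a[order]*x[n-order] ~ 0.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])

    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]  # prediction error energy, updated at each recursion step
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or degenerate frame: stop early
        # Reflection coefficient for model order i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        prev = a.copy()
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def fuse_scores(lpc_scores, mfcc_scores):
    """Combine per-class scores of two recognizers (assumed: simple averaging).

    Returns the index of the winning class and the fused score vector.
    """
    fused = (np.asarray(lpc_scores, dtype=float)
             + np.asarray(mfcc_scores, dtype=float)) / 2.0
    return int(np.argmax(fused)), fused


# Hypothetical usage: a decaying exponential frame and made-up class scores.
frame = 0.5 ** np.arange(50)
coeffs = lpc_coefficients(frame, order=2)
winner, fused = fuse_scores([0.6, 0.4], [0.2, 0.8])
```

In this sketch the frame obeys x[n] = 0.5 x[n-1], so the first LPC coefficient comes out close to -0.5 and the second close to zero. The fusion step shows how two recognizers that disagree can still yield a single decision; the paper may well use a different combination rule.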


Bibliographic record

Source
2012 International Conference on Communications, Devices and Intelligent Systems | 2012 | pp. 600-603 | 4 pages
Conference location: Kolkata (IN)
Authors
Dutta Krishna; Sarma Kandarpa Kumar
Author affiliation

Department of ECE, Gauhati University, Guwahati-781014, Assam, India;

Conference organizer
Original format: PDF
Text language: eng
Chinese Library Classification: Communications
Keywords
LPC; MFCC; Moving Average Filter; RNN;


Similar literature

1. RNN-based prosodic modeling for Mandarin speech and its application to speech-to-text conversion [J]. Wern-Jun Wang, Yuan-Fu Liao, Sin-Horng Chen. Speech Communication. 2002, Issue 3-4.
2. Composite Feature Set for Speech to Text Conversion in Assamese [J]. Krishna Dutta, Kandarpa Kumar Sarma. Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation. 2013, Issue 2.
3. Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech [J]. Agarwalla Swapna, Sarma Kandarpa Kumar. Neural Networks: The Official Journal of the International Neural Network Society. 2016.
4. Multiple feature extraction for RNN-based Assamese speech recognition for speech to text conversion application [C]. Dutta Krishna, Sarma Kandarpa Kumar. International Conference on Communications, Devices and Intelligent Systems. 2012.
5. Two modified methods of feature extraction for automatic speech recognition [D]. Ge, Wangning. 2013.
6. On the Speech Properties and Feature Extraction Methods in Speech Emotion Recognition [O]. Juraj Kacur, Boris Puterka, Jarmila Pavlovicova. 2021.
7. Speech analysis with a non-linear speech model and feature extraction for speech recognition [O]. Τσιάκουλης Πύρρος Γ., Tsiakoulis Pirros G. 2011.
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R]. Ore, B. M. 2009.
