Bengali Spoken Numerals Recognition by MFCC and GMM Technique

机译：MFCC和GMM技术的孟加拉语口语标号识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech is the standard vocalized communication media. Speech is one of the comfortable way for humans to communicate with each other. Similarly, speech recognition system is eagerly necessary to communicate with computer through voice. Speech recognition in English language already helps us to operate English voice command-based applications. But in rural and semi-urban areas, due to lack of knowledge in English in India, it is necessary to implement automatic speech recognition in regional languages. Here, we have built a Gaussian Mixture Model (GMM)-based Bengali (also called Bangla) isolated spoken numerals recognition system where mel frequency cepstral coefficients denoted as MFCC is taken for feature extraction. The proposed system achieved 91.7% correct prediction for the Bangla numeral data set of 1000 audio samples for 10 classes which is satisfactory for previous Bangla spoken digit recognition.

机译：语音是标准的发声通信媒体。言语是人类互相沟通的舒适方式之一。类似地，语音识别系统急切地需要通过语音与计算机通信。语音识别英语已经帮助我们操作基于英语语音命令的应用程序。但在农村和半城区地区，由于印度英语知识缺乏知识，有必要在区域语言中实施自动演讲。在这里，我们建立了一个高斯混合模型（GMM） - 基于Bengali（也称为Bangla）隔离的口头标数识别系统，其中拍摄为MFCC的MEL频率剖面系数用于特征提取。所提出的系统为10个音频样本的Bangla数字数据集进行了91.7％，对于10个类，这对于先前的Bangla口语数字识别令人满意。

著录项

来源
《International Conference on Emerging Trends and Advances in Electrical Engineering and Renewable Energy》|2020年|85-96|共12页
会议地点
作者
Bachchu Paul; Somnath Bera; Rakesh Paul; Santanu Phadikar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
ASR; Zero crossing; FFT; MFCC; HMM; DTW; GMM;

机译：asr;零横穿;FFT;MFCC;唔;DTW;GMM.;

相似文献

外文文献
中文文献
专利

1. Recognition of Spoken Bengali Numerals Using MLP, SVM, RF Based Models with PCA Based Feature Summarization [J] . Gupta Avisek, Sarkar Kamal The international arab journal of information technology . 2018,第2期

机译：使用基于MLP，SVM，RF的模型和基于PCA的特征汇总识别孟加拉语数字
2. Convolutional Neural Network based Handwritten Bengali and Bengali-English Mixed Numeral Recognition [J] . M. A. H. Akhand, Mahtab Ahmed, M. M. Hafizur Rahman International Journal of Image, Graphics and Signal Processing . 2016,第9期

机译：基于卷积神经网络的手写孟加拉语和孟加拉-英语混合数字识别
3. Improved low-cost recognition system for handwritten Bengali numerals [J] . Md Aktaruzzaman, Tewodros Mulugeta Dagnew, Massimo Walter Rivolta, International Journal of Computer Applications in Technology . 2020,第4期

机译：提高手写孟加拉数字的低成本识别系统
4. On recognition of spoken Bengali numerals [C] . 2010 International Conference on Computer Information Systems and Industrial Management Applications . 2010

机译：关于孟加拉语口语数字的识别
5. A speech recognition IC with an efficient MFCC extraction algorithm and multi-mixture models. [D] . Han, Wei. 2006

机译：具有高效MFCC提取算法和多混合模型的语音识别IC。
6. Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions [O] . Feifan Liu, Gokhan Tur, Dilek Hakkani-Tür, 2011

机译：走向口语临床问题的答案：针对口语临床问题评估和改编自动语音识别系统
7. BDNet: Bengali Handwritten Numeral Digit Recognition based on Densely connected Convolutional Neural Networks [O] . Abu Sufian, Anirudha Ghosh, Avijit Naskar, 2020

机译：BDNET：孟加拉手写数字位数识别浓密连接的卷积神经网络

Bengali Spoken Numerals Recognition by MFCC and GMM Technique

摘要

著录项

相似文献

相关主题

期刊订阅