Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks

Gurpreet Kaur; Mohit Srivastava; Amod Kumar

首页> 外文期刊>Journal of Telecommunications and Information Technology >Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks

【24h】

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks

机译：使用深神经网络组合扬声器和语音识别的遗传算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Huge growth is observed in the speech and speaker recognition field due to many artificial intelligence algorithms being applied. Speech is used to convey messages via the lan- guage being spoken, emotions, gender and speaker identity. Many real applications in healthcare are based upon speech and speaker recognition, e.g. a voice-controlled wheelchair helps control the chair. In this paper, we use a genetic algo- rithm (GA) for combined speaker and speech recognition, rely- ing on optimized Mel Frequency Cepstral Coefficient (MFCC) speech features, and classification is performed using a Deep Neural Network (DNN). In the first phase, feature extraction using MFCC is executed. Then, feature optimization is per- formed using GA. In the second phase training is conducted using DNN. Evaluation and validation of the proposed work model is done by setting a real environment, and efficiency is calculated on the basis of such parameters as accuracy, precision rate, recall rate, sensitivity, and specificity. Also, this paper presents an evaluation of such feature extraction methods as linear predictive coding coefficient (LPCC), per- ceptual linear prediction (PLP), mel frequency cepstral coef- ficients (MFCC) and relative spectra filtering (RASTA), with all of them used for combined speaker and speech recogni- tion systems. A comparison of different methods based on existing techniques for both clean and noisy environments is made as well.

机译：由于应用了许多人工智能算法，在语音和扬声器识别领域中观察到巨大的增长。演讲用于通过口语，情绪，性别和扬声器身份传达信息。医疗保健中的许多真实应用都基于言语和扬声器识别，例如言语和扬声器识别。语音控制的轮椅有助于控制椅子。在本文中，我们使用用于组合扬声器和语音识别的遗传算法（GA），依赖于优化的MEL频率谱系统系数（MFCC）语音特征，并且使用深神经网络（DNN）进行分类。在第一阶段，执行使用MFCC的特征提取。然后，使用GA每种特征优化。在第二阶段训练中使用DNN进行。通过设定真实环境来完成所提出的工作模型的评估和验证，并且基于这种参数计算效率，作为准确性，精度率，召回率，灵敏度和特异性。此外，本文介绍了这种特征提取方法的评估，作为线性预测编码系数（LPCC），肌发线性预测（PLP），MEL频率谱系核心核心（MFCC）和相对谱滤波（RASTA），所有它们用于组合扬声器和语音识别系统。还制作了基于现有技术的不同方法的比较。

著录项

来源
《Journal of Telecommunications and Information Technology》 |2018年第2期|共9页
作者
Gurpreet Kaur; Mohit Srivastava; Amod Kumar;
展开▼
作者单位

I. K. Gujral Punjab Technical University Kapurthala Jalandhar India;

Chandigarh Engineering College Landran Mohali Punjab India;

Central Scientific Instruments Organisation Chandigarh India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计量学;
关键词
deep neural network; genetic algorithm; LPCC; MFCC; PLP; RASTA-PLP; speaker recognition; speech recognition;

机译：深神经网络;遗传算法;LPCC;MFCC;PLP;rasta-plp;扬声器识别;语音识别;

相似文献

外文文献
中文文献
专利

1. Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks [J] . Gurpreet Kaur, Mohit Srivastava, Amod Kumar Journal of Telecommunications and Information Technology . 2018,第2期

机译：使用深神经网络组合扬声器和语音识别的遗传算法
2. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern recognition letters . 2017,第octa15期

机译：基于深度神经网络的语音识别和说话人自适应的插件最大后验解码器的分层贝叶斯组合
3. A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Neurocomputing . 2016,第DECa19期

机译：深度神经网络转移学习的统一方法及其在自动语音识别中的说话人自适应中的应用
4. Neural network speaker dependent isolated Malay speech recognition system: handcrafted vs genetic algorithm [C] . Salam, M.S.H., Mohamad, . 2001

机译：神经网络与说话人相关的孤立马来语语音识别系统：手工与遗传算法
5. Selecting neural network topologies: A hybrid approach combining genetic algorithms and neural networks. [D] . Taylor, Christopher M. 2005

机译：选择神经网络拓扑：结合遗传算法和神经网络的混合方法。
6. Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network [O] . Misbah Farooq, Fawad Hussain, Naveed Khan Baloch, 2020

机译：利用深卷积神经网络对语音情感识别的特征选择算法的影响
7. Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature [O] . Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara 2015

机译：结合了深度神经网络和深度自动编码器的混响语音识别，并增强了电话类功能

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅