Identification of top-3 spoken Indian languages: An Ensemble learning-based approach

机译：识别前3名英语印度语言：基于集合学习的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech recognition has developed considerably for English but there has not been much development in Indie languages. Speech Recognition in Indic languages is itself challenging which complicates even more in multilingual scenario. There is a pressing need for Indic speech recognition systems and a fully functional variant of the same is yet to be developed. One reason for this is the multi lingual nature of our country in addition to the complexity of the Indic languages. It is very much important to identify the language specific segments from multi lingual speech before attempting recognition. In this paper, we have presented a system to segregate the top 3 spoken languages in India encompassing English, Hindi and Bangla. We have experimented with segregation of Bangla alone from the 3 languages as well driven by the motivation that Bangla is our mother tongue. Experiments were performed on more than 24 hours of data and highest accuracies of 97.13% and 96.44% has been obtained in segregating Bangla from the rest and trilingual segregation respectively with MFCC-based features coupled with Ensemble learning-based classification.

机译：语音识别的英语显着发展，但Indie语言没有太大的发展。广告语言中的语音识别本身挑战，在多语言场景中更加复杂化。对于指示器的语音识别系统，尚未开发相同的功能变体。除了上线语言的复杂性之外，这是我们国家的多语言性质的一个原因。在尝试识别之前，从多语言语音中识别语言特定段是非常重要的。在本文中，我们介绍了一个系统，在印度分离印度的前3名口语，包括英语，印地语和孟加拉。我们已经尝试了孟加拉的分离，因为孟加拉是我们的母语的动机的动力，因此来自3种语言。实验是在24小时的数据上进行的，并且在分别与基于MFCC的特征与基于集合学习的分类的基于MFCC的特征的特征分别获得了97.13％和96.44％的最高精度。

著录项

来源
《IEEE International Conference on Research in Computational Intelligence and Communication Networks》|2018年|292p|共6页
会议地点
作者
Himadri Mukherjee; Ankita Dhar; Sk Md Obaidullah; K. C. Santosh; Santanu Phadikar; Kaushik Roy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Speech recognition; Language identification; MFCC; Ensemble learning;

机译：语音识别;语言识别;MFCC;集合学习;

相似文献

外文文献
中文文献
专利

1. Spoken Indian language identification: a review of features and databases [J] . BAKSHI AARTI, SUNIL KUMAR KOPPARAPU Sadhana . 2018,第4期

机译：口语印度语言识别：功能和数据库的审查
2. Spoken Language Identification using Gaussian Mixture Model-Universal Background Model in Indian Context [J] . Sreedhar Potla, Vishnu Vardhan B. International Journal of Applied Engineering Research . 2018,第5aPta4期

机译：在印度语境中使用高斯混合模型 - 通用背景模型的口语语言识别
3. Spoken Indian language identification: a review of features and databases [J] . Aarti Bakshi, Kopparapu Sunil Kumar Sadhana: Academy Proceedings in Engineering Science . 2018,第4期

机译：口语印度语言识别：对功能和数据库的审查
4. Identification of top-3 spoken Indian languages: An Ensemble learning-based approach [C] . Himadri Mukherjee, Ankita Dhar, Sk Md Obaidullah, IEEE International Conference on Research in Computational Intelligence and Communication Networks . 2018

机译：识别前3名英语印度语言：基于集合学习的方法
5. Spoken Language Identification from Processing and Pattern Analysis of Spectrograms. [D] . Ford, George H., Jr. 2014

机译：频谱图的处理和模式分析中的口头语言识别。
6. Lets All Speak Together! Exploring the Masking Effects of Various Languages on Spoken Word Identification in Multi-Linguistic Babble [O] . Aurore Gautreau, Michel Hoen, Fanny Meunier -1

机译：让我们一起讲话！探索多种语言在多语言Ba语中对口语识别的掩蔽效果
7. Phonotactic Model for Spoken Language Identification in Indian Language Perspective [O] . Sanghamitra Mohanty 2011

机译：印度语言视野下的口语识别语音模型

Identification of top-3 spoken Indian languages: An Ensemble learning-based approach

摘要

著录项

相似文献

相关主题

期刊订阅