首页> 外文会议>IEEE International Conference on Research in Computational Intelligence and Communication Networks >Identification of top-3 spoken Indian languages: An Ensemble learning-based approach
【24h】

Identification of top-3 spoken Indian languages: An Ensemble learning-based approach

机译:识别前3名英语印度语言:基于集合学习的方法

获取原文

摘要

Speech recognition has developed considerably for English but there has not been much development in Indie languages. Speech Recognition in Indic languages is itself challenging which complicates even more in multilingual scenario. There is a pressing need for Indic speech recognition systems and a fully functional variant of the same is yet to be developed. One reason for this is the multi lingual nature of our country in addition to the complexity of the Indic languages. It is very much important to identify the language specific segments from multi lingual speech before attempting recognition. In this paper, we have presented a system to segregate the top 3 spoken languages in India encompassing English, Hindi and Bangla. We have experimented with segregation of Bangla alone from the 3 languages as well driven by the motivation that Bangla is our mother tongue. Experiments were performed on more than 24 hours of data and highest accuracies of 97.13% and 96.44% has been obtained in segregating Bangla from the rest and trilingual segregation respectively with MFCC-based features coupled with Ensemble learning-based classification.
机译:语音识别的英语显着发展,但Indie语言没有太大的发展。广告语言中的语音识别本身挑战,在多语言场景中更加复杂化。对于指示器的语音识别系统,尚未开发相同的功能变体。除了上线语言的复杂性之外,这是我们国家的多语言性质的一个原因。在尝试识别之前,从多语言语音中识别语言特定段是非常重要的。在本文中,我们介绍了一个系统,在印度分离印度的前3名口语,包括英语,印地语和孟加拉。我们已经尝试了孟加拉的分离,因为孟加拉是我们的母语的动机的动力,因此来自3种语言。实验是在24小时的数据上进行的,并且在分别与基于MFCC的特征与基于集合学习的分类的基于MFCC的特征的特征分别获得了97.13%和96.44%的最高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号