首页> 外文会议>2017 2nd IEEE International Conference on Recent Trends in Electronics, Information amp; Communication Technology >Creating language and acoustic models using Kaldi to build an automatic speech recognition system for Kannada language
【24h】

Creating language and acoustic models using Kaldi to build an automatic speech recognition system for Kannada language

机译:使用Kaldi创建语言和声学模型,以构建针对卡纳达语的自动语音识别系统

获取原文
获取原文并翻译 | 示例

摘要

In this paper, creation of the Language Models (LMs) and Acoustic Models (AMs) using Kaldi speech recognition toolkit to build a robust Automatic Speech Recognition (ASR) system for Kannada language is demonstrated. The speech data is collected from the farmers of Karnataka under uncontrolled environment is used for the development of ASR models. The collected speech data needs to be translated to machine level language and hence the Indic Language Transliteration Tool (IT3 to UTF-8) is used for transcription. The dictionary for the collected speech data is created by using Indian Language Speech sound Label (ILSL12) set. The AMs are created by using Gaussian Mixture Model (GMM) and Subspace GMM (SGMM). The 80% and 20% of validated speech data is used for training and testing respectively. The accuracy and Word Error Rate (WER) of ASR models are highlighted and discussed in this work. The developed ASR models can be used in spoken query system which enables the farmers to access the on time agricultural commodity prices and weather information in Kannada language.
机译:在本文中,演示了使用Kaldi语音识别工具包创建语言模型(LMs)和声学模型(AMs),以构建强大的卡纳达语自动语音识别(ASR)系统的方法。语音数据是在不受控制的环境下从卡纳塔克邦的农民那里收集的,用于ASR模型的开发。收集的语音数据需要翻译为机器语言,因此使用印度语音译工具(IT3至UTF-8)进行转录。通过使用印度语语音标签(ILSL12)集创建用于收集的语音数据的字典。通过使用高斯混合模型(GMM)和子空间GMM(SGMM)创建AM。经过验证的语音数据的80%和20%分别用于训练和测试。 ASR模型的准确性和字错误率(WER)在本文中得到强调和讨论。所开发的ASR模型可用于口语查询系统,使农民能够以卡纳达语访问准时农产品价格和天气信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号