Creating language and acoustic models using Kaldi to build an automatic speech recognition system for Kannada language

机译：使用Kaldi创建语言和声学模型，以构建针对卡纳达语的自动语音识别系统

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, creation of the Language Models (LMs) and Acoustic Models (AMs) using Kaldi speech recognition toolkit to build a robust Automatic Speech Recognition (ASR) system for Kannada language is demonstrated. The speech data is collected from the farmers of Karnataka under uncontrolled environment is used for the development of ASR models. The collected speech data needs to be translated to machine level language and hence the Indic Language Transliteration Tool (IT3 to UTF-8) is used for transcription. The dictionary for the collected speech data is created by using Indian Language Speech sound Label (ILSL12) set. The AMs are created by using Gaussian Mixture Model (GMM) and Subspace GMM (SGMM). The 80% and 20% of validated speech data is used for training and testing respectively. The accuracy and Word Error Rate (WER) of ASR models are highlighted and discussed in this work. The developed ASR models can be used in spoken query system which enables the farmers to access the on time agricultural commodity prices and weather information in Kannada language.

机译：在本文中，演示了使用Kaldi语音识别工具包创建语言模型（LMs）和声学模型（AMs），以构建强大的卡纳达语自动语音识别（ASR）系统的方法。语音数据是在不受控制的环境下从卡纳塔克邦的农民那里收集的，用于ASR模型的开发。收集的语音数据需要翻译为机器语言，因此使用印度语音译工具（IT3至UTF-8）进行转录。通过使用印度语语音标签（ILSL12）集创建用于收集的语音数据的字典。通过使用高斯混合模型（GMM）和子空间GMM（SGMM）创建AM。经过验证的语音数据的80％和20％分别用于训练和测试。 ASR模型的准确性和字错误率（WER）在本文中得到强调和讨论。所开发的ASR模型可用于口语查询系统，使农民能够以卡纳达语访问准时农产品价格和天气信息。

著录项

来源
《2017 2nd IEEE International Conference on Recent Trends in Electronics, Information amp; Communication Technology》|2017年|161-165|共5页
会议地点 Bangalore(IN)
作者
Yadava G Thimmaraja; H S Jayanna;
展开▼
作者单位

Department of Electronics and Communication Engineering Siddaganga Institute of Technology, Tumakuru, Karnataka, India;

Department of Information Science and Engineering Siddaganga Institute of Technology, Tumakuru, Karnataka, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech; Hidden Markov models; Speech recognition; Training; Testing; Tools; Dictionaries;

机译：语音;隐马尔可夫模型;语音识别;培训;测试;工具;词典;;

相似文献

外文文献
中文文献
专利

1. Automatic speech recognition system with pitch dependent features for Punjabi language on KALDI toolkit [J] . Guglani Jyoti, Mishra A. N. Applied Acoustics . 2020,第Octa期

机译：在Kaldi Toolkit上的Punjabi语言具有音调依赖功能的自动语音识别系统
2. Creation and Comparison of Language and Acoustic Models Using Kaldi for Noisy and Enhanced Speech Data [J] . Thimmaraja Yadava G, H S Jayanna International Journal of Intelligent Systems and Applications . 2018,第3期

机译：使用Kaldi噪声和增强语音数据创建和比较语言和声学模型
3. Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling [J] . G. Thimmaraja Yadava, H. S. Jayanna International journal of speech technology . 2020,第1期

机译：通过背景噪声消除和替代声学建模增强了自动Kannada语音识别系统
4. Creating language and acoustic models using Kaldi to build an automatic speech recognition system for Kannada language [C] . Yadava G Thimmaraja, H S Jayanna IEEE International Conference on Recent Trends in Electronics, Information Communication Technology . 2017

机译：使用KALDI创建语言和声学模型为Kannada语言构建自动语音识别系统
5. Arabic language modeling with stem-derived morphemes for automatic speech recognition. [D] . Heintz, Ilana. 2010

机译：具有词干衍生语素的阿拉伯语言建模，可实现自动语音识别。
6. Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models [O] . A. Paats, T. Alumäe, E. Meister, 2018

机译：一项爱沙尼亚放射线语音识别系统临床表现的回顾性分析：不同声学和语言模型的影响
7. First Automatic Fongbe Continuous Speech Recognition System: Development of Acoustic Models and Language Models [O] . Laleye, Fréjus,, Besacier, Laurent, Ezin, Eugène,, 2016

机译：首款自动Fongbe连续语音识别系统：声学模型和语言模型的发展

Creating language and acoustic models using Kaldi to build an automatic speech recognition system for Kannada language

摘要

著录项

相似文献

相关主题

期刊订阅