首页> 外文会议>IEEE International Conference on Cloud Computing in Emerging Markets >Pronunciation Training on Isolated Kannada Words Using 'Kannada Kali' - A Cloud Based Smart Phone Application
【24h】

Pronunciation Training on Isolated Kannada Words Using 'Kannada Kali' - A Cloud Based Smart Phone Application

机译:使用“ Kannada Kali”-基于云的智能手机应用程序对孤立的卡纳达语单词进行语音训练

获取原文

摘要

Automated feedback on pronunciation system on a smart phone is useful for a student trying to learn a new language at his or her own pace. The objective of our re-search is to implement a pronunciation training system with minimal language specific data. Our proposed system consists of an Android application as a front-end, and a pronunciation evaluation and mispronunciation detection framework as the back-end hosted on a cloud. We conduct our experiments on spoken isolated words in Kannada. Our pronunciation evaluation(for spoken word) implementation on the cloud involves training a classifier with features from Dynamic Time Warping (DTW) with Mel Frequency Cepstral Coefficients (MFCC) and Line Spectral Frequencies (LSF) and, without directly on LSF (without DTW). We study the performance of different machine learning algorithms for pronunciation rating. We propose a novel semi-supervised approach for detecting mispronounced segments of a word using Self Organizing Maps (SOM) that are also deployed on the cloud. Our implementation of SOM learns the features of an automatically segmented reference speech. The trained SOM is then used to determine the deviations in the learner's pronunciation. We evaluate our system on 1169 Kannada audio samples from students around 18 to 25 years of age. The Kannada words considered are taken from textbooks of first and second grade (considering learners as beginners who do not know Kannada) and include 2 to 5 syllable words. We report accuracy on binary classification and multi-class classification for different classifiers. The mispronounced segments detected using SOM correlate with the human ratings. Our approach of pronunciation evaluation and mispronunciation detection is based on minimal data and does not require a speech recognition system.
机译:智能手机上的语音系统自动反馈对于试图按照自己的步调学习新语言的学生很有用。我们研究的目的是用最少的语言特定数据来实现发音训练系统。我们提出的系统由一个作为前端的Android应用程序和一个作为后端托管在云中的语音评估和错误发音检测框架组成。我们对卡纳达语中孤立的口语进行实验。我们在云上实施的语音评估(针对口语单词)涉及训练具有动态时间规整(DTW),梅尔频率倒谱系数(MFCC)和线谱频率(LSF)以及没有直接在LSF(无DTW)的功能的分类器。 。我们研究了不同机器学习算法对语音评级的性能。我们提出了一种新颖的半监督方法,使用自组织图(SOM)来检测单词的发音错误的部分,该自组织图也部署在云中。我们在SOM的实现中学习了自动分段参考语音的功能。然后,将训练有素的SOM用于确定学习者发音的偏差。我们对来自18至25岁的学生的1169个卡纳达语音频样本进行了评估。所考虑的卡纳达语单词取自一年级和二年级的教科书(考虑作为不懂卡纳达语的初学者的学习者),包括2至5个音节单词。我们报告了针对不同分类器的二进制分类和多分类的准确性。使用SOM检测到的错误发音片段与人类评分相关。我们的语音评估和发音错误检测方法基于最少的数据,不需要语音识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号