首页> 外文会议>IEEE International Conference on Cloud Computing in Emerging Markets >Pronunciation Training on Isolated Kannada Words Using 'Kannada Kali' - A Cloud Based Smart Phone Application
【24h】

Pronunciation Training on Isolated Kannada Words Using 'Kannada Kali' - A Cloud Based Smart Phone Application

机译:使用“Kannada Kali”孤立的Kannada单词的发音培训 - 基于云的智能手机应用程序

获取原文

摘要

Automated feedback on pronunciation system on a smart phone is useful for a student trying to learn a new language at his or her own pace. The objective of our re-search is to implement a pronunciation training system with minimal language specific data. Our proposed system consists of an Android application as a front-end, and a pronunciation evaluation and mispronunciation detection framework as the back-end hosted on a cloud. We conduct our experiments on spoken isolated words in Kannada. Our pronunciation evaluation(for spoken word) implementation on the cloud involves training a classifier with features from Dynamic Time Warping (DTW) with Mel Frequency Cepstral Coefficients (MFCC) and Line Spectral Frequencies (LSF) and, without directly on LSF (without DTW). We study the performance of different machine learning algorithms for pronunciation rating. We propose a novel semi-supervised approach for detecting mispronounced segments of a word using Self Organizing Maps (SOM) that are also deployed on the cloud. Our implementation of SOM learns the features of an automatically segmented reference speech. The trained SOM is then used to determine the deviations in the learner's pronunciation. We evaluate our system on 1169 Kannada audio samples from students around 18 to 25 years of age. The Kannada words considered are taken from textbooks of first and second grade (considering learners as beginners who do not know Kannada) and include 2 to 5 syllable words. We report accuracy on binary classification and multi-class classification for different classifiers. The mispronounced segments detected using SOM correlate with the human ratings. Our approach of pronunciation evaluation and mispronunciation detection is based on minimal data and does not require a speech recognition system.
机译:关于智能手机的发音系统的自动反馈对于尝试以他或她自己的步伐学习新语言的学生非常有用。我们的重新搜索的目的是实现具有最小语言特定数据的发音系统。我们建议的系统包括一个Android应用程序作为前端,以及作为云上托管的后端的发音评估和错误发布检测框架。我们对坎卡达的口语孤立词进行了实验。我们(对口头语言)的云实施的发音评估涉及对LSF培训与动态时间规整(DTW)与梅尔频率倒谱系数(MFCC)和线频谱频率(LSF)和功能分类,而不直接(不DTW) 。我们研究不同机器学习算法的发音等级的性能。我们提出了一种新的半监督方法,用于使用在云上部署的自组织地图(SOM)来检测一个单词的错误分子段。我们的SOM实现了解自动分段的参考语音的功能。然后使用训练的SOM来确定学习者发音中的偏差。我们在1169名kannada音频样本中评估了从学生18至25岁的学生的系统。考虑到第一和二年级的教科书(将学员视为不了解Kannaada的初学者),包括2到5个音节词。我们报告不同分类器的二进制分类和多级分类的准确性。使用SOM与人类评级相关检测到的错位段。我们的发音评估方法和错误发布检测方法基于最小数据,不需要语音识别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号