首页> 外文会议>2011 4th International Congress on Image and Signal Processing >Adaptive frequency cepstral coefficients for word mispronunciation detection
【24h】

Adaptive frequency cepstral coefficients for word mispronunciation detection

机译:自适应频率倒谱系数,用于单词错误发音检测

获取原文

摘要

Systems based on automatic speech recognition (ASR) technology can provide important functionality in computer assisted language learning applications. This is a young but growing area of research motivated by the large number of students studying foreign languages. Here we propose a Hidden Markov Model (HMM)-based method to detect mispronunciations. Exploiting the specific dialog scripting employed in language learning software, HMMs are trained for different pronunciations. New adaptive features have been developed and obtained through an adaptive warping of the frequency scale prior to computing the cepstral coefficients. The optimization criterion used for the warping function is to maximize separation of two major groups of pronunciations (native and non-native) in terms of classification rate. Experimental results show that the adaptive frequency scale yields a better coefficient representation leading to higher classification rates in comparison with conventional HMMs using Mel-frequency cepstral coefficients.
机译:基于自动语音识别(ASR)技术的系统可以在计算机辅助语言学习应用程序中提供重要的功能。这是一个年轻但正在增长的研究领域,其动机是大量学习外语的学生。在这里,我们提出了一种基于隐马尔可夫模型(HMM)的方法来检测错误发音。利用语言学习软件中使用的特定对话框脚本,HMM接受了针对不同发音的培训。在计算倒频谱系数之前,通过对频率标度进行自适应变形,已经开发并获得了新的自适应功能。用于变形函数的优化标准是在分类率方面最大程度地区分两个主要组的语音(本机和非本机)。实验结果表明,与使用梅尔频率倒谱系数的传统HMM相比,自适应频率标度产生了更好的系数表示,从而导致更高的分类率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号