首页> 外文会议>International Conference on Mathematical Optimization Theory and Operations Research >Optimization of Gain in Symmetrized Itakura-Saito Discrimination for Pronunciation Learning
【24h】

Optimization of Gain in Symmetrized Itakura-Saito Discrimination for Pronunciation Learning

机译:对称式板仓斋藤语音学习增益的优化

获取原文

摘要

This paper considers an assessment and evaluation of the pronunciation quality in computer-aided language learning systems. We propose the novel distortion measure for speech processing by using the gain optimization of the symmetrized Itakura-Saito divergence. This dissimilarity is implemented in a complete algorithm for pronunciation learning and improvement. At its first stage, a user has to achieve a stable pronunciation of all sounds by matching them with sounds of an ideal speaker. At the second stage, the recognition of sounds and their short sequences is carried out to guarantee the distinguishability of learned sounds. The training set may contain not only ideal sounds but the best utterances of a user obtained at the previous step. Finally, the word recognition accuracy is estimated by using deep neural networks fine-tuned on the best words from a user. Experimental study shows that the proposed procedure makes it possible to achieve high efficiency for learning of sounds and their sequences even in the presence of noise in an observed utterance.
机译:本文考虑了计算机辅助语言学习系统中语音质量的评估和评估。我们提出了一种新颖的失真度量,用于语音处理,它利用对称的Itakura-Saito发散的增益优化。这种差异在用于语音学习和改进的完整算法中得以实现。在第一阶段,用户必须通过将所有声音与理想扬声器的声音进行匹配来实现所有声音的稳定发音。在第二阶段,对声音及其短序列进行识别,以确保学习到的声音的可区分性。训练集不仅可以包含理想声音,还可以包含在上一步中获得的用户的最佳发声。最后,通过使用深度神经网络对用户的最佳单词进行微调,可以估算单词识别的准确性。实验研究表明,即使在观察到的话语中存在噪声的情况下,所提出的程序也有可能实现学习声音及其顺序的高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号