首页> 外文会议>International Conference on Text, Speech and Dialogue >Real-Time Vocal Tract Length Normalization in a Phonological Awareness Teaching System
【24h】

Real-Time Vocal Tract Length Normalization in a Phonological Awareness Teaching System

机译:语音意识教学系统中实时声乐长度归一化

获取原文

摘要

Speaker normalization in a speech recognition can significantly improve speech recognition accuracy. One such method, vocal tract length normalization (VTLN), is especially useful when the system has to work reliably for males, females and children. It is just this situation with our phonological awareness teaching system, the "SpeechMaster", which aims at real-time phoneme recognition and feedback. As most VTLN algorithms work off-line, this poses the additional problem of real-time operation. This paper examines how a well-known off-line algorithm can be approximated on-line by machine learning regression techniques. We conclude that, by employing a real-time estimation of VTLN parameters, the recognition error can be reduced by some 14-24 %.
机译:语音识别中的扬声器归一化可以显着提高语音识别准确性。一种这样的方法,声带长度归一化(VTLN),当系统必须可靠地为男性,女性和儿童工作而尤其有用。这是我们的语音意识教学系统,“汉语讲师”的这种情况,旨在实时音素识别和反馈。作为大多数VTLN算法的工作离线,这会带来实时操作的额外问题。本文介绍了如何通过机器学习回归技术在线近似着名的离线算法。我们得出结论,通过采用VTLN参数的实时估计,识别误差可以减少约14-24%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号