首页> 外文会议>International Conference on Control, Automation and Information Sciences >Vietnamese speech recognition using Dynamic Time Warping and Coefficient of Correlation
【24h】

Vietnamese speech recognition using Dynamic Time Warping and Coefficient of Correlation

机译:动态时间规整和相关系数的越南语语音识别

获取原文
获取外文期刊封面目录资料

摘要

The hidden Markov model is supposed as the most common and effective method used in speech recognition for all languages including Vietnamese. However, this method is quite cumbersome and difficult to implement in many embedded systems that have limited resources. Dynamic Time Warping (DTW) method, whereas, has been in much study by many scientists and is proved to be simple and efficient for a relatively small set of words (about 100 words). Though, this method has not been investigated for Vietnamese. This paper will present the investigation result of the combination between Dynamic Time Warping and Correlative Coefficient in Vietnamese speech recognition dependent with the speaker. The vocabulary to be recognized and trained are 124 words. The training data are recorded from 7 people (4 men and 3 women), with four recording time in noise free environment. The recognition outcome achieves the accuracy above 90% on average. In some control areas, such as computer and television control, this accuracy is very promising. DTW, therefore, is proposed as a simple and efficient for Vietnamese speech recognition in many simple control systems.
机译:隐马尔可夫模型被认为是用于包括越南语在内的所有语言的语音识别中最常用和最有效的方法。但是,这种方法非常麻烦并且难以在许多资源有限的嵌入式系统中实现。然而,动态时间规整(DTW)方法已经由许多科学家进行了很多研究,并被证明对于相对较少的一组单词(大约100个单词)而言既简单又有效。但是,越南尚未对这种方法进行调查。本文将介绍依赖于说话者的越南语语音识别中动态时间规整与相关系数的组合的调查结果。要识别和训练的词汇为124个单词。记录了来自7个人(4名男性和3名女性)的培训数据,其中有四次记录时间是在无噪音的环境中进行的。识别结果平均达到90%以上的准确性。在某些控制领域,例如计算机和电视控制,这种准确性非常有前途。因此,在许多简单的控制系统中,DTW被提出为越南语音识别的一种简单而有效的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号