首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >CONFIDENCE INDEX DYNAMIC TIME WARPING FOR LANGUAGE-INDEPENDENT EMBEDDED SPEECH RECOGNITION
【24h】

CONFIDENCE INDEX DYNAMIC TIME WARPING FOR LANGUAGE-INDEPENDENT EMBEDDED SPEECH RECOGNITION

机译:信心指数动态时间扭曲语言无关的嵌入式语音识别

获取原文

摘要

Language-independent embedded speech recognition is a necessary and important application. Considering personal privacy, collection difficulty of all the reference words, and limited storage space of mobile devices, language-independent (LI) embedded speech recognition should be classified into lightweight speaker-dependent (SD) cases. Dynamic time warping (DTW) is the state-of-the-art algorithm for small foot-print SD automatic speech recognition. To decrease the high computational complexity of DTW, and to avoid constraints-induced coarse approximation and inaccuracy problems, we introduce a novel confidence index dynamic time warping (CIDTW) approach. CIDTW defines a new cost function, called the confidence index cost function (CICF), to measure the similarity between merged speech training and testing data, while follows the same DTW process. With extensive experiments on three representative SD datasets, CIDTW achieves better accuracy and overall six times faster speeds compared with DTW.
机译:独立语言嵌入式语音识别是必要和重要的应用程序。考虑到个人隐私,所有参考词的收集难度,以及移动设备的有限存储空间,语言无关(LI)嵌入式语音识别应分为轻量级扬声器依赖(SD)案例。动态时间翘曲(DTW)是用于小型脚印SD自动语音识别的最先进的算法。为了降低DTW的高计算复杂性,并且为了避免约束引起的粗略近似和不准确的问题,我们介绍了一种新的置信度指标动态时间翘曲(CIDTW)方法。 CIDTW定义了一种新的成本函数,称为置信度指数成本函数(CICF),以测量合并的语音培训和测试数据之间的相似性,同时遵循相同的DTW过程。在三个代表性的SD数据集中进行了广泛的实验,CIDTW与DTW相比,CIDTW实现了更好的准确性,总体速度速度更快六倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号