首页> 外文会议> >Speech recognition using temporally connected kernels in mixture density hidden Markov models
【24h】

Speech recognition using temporally connected kernels in mixture density hidden Markov models

机译:在混合密度隐藏马尔可夫模型中使用时间相关核的语音识别

获取原文

摘要

A method is presented for speeding up the performance of the HMM based speech recognition system where the states are modeled by a large number of Gaussian kernels. The emission probabilities of the states are usually dominated by the nearest Gaussians to the input vector. The speedup is gained without deteriorating the recognition accuracy by concentrating on these kernels in the reduced K-best-kernel search. In this work, the time information of the input is encoded to the connections of the kernels. The search for the dominating kernels is then performed along the kernel connections which model the trajectories of the speech in the feature space. In the experiments, speaker-dependent speech recognizers were trained for ten speakers. The number of distance computations between feature vectors and kernel mean vectors was reduced 75% without increasing the average phoneme recognition error, which was 5.7% for the baseline system.
机译:提出了一种用于加速基于HMM的语音识别系统的性能的方法,其中状态由大量的高斯核建模。状态的发射概率通常由最接近输入向量的高斯决定。通过集中精力在减少的K-最佳内核搜索中的这些内核上,可以在不降低识别精度的情况下获得加速。在这项工作中,输入的时间信息被编码为内核的连接。然后,沿着对特征空间中语音轨迹建模的内核连接执行对主导内核的搜索。在实验中,针对说话者的语音识别器接受了十位说话者的培训。在不增加平均音素识别误差的情况下,特征向量和核平均向量之间的距离计算次数减少了75%,而基线系统的平均音素识别误差为5.7%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号