【24h】

Test of several external posterior weighting functions for multiband Full Combination ASR

机译:多频带全组合ASR的几种外部后验加权功能的测试

获取原文

摘要

Information about speech reliability can be extracted and hen integrated in a recogniser by various means. The full combination (FC) approach allows the wieghting of the posterior values estiamted locally in the time frequency representation, according a speech reliability measure. Since most of the speech segments are voiced, we use a method exploiting the harmonicity of speech tos derive these weights. We test this method together with the direct integration of the a priori SNR. Then, we run speech recognition iwth differnet kind of weighting functions. The weights are continuous or binary values. Thsi corresponds to a soft or to a hard decision function about the speech reliability, which is derived from an observable harmonicity index. Using a binary decision process, the effect is, for each tiem frame, to collapse the set of combinations of sub-bands into a single combination. On the other hand, we substitue empirical values to these terms, including functions of the a priori SNR, which are continuous or discrete, but not based on a probabilistic estiamtion. We establish the average scores in
机译:可以通过各种方式提取有关语音可靠性的信息,并将其集成到识别器中。完全组合(FC)方法允许根据语音可靠性度量对在时频表示中局部估计的后验值进行加权。由于大多数语音段都是有声音的,因此我们使用一种利用语音谐波的方法来导出这些权重的方法。我们将这种方法与先验SNR的直接积分一起进行测试。然后,我们使用不同种类的加权函数运行语音识别。权重是连续值或二进制值。这对应于关于语音可靠性的软判决或硬判决函数,其从可观察的谐波指数得出。使用二进制决策过程,对于每个tiem帧,其效果是将子带组合的集合折叠为单个组合。另一方面,我们用经验值代替这些术语,包括先验SNR的函数,这些函数是连续的或离散的,但不基于概率估计。我们在

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号