【24h】

Band-independent speech-event categories for TRAP based ASR

机译:基于TRAP的ASR的与频段无关的语音事件类别

获取原文
获取原文并翻译 | 示例

摘要

Band-independent categories are investigated for feature estimation in ASR. These categories represent distinct speech-events manifested in frequency-localized temporal patterns of the speech signal. A universal, single estimator is proposed for estimating speech-event posterior probabilities using temporal patterns of critical-band energies for all the bands. The estimated posteriors are used as the input features (referred to as speech-event features) to a back-end recognizer. These features are evaluated on continuous OGI-Digits task. The features are also evaluated on Aurora-2 and Aurora-3 tasks in a Distributed Speech Recognition (DSR) framework. These features are compared with earlier proposed broad-phonetic TRAPs features estimated from temporal patterns using independent estimators in each critical-band.
机译:研究了与频段无关的类别,以进行ASR中的特征估计。这些类别表示在语音信号的频率局部时间模式中体现的独特语音事件。提出了一种通用的单一估计器,用于使用所有频带的临界频带能量的时间模式来估计语音事件后验概率。估计的后验用作后端识别器的输入特征(称为语音事件特征)。这些功能是在连续OGI-数字任务上评估的。还可以在分布式语音识别(DSR)框架中针对Aurora-2和Aurora-3任务评估这些功能。使用每个关键频带中的独立估计器,将这些特征与从时间模式估计的较早提出的宽语音TRAPs特征进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号