首页> 外文会议>Signal and Image Processing >THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING
【24h】

THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING

机译:嵌入式语音拨号的动态可调整直方图修剪方法

获取原文
获取原文并翻译 | 示例

摘要

Memory and speed are two key factors that must be faced when applying voice dialer to Pocket PCs. To provide a solution, a novel decoding method integrated with the score differences of token paths is proposed, named as "Dynamically-Adjustable Histogram Pruning". Additionally, the computation of likelihood score is accelerated by means of dynamic score lookup table. Furthermore, a new acoustic modeling method based on Extended Initial/Final (XIF) with less dimensioned acoustic feature is proven suitable for embedded speech command recognition. By adopting the methods developed above, we implement a speaker-independent, user definable voice dialing speech recognition system with good performance on a real PDA device. For a 200-Chinese-word vocabulary, its recognition accuracy reaches 97.80%. Meanwhile, it obtains better recognition speed by 80 times and saves decoding space by 30% in comparison to the baseline system using standard Viterbi decoding method.
机译:内存和速度是将语音拨号器应用于Pocket PC时必须面对的两个关键因素。为了提供一种解决方案,提出了一种与令牌路径的分数差异相集成的新颖解码方法,称为“动态可调直方图修剪”。另外,借助动态得分查找表来加速似然得分的计算。此外,事实证明,基于扩展初始/最终(XIF)的具有较小尺寸声学特征的新声学建模方法适用于嵌入式语音命令识别。通过采用以上开发的方法,我们在真实的PDA设备上实现了性能良好的独立于扬声器的,用户可定义的语音拨号语音识别系统。对于200个中文单词的词汇,其识别准确率达到97.80%。同时,与使用标准Viterbi解码方法的基准系统相比,它获得了80倍的更好识别速度,并节省了30%的解码空间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号