首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets
【24h】

A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets

机译:基于冗余谱-时态特征集的有限词汇语音识别范式

获取原文

摘要

Speech recognition techniques have come to rely almost completely on HMM based frameworks. In this paper, we present a novel paradigm for small-vocabulary speech recognition based on a recently proposed word spotting technique. Recent work using discriminative classifiers with ordered spectro-temporal features to detect the presence of keywords obtained encouraging improvements over HMM-based models. We propose to extend this approach to recognize continuous speech in our work. Our method uses discriminative models to predict which words are present in a speech signal and hypothesize their locations. A graph search using dynamic programming is then used to obtain the most likely sequence of words from the hypothesis set produced as a result of combining the results from the discriminative word classifiers. While this approach doesn't perform as well as state-of-the-art ASR systems, it can be particularly useful for languages with small amounts of annotated data available.
机译:语音识别技术几乎完全依赖于基于HMM的框架。在本文中,我们提出了一种基于最近提出的单词发现技术的小词汇语音识别的新颖范例。最近使用区分式分类器和有序的光谱时空特征来检测关键字的存在的工作取得了令人鼓舞的改进,优于基于HMM的模型。我们建议将这种方法扩展为认可我们工作中的连续讲话。我们的方法使用判别模型来预测语音信号中存在哪些单词并假设其位置。然后使用动态编程进行图搜索,以从假设集获得最有可能的单词序列,该假设集是将区分性词分类器的结果组合而成的。尽管这种方法的性能不如最新的ASR系统,但对于带有少量可用注释数据的语言而言,它尤其有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号