首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data
【24h】

Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data

机译:对基于示例的语音识别功能的研究,涉及数千小时的无监督,嘈杂的数据

获取原文

摘要

The acoustic models in state-of-the-art speech recognition systems are based on phones in context that are represented by hidden Markov models. This modeling approach may be limited in that it is hard to incorporate long-span acoustic context. Exemplar-based approaches are an attractive alter-native, in particular if massive data and computational power are available. Yet, most of the data at Google are unsupervised and noisy. This paper investigates an exemplar-based approach under this yet not well understood data regime. A log-linear rescoring framework is used to combine the exemplar-based features on the word level with the first-pass model. This approach guarantees at least baseline performance and focuses on the refined modeling of words with sufficient data. Experimental results for the Voice Search and the YouTube tasks are presented.
机译:最先进的语音识别系统中的声学模型基于上下文中由隐藏马尔可夫模型表示的电话。这种建模方法可能会受到限制,因为很难合并大跨度的声学环境。基于示例的方法是一种有吸引力的替代方法,特别是在可获得大量数据和计算能力的情况下。但是,Google上的大多数数据都是无监督且嘈杂的。本文研究了在这种尚未充分理解的数据机制下基于示例的方法。使用对数线性计分框架将单词级别的基于示例的功能与首过模型相结合。这种方法至少可以保证基线性能,并专注于具有足够数据的单词的精确建模。给出了语音搜索和YouTube任务的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号