Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data

机译：对基于示例的语音识别功能的研究，涉及数千小时的无监督，嘈杂的数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The acoustic models in state-of-the-art speech recognition systems are based on phones in context that are represented by hidden Markov models. This modeling approach may be limited in that it is hard to incorporate long-span acoustic context. Exemplar-based approaches are an attractive alter-native, in particular if massive data and computational power are available. Yet, most of the data at Google are unsupervised and noisy. This paper investigates an exemplar-based approach under this yet not well understood data regime. A log-linear rescoring framework is used to combine the exemplar-based features on the word level with the first-pass model. This approach guarantees at least baseline performance and focuses on the refined modeling of words with sufficient data. Experimental results for the Voice Search and the YouTube tasks are presented.

机译：最先进的语音识别系统中的声学模型基于上下文中由隐藏马尔可夫模型表示的电话。这种建模方法可能会受到限制，因为很难合并大跨度的声学环境。基于示例的方法是一种有吸引力的替代方法，特别是在可获得大量数据和计算能力的情况下。但是，Google上的大多数数据都是无监督且嘈杂的。本文研究了在这种尚未充分理解的数据机制下基于示例的方法。使用对数线性计分框架将单词级别的基于示例的功能与首过模型相结合。这种方法至少可以保证基线性能，并专注于具有足够数据的单词的精确建模。给出了语音搜索和YouTube任务的实验结果。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.4437- 4440|共4页
会议地点 Kyoto(JP)
作者
Heigold, Georg;
展开▼
作者单位

Google Inc. USA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A STATISTICAL ANALYSIS ON THE IMPACT OF SPEECH ENHANCEMENT TECHNIQUES ON THE FEATURE VECTORS OF NOISY SPEECH SIGNALS FOR SPEECH RECOGNITION [J] . SWAPNANIL GOGOI, UTPAL BHATTACHARJEE Journal of computer science engineering and information technology research . 2016,第3期

机译：语音增强技术对语音识别中嘈杂语音信号特征向量影响的统计分析
2. A STATISTICAL ANALYSIS ON THE IMPACT OF SPEECH ENHANCEMENT TECHNIQUES ON THE FEATURE VECTORS OF NOISY SPEECH SIGNALS FOR SPEECH RECOGNITION [J] . SWAPNANIL GOGOI, UTPAL BHATTACHARJEE Journal of computer science engineering and information technology research . 2016,第3期

机译：语音增强技术对语音识别中嘈杂语音信号特征向量影响的统计分析
3. Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments [J] . Boril H., Hansen J.H.L. Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第6期

机译：嘈杂不利环境下伦巴第效应的语音识别无监督均衡
4. Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data [C] . Heigold Georg IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：对基于示例性的特征对数千小时的无监督，嘈杂的数据进行调查
5. An unsupervised method for speech detection and segmentation in noisy environments using the parametric trajectory model. [D] . Galligan, Shane. 2006

机译：使用参数轨迹模型在嘈杂环境中进行语音检测和分段的无监督方法。
6. Correction: Computational Phenotype Discovery Using Unsupervised Feature Learning over Noisy Sparse and Irregular Clinical Data [O] . Thomas A. Lasko, Joshua C. Denny, Mia A. Levy -1

机译：校正：使用基于无监督特征学习的嘈杂稀疏和不规则临床数据进行计算表型发现
7. INVESTIGATIONS ON EXEMPLAR-BASED FEATURES FOR SPEECH RECOGNITION TOWARDS THOUSANDS OF HOURS OF UNSUPERVISED, NOISY DATA [O] . Georg Heigold, Patrick Nguyen, Mitchel Weintraub, 2013

机译：对数千小时未经监督的嘈杂数据进行语音识别的基于示例功能的调查

Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data

摘要

著录项

相似文献

相关主题

期刊订阅