A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets

Abstract

Speech recognition techniques have come to rely almost completely on HMM-based frameworks. In this paper, we present a novel paradigm for small-vocabulary speech recognition based on a recently proposed word-spotting technique. Recent work that used discriminative classifiers with ordered spectro-temporal features to detect keywords obtained encouraging improvements over HMM-based models. We propose to extend this approach to continuous speech recognition. Our method uses discriminative models to predict which words are present in a speech signal and to hypothesize their locations. A graph search using dynamic programming then finds the most likely word sequence in the hypothesis set obtained by combining the outputs of the discriminative word classifiers. While this approach does not perform as well as state-of-the-art ASR systems, it can be particularly useful for languages with little annotated data available.
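
The abstract does not specify how the hypothesis graph is built or scored, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each word classifier emits hypotheses of the form (word, start frame, end frame, score), that higher scores are better, and that the decoded sequence may not contain overlapping words. The names `Hypothesis` and `best_word_sequence` are hypothetical. Under those assumptions, the dynamic-programming search reduces to picking the highest-scoring chain of non-overlapping hypotheses.

```python
from bisect import bisect_right
from typing import List, NamedTuple, Tuple


class Hypothesis(NamedTuple):
    """One word detection (hypothetical structure, assumed for illustration)."""
    word: str
    start: int    # first frame, inclusive
    end: int      # last frame, exclusive
    score: float  # classifier confidence; higher is better


def best_word_sequence(hyps: List[Hypothesis]) -> Tuple[float, List[str]]:
    """Return the best total score and the words of the best non-overlapping chain."""
    hyps = sorted(hyps, key=lambda h: h.end)
    ends = [h.end for h in hyps]
    n = len(hyps)
    best = [0.0] * (n + 1)    # best[i]: best score using only the first i hypotheses
    take = [False] * (n + 1)  # take[i]: whether hypothesis i is used in best[i]
    prev = [0] * (n + 1)      # prev[i]: prefix length to continue from if taken

    for i, h in enumerate(hyps, start=1):
        # Count earlier hypotheses that end at or before this one starts.
        p = bisect_right(ends, h.start, 0, i - 1)
        with_h = best[p] + h.score
        if with_h > best[i - 1]:
            best[i], take[i], prev[i] = with_h, True, p
        else:
            best[i] = best[i - 1]

    # Backtrack to recover the chosen word sequence in time order.
    words: List[str] = []
    i = n
    while i > 0:
        if take[i]:
            words.append(hyps[i - 1].word)
            i = prev[i]
        else:
            i -= 1
    words.reverse()
    return best[n], words


if __name__ == "__main__":
    # Made-up detections, purely for illustration.
    detections = [
        Hypothesis("one", 0, 10, 2.0),
        Hypothesis("nine", 0, 12, 1.5),
        Hypothesis("two", 10, 20, 1.0),
        Hypothesis("too", 11, 22, 3.0),
        Hypothesis("three", 22, 30, 2.5),
    ]
    print(best_word_sequence(detections))
```

A real decoder would likely also weigh gaps between words and language-model scores on the graph edges; those are left out here because the abstract gives no detail on them.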

Bibliographic details

Source
Annual Conference of the International Speech Communication Association; INTERSPEECH 2011 | 2011 | pp. 3176-3179 | 4 pages
Authors
Sourish Chaudhuri; Bhiksha Raj; Tony Ezzat
Original format
PDF
Chinese Library Classification
Communications
Keywords
word-spotting; spectro-temporal features; speech recognition

