INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

S.A. Selouani; D. OShaughnessy; J. Caelen

首页> 外文期刊>International Journal of Computers & Applications >INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

【24h】

INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

机译：将语音知识纳入进化的子空间方法中以进行强健的语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The reliability of automatic speech recognition (ASR) systems is closely related to the parameterization process which is expected to accurately characterize the phonetic, dynamic and static components in speech. For this purpose, ASR methods build speech sound models based on large speech corpora that attempt to include common sources of variability that may occur in real-life conditions. Nevertheless, not all variabilities can reasonably be covered. For that reason, the performance of current ASR systems, whose designs are predicated on relatively noise-free conditions, degrades rapidly in the presence of high-level adverse conditions. To cope with mismatched (adverse) conditions and to achieve noise robustness, we present in this paper an original approach that operates in two steps. The first one consists of integrating in the front-end process, besides mean-subtracted mel-frequency cepstral coefficients, acoustic distinctive features that provides a more convenient interface to higher-level components of ASR systems. The second step consists of combining subspace filtering and Genetic Algorithms to get less-variant parameters. The advantages of this approach include that no estimation of noise is required and the recognition system is not modified. The effectiveness of the method is assessed in high interfering car noise by using a noisy subset of the TIMIT database. Obtained results show that the proposed method reduces drastically the word error rate for a wide range of signal-to-noise ratios.

机译：自动语音识别（ASR）系统的可靠性与参数化过程密切相关，该过程有望准确表征语音中的语音，动态和静态成分。为此，ASR方法基于大型语音库构建语音模型，这些语音库试图包括现实情况下可能出现的常见变异性来源。然而，并非所有的变化都可以合理地涵盖。因此，当前的ASR系统（其设计基于相对无噪声的条件）的性能会在存在严重不利条件的情况下迅速降低。为了应对不匹配的（不利）条件并实现噪声鲁棒性，我们在本文中提出了一种原始方法，该方法分两步进行。第一个功能包括在前端过程中进行整合，除了均值减去梅尔频率倒谱系数外，声学独特的功能还为ASR系统的更高级别的组件提供了更方便的接口。第二步包括将子空间过滤和遗传算法相结合，以获取变化较小的参数。该方法的优点包括不需要估计噪声并且不修改识别系统。通过使用TIMIT数据库的嘈杂子集，可以在高干扰汽车噪声中评估该方法的有效性。所得结果表明，对于宽范围的信噪比，该方法可以大大降低单词错误率。

著录项

来源
《International Journal of Computers & Applications》 |2007年第2期|p.143-154|共12页
作者
S.A. Selouani; D. OShaughnessy; J. Caelen;
展开▼
作者单位

Universite de Moncton, Campus de Shippagan, NB, Canada, E8S 1K9;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算机的应用;
关键词
speech recognition; genetic algorithms; hidden markov models; eigen-decomposition; distinctive cues; noise removal;

机译：语音识别遗传算法隐马尔可夫模型特征分解特征线索噪声去除;

相似文献

外文文献
中文文献
专利

1. Incorporating finer acoustic phonetic features in lexicon for Hindi language speech recognition [J] . Journal of information and optimization sciences . 2019,第8期

机译：在词典中纳入更精细的声学语音特征以进行印地语语音识别
2. Incorporating phonetic properties in hidden Markov models for speech recognition [J] . Ramachandrula N. V. Sitaram, Thippur Sreenivas The Journal of the Acoustical Society of America . 1997,第2期

机译：在隐马尔可夫模型中整合语音属性以进行语音识别
3. Optimisation of phonetic aware speech recognition through multi-objective evolutionary algorithms [J] . Bird Jordan J., Wanner Elizabeth, Ekart Aniko, Expert systems with applications . 2020,第Sepa期

机译：通过多目标进化算法优化语音意识语音识别
4. INCORPORATE STATISTICAL PATTERN RECOGNITION APPROACH AND ACOUSTIC-PHONETIC APPROACH FOR MANDARIN CONSONANT RECOGNITION [C] . Ming-Tzaw Lin Signal and Image Processing . 2002

机译：普通话辅音识别的统筹统计模式识别和语音识别方法
5. Synergy of acoustic-phonetics and auditory modeling towards robust speech recognition. [D] . Deshmukh, Om D. 2006

机译：语音和听觉建模对强大语音识别的协同作用。
6. Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data [O] . Ayesha Pervaiz, Fawad Hussain, Huma Israr, 2020

机译：通过训练数据的噪声增强将噪声鲁棒性纳入语音命令识别中
7. AN INVESTIGATION OF SUBSPACE MODELING FOR PHONETIC AND SPEAKER VARIABILITY IN AUTOMATIC SPEECH RECOGNITION [O] . Richard Rose, Shou-chun Yin, Yun Tang 2015

机译：自动语音识别中声音和扬声器可变性的子空间建模研究
8. Speech Recognition: Acoustic-Phonetic Knowledge Acquisition and Representation [R] . Zue, V. W. 1988

机译：语音识别：声学 - 语音知识获取与表征

INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

摘要

著录项

相似文献

相关主题

期刊订阅