Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients

Tjong Wan Sen; Bambang Riyanto Trilaksono; Arry Akhmad Arman; Rila Mandala

首页> 外文期刊>Journal of ICT Research and Applications >Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients

【24h】

Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients

机译：使用复数小波包变换系数的鲁棒性自动语音识别功能

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

To improve the performance of phoneme based Automatic Speech Recognition (ASR) in noisy environment; we developed a new technique that could add robustness to clean phonemes features. These robust features are obtained from Complex Wavelet Packet Transform (CWPT) coefficients. Since the CWPT coefficients represent all different frequency bands of the input signal, decomposing the input signal into complete CWPT tree would also cover all frequencies involved in recognition process. For time overlapping signals with different frequency contents, e. g. phoneme signal with noises, its CWPT coefficients are the combination of CWPT coefficients of phoneme signal and CWPT coefficients of noises. The CWPT coefficients of phonemes signal would be changed according to frequency components contained in noises. Since the numbers of phonemes in every language are relatively small (limited) and already well known, one could easily derive principal component vectors from clean training dataset using Principal Component Analysis (PCA). These principal component vectors could be used then to add robustness and minimize noises effects in testing phase. Simulation results, using Alpha Numeric 4 (AN4) from Carnegie Mellon University and NOISEX-92 examples from Rice University, showed that this new technique could be used as features extractor that improves the robustness of phoneme based ASR systems in various adverse noisy conditions and still preserves the performance in clean environments.

机译：在嘈杂的环境中提高基于音素的自动语音识别（ASR）的性能;我们开发了一种新技术，可以为清除音素功能增加鲁棒性。这些强大的功能是从复数小波包变换（CWPT）系数获得的。由于CWPT系数代表输入信号的所有不同频带，因此将输入信号分解为完整的CWPT树也将覆盖识别过程中涉及的所有频率。对于具有不同频率内容的时间重叠信号，例如。 G。带有噪声的音素信号，其CWPT系数是音素信号的CWPT系数和噪声的CWPT系数的组合。音素信号的CWPT系数将根据噪声中包含的频率分量而变化。由于每种语言的音素数量相对较少（有限）并且已经众所周知，因此可以使用主成分分析（PCA）从干净的训练数据集中轻松导出主成分向量。然后可以使用这些主成分矢量来增加鲁棒性，并在测试阶段将噪声影响降至最低。使用卡内基梅隆大学的Alpha Numeric 4（AN4）和莱斯大学的NOISEX-92实例进行的仿真结果表明，该新技术可以用作特征提取器，以提高基于音素的ASR系统在各种不利噪声条件下的鲁棒性，并且仍然保持清洁环境下的性能。

著录项

来源
《Journal of ICT Research and Applications》 |2009年第2期|共11页
作者
Tjong Wan Sen; Bambang Riyanto Trilaksono; Arry Akhmad Arman; Rila Mandala;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Emotion recognition from speech using wavelet packet transform and prosodic features [J] . Gupta Manish, Bharti Shambhu Shankar, Agarwal Suneeta Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2018,第2期

机译：使用小波包变换和韵律特征来讲话的情感认识
2. Novel Sub-band Spectral Centroid Weighted Wavelet Packet Features with Importance-Weighted Support Vector Machines for Robust Speech Emotion Recognition [J] . Huang Yongming, Ao Wu, Zhang Guobao Wireless personal communications: An Internaional Journal . 2017,第3期

机译：新型子带谱质心加权小波包具有强大的语音情感识别的重要性加权支持向量机
3. Robust Speech Recognition Using Perceptual Wavelet Denoising and Mel-frequency Product Spectrum Cepstral Coefficient Features [J] . M.C.A. Korba, D. Messadeg, R. Djemili, Informatica: An International Journal of Computing and Informatics . 2008,第3期

机译：使用感知小波降噪和梅尔频率乘积谱倒谱系数特征的稳健语音识别
4. Using the Modulation Complex Wavelet Transform for Feature Extraction in Automatic Speech Recognition [C] . Yasunori Momomura, Kenji Okada, Takayuki Arai, European conference on speech communication and technology . 2001

机译：在自动语音识别中使用调制复杂小波变换进行特征提取
5. Wavelet-based feature extraction for robust speech recognition. [D] . Walker, Shonda Lachelle. 2003

机译：基于小波的特征提取，可实现强大的语音识别。
6. Automatic recognition of breast invasive ductal carcinoma based on terahertz spectroscopy with wavelet packet transform and machine learning [O] . Wenquan Liu, Rui Zhang, Yu Ling, 2020

机译：小波包变换和机器学习的太赫兹光谱自动识别乳腺浸润性导管癌
7. Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients [O] . Tjong Wan Sen, Bambang Riyanto Trilaksono, Arry Akhmad Arman, 2013

机译：基于复小波包变换系数的鲁棒自动语音识别特征

Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients

摘要

著录项

相似文献

相关主题

期刊订阅