Source-filter Separation of Speech Signal in the Phase Domain

机译：相域中语音信号的源滤波器分离

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deconvolution of the speech excitation (source) and vocal tractud(filter) components through log-magnitude spectral processingudis well-established and has led to the well-known cepstral featuresudused in a multitude of speech processing tasks. This paperudpresents a novel source-filter decomposition based on processingudin the phase domain. We show that separation betweenudsource and filter in the log-magnitude spectra is far fromudperfect, leading to loss of vital vocal tract information. It isuddemonstrated that the same task can be better performed byudtrend and fluctuation analysis of the phase spectrum of theudminimum-phase component of speech, which can be computedudvia the Hilbert transform. Trend and fluctuation can be separatedudthrough low-pass filtering of the phase, using additivity ofudvocal tract and source in the phase domain. This results in separatedudsignals which have a clear relation to the vocal tract andudexcitation components. The effectiveness of the method is putudto test in a speech recognition task. The vocal tract componentudextracted in this way is used as the basis of a feature extractionudalgorithm for speech recognition on the Aurora-2 database.udThe recognition results shows upto 8.5% absolute improvementudin comparison with MFCC features on average (0-20dB).

机译：通过对数幅度频谱处理对语音激励（源）和声道 ud（滤波器）的分量进行去卷积已经很成熟，并导致了众所周知的倒谱特征在许多语音处理任务中被使用。本文介绍了一种基于相域处理的新型源滤波器分解方法。我们显示，对数幅度谱中 udsource与过滤器之间的分离远非 udperfect，导致重要声道信息的丢失。说明语音的最佳相位分量的相位谱的趋势和波动分析可以更好地执行同一任务，这可以通过希尔伯特变换来计算。可以使用相位域中的声道和信号源的相加性，通过相位的低通滤波来分离趋势和波动。这导致分离的 udsignal与声道和 udexcitation组件有明确的关系。该方法的有效性在语音识别任务中进行了测试。以这种方式去声道的声道成分被用作特征提取的基础，用于在Aurora-2数据库上进行语音识别的算法。 ud识别结果显示，与MFCC功能相比，绝对改善率高达8.5％ udin（0 -20dB）。

著录项

作者
Loweimi E.; Barker J.; Hain T.;
展开▼
作者单位

展开▼
年度 2015
总页数
原文格式 PDF
正文语种 en
中图分类

相似文献

外文文献
中文文献
专利

1. Partial separation method for solving permutation problem in frequency domain blind source separation of speech signals [J] . V.G. Reju, Soo Ngee Koh, Ing Yann Soon Neurocomputing . 2008,第10a12期

机译：语音信号频域盲源分离中置换问题的局部分离方法
2. Implementation of Frequency Domain Approach Using Instantaneous Mixing Auto Recursive for Separation of Speech Signals [J] . Palagan C. Anna, Geetha K. Parimala Journal of computational and theoretical nanoscience . 2016,第10期

机译：使用瞬时混合自动递归对语音信号分离的频域方法的实现
3. Exploiting all combinations of microphone sensors in overdetermined frequency domain blind separation of speech signals [J] . Yonggang Zhang, Jonathon A. Chambers International Journal of Adaptive Control and Signal Processing . 2011,第1期

机译：在超频域语音信号盲分离中利用麦克风传感器的所有组合
4. A JOINT PROBABILISTIC-DETERMINISTIC APPROACH USING SOURCE-FILTER MODELING OF SPEECH SIGNAL FOR SINGLE CHANNEL SPEECH SEPARATION [C] . M. H. Radfar, R. M. Dansereau, A. Sayadiyan IEEE Workshop on Machine Learning for Signal Processing . 2006

机译：单频道语音分离的语音信号源滤波器建模的联合概率确定方法
5. Blind Source Separation of Speech Signals: Exploiting Second Order Statistics [D] . Madanagopal, Vishaal. 2018

机译：语音信号的盲来源分离：利用二阶统计
6. Effects of electrode separation between speech and noise signals on consonant identification in cochlear implants [O] . Bom Jun Kwon -1

机译：语音和噪声信号之间的电极分离对人工耳蜗中辅音识别的影响
7. MLSP 2007 DATA ANALYSIS COMPETITION: FREQUENCY-DOMAIN BLIND SOURCE SEPARATION FOR CONVOLUTIVE MIXTURES OF SPEECH/AUDIO SIGNALS [O] . Hiroshi Sawada, Shoko Araki, Shoji Makino 2008

机译：MLSP 2007数据分析竞争：语音/音频信号的混合混合的频域盲源分离

Source-filter Separation of Speech Signal in the Phase Domain

摘要

著录项

相似文献

相关主题

期刊订阅