Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization

Yoshioka T.Nakatani T.Miyoshi M.Okuno H. G.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization

【24h】

Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization

机译：通过联合优化实现语音混合的盲分离和混响

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper proposes a method for performing blind source separation (BSS) and blind dereverberation (BD) at the same time for speech mixtures. In most previous studies, BSS and BD have been investigated separately. The separation performance of conventional BSS methods deteriorates as the reverberation time increases while many existing BD methods rely on the assumption that there is only one sound source in a room. Therefore, it has been difficult to perform both BSS and BD when the reverberation time is long. The proposed method uses a network, in which dereverberation and separation networks are connected in tandem, to estimate source signals. The parameters for the dereverberation network (prediction matrices) and those for the separation network (separation matrices) are jointly optimized. This enables a BD process to take a BSS process into account. The prediction and separation matrices are alternately optimized with each depending on the other; hence, we call the proposed method the conditional separation and dereverberation (CSD) method. Comprehensive evaluation results are reported, where all the speech materials contained in the complete test set of the TIMIT corpus are used. The CSD method improves the signal-to-interference ratio by an average of about 4 dB over the conventional frequency-domain BSS approach for reverberation times of 0.3 and 0.5 s. The direct-to-reverberation ratio is also improved by about 10 dB.

机译：本文提出了一种用于语音混合的同时执行盲源分离（BSS）和盲去混响（BD）的方法。在之前的大多数研究中，BSS和BD均已分别进行了研究。传统BSS方法的分离性能会随着混响时间的增加而恶化，而许多现有的BD方法都依赖于一个假设，即房间中只有一个声源。因此，当混响时间长时，难以同时执行BSS和BD。所提出的方法使用将去混响和分离网络串联在一起的网络来估计源信号。共同优化去混响网络的参数（预测矩阵）和分离网络的参数（分离矩阵）。这使得BD流程可以将BSS流程考虑在内。预测矩阵和分离矩阵彼此交替优化。因此，我们将所提出的方法称为条件分离和去混响（CSD）方法。报告了综合评估结果，其中使用了TIMIT语料库完整测试集中包含的所有语音材料。 CSD方法在0.3和0.5 s的混响时间上比常规频域BSS方法平均提高了约4 dB的信号干扰比。直接混响比也提高了约10 dB。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第1期|p.69-84|共16页
作者
Yoshioka T.Nakatani T.Miyoshi M.Okuno H. G.;
展开▼
作者单位

NTTCommunicationScienceLaboratories,NipponTelegraphandTelephoneCorporation,Kyoto,Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Blind source separation (BSS); blind dereverberation (BD); conditional separation and dereverberation (CSD);

机译：盲源分离（BSS）;盲混响（BD）;条件分离混响（CSD）;

相似文献

外文文献
中文文献
专利

1. A Blind Channel Identification-Based Two-Stage Approach to Separation and Dereverberation of Speech Signals in a Reverberant Environment [J] . Huang Y., Benesty J., Chen J. IEEE Transactions on Speech and Audio Proceessing . 2005,第5期

机译：基于盲通道识别的两阶段混响环境中语音信号分离与去混响方法
2. Preconditioned optimization algorithms solving the problem of the non unitary joint block diagonalization: application to blind separation of convolutive mixtures [J] . Cherrak Omar, Ghennioui Hicham, Thirion-Moreau Nadege, Multidimensional systems and signal processing . 2018,第4期

机译：求解非全整体关节块对角化问题的预处理优化算法：掺杂卷曲混合物盲分离的应用
3. Joint source separation and dereverberation using constrained spectral divergence optimization [J] . Karan Nathwani, Rajesh M. Hegde Signal processing . 2015,第jana期

机译：联合源分离和去混响使用约束谱发散优化
4. Online Adaptation for Jointly Optimized Blind Source Separation and Dereverberation of Speech Mixtures [C] . Timo Schuster, Stefan Feldes ITG-Symposium on Speech Communication . 2018

机译：在线自适应联合优化的混合语音盲源分离和去混响
5. Sensitivity analysis of blind separation of speech mixtures . [D] . Bulek, Savaskan. 2010

机译：语音混合盲分离的灵敏度分析。
6. Selective-Tap Blind Dereverberation for Two-Microphone Enhancement of Reverberant Speech [O] . Kostas Kokkinakis, Philipos C. Loizou -1

机译：选择性抽头盲混响去除的混响语音的双麦克风增强
7. An integrated method for blind separation and dereverberation of convolutive audio mixtures [O] . Miyoshi Masato, Nakatani Tomohiro, Yoshioka Takuya 2008

机译：卷积音频混合物的盲分离和混响的集成方法
8. Blind Adaptive Dereverberation of Speech Signals Using a Microphone Array [R] . Bakir, T. S. , Mersereau, R. M. 2003

机译：使用麦克风阵列进行语音信号的盲自适应去混响

Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅