An iterative model-based approach to cochannel speech separation

Ke Hu; DeLiang Wang

首页> 外文期刊>EURASIP Journal on Audio, Speech, and Music Processing >An iterative model-based approach to cochannel speech separation

【24h】

An iterative model-based approach to cochannel speech separation

机译：基于迭代模型的同信道语音分离方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cochannel speech separation aims to separate two speech signals from a single mixture. In a supervised scenario, the identities of two speakers are given, and current methods use pre-trained speaker models for separation. One issue in model-based methods is the mismatch between training and test signal levels. We propose an iterative algorithm to adapt speaker models to match the signal levels in testing. Our algorithm first obtains initial estimates of source signals using unadapted speaker models and then detects the input signal-to-noise ratio (SNR) of the mixture. The input SNR is then used to adapt the speaker models for more accurate estimation. The two steps iterate until convergence. Compared to search-based SNR detection methods, our method is not limited to given SNR levels. Evaluations demonstrate that the iterative procedure converges quickly in a considerable range of SNRs and improves separation results significantly. Comparisons show that the proposed system performs significantly better than related model-based systems.

机译：同频道语音分离旨在从单个混合物中分离两个语音信号。在有监督的情况下，给出了两个说话者的身份，并且当前的方法使用预先训练的说话者模型进行分离。基于模型的方法中的一个问题是训练和测试信号电平之间的不匹配。我们提出一种迭代算法，以调整扬声器模型以匹配测试中的信号电平。我们的算法首先使用不适合的扬声器模型获得源信号的初始估计，然后检测混合物的输入信噪比（SNR）。输入SNR然后用于调整扬声器模型，以进行更准确的估计。这两个步骤反复进行，直到收敛为止。与基于搜索的SNR检测方法相比，我们的方法不限于给定的SNR级别。评估表明，迭代过程在相当大的SNR范围内快速收敛，并显着改善了分离结果。比较表明，所提出的系统的性能明显优于相关的基于模型的系统。

著录项

来源
《EURASIP Journal on Audio, Speech, and Music Processing》 |2013年第1期|1-11|共11页
作者
Ke Hu; DeLiang Wang;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. An iterative model-based approach to cochannel speech separation [J] . Ke Hu, DeLiang Wang EURASIP journal on audio, speech, and music processing . 2013,第1期

机译：基于迭代模型的同信道语音分离方法
2. An Unsupervised Approach to Cochannel Speech Separation [J] . Hu K., Wang D. Audio, Speech, and Language Processing, IEEE Transactions on . 2013,第1期

机译：同道语音分离的无监督方法
3. Model-based sequential organization in cochannel speech [J] . Yang Shao, DeLiang Wang IEEE transactions on audio, speech and language processing . 2006,第1期

机译：同道语音中基于模型的顺序组织
4. Speaker identification enhancement under cochannel conditions using sinusoidal model-based usable speech detection [C] . Khanwalkar S.S., Smolenski B.Y., Yantorno R.E. Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004. Proceedings of 2004 International Symposium on . 2004

机译：使用基于正弦模型的可用语音检测增强同频道条件下的说话人识别
5. Model-based Single-microphone Speech Separation Using Conditional Random Fields. [D] . Yeung, Yu Ting. 2014

机译：使用条件随机场的基于模型的单麦克风语音分离。
6. A Real-Time Speech Separation Method Based on Camera and Microphone Array Sensors Fusion Approach [O] . Ching-Feng Liu, Wei-Siang Ciou, Peng-Ting Chen, 2020

机译：基于摄像头和麦克风阵列传感器融合方法的实时语音分离方法
7. An iterative model-based approach to cochannel speech separation [O] . 2013

机译：基于迭代模型的同信道语音分离方法

An iterative model-based approach to cochannel speech separation

摘要

著录项

相似文献

相关主题

期刊订阅