A hypothesis testing approach for real-time multichannel speech separation using time-frequency masks

Abstract

We propose a new approach to time-frequency mask generation for real-time multichannel speech separation. Whereas conventional approaches select the strongest source in each time-frequency bin, we perform a binary hypothesis test to determine whether a target source is present or not. We derive a generalized likelihood ratio test and extend it to underdetermined mixtures by aggregating the outputs of several tests with different interference models. This approach is justified by the nonstationarity and time-frequency disjointedness of speech signals. This computationally simple method is suitable for real-time source separation in resource-constrained and latency-critical applications.
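
The per-bin test described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes known steering vectors for the target and for each candidate interferer, unknown deterministic source amplitudes, and white Gaussian noise of known variance. The function names, the threshold, and the rule of taking the minimum statistic across interference models are all hypothetical choices made for illustration; the abstract does not specify how the test outputs are aggregated.

```python
import numpy as np

def glrt_statistic(x, a, b, noise_var=1.0):
    """Target-presence statistic for one time-frequency bin (illustrative only).

    x: (M,) complex microphone observation
    a: (M,) assumed target steering vector
    b: (M,) assumed interferer steering vector
    """
    # Remove the component of x and a that lies along the interferer.
    b_unit = b / np.linalg.norm(b)
    x_perp = x - b_unit * np.vdot(b_unit, x)
    a_perp = a - b_unit * np.vdot(b_unit, a)
    denom = np.vdot(a_perp, a_perp).real
    if denom < 1e-12:
        # Target and interferer are indistinguishable under this model.
        return 0.0
    # Energy of the residual along the target direction, relative to noise.
    return float(abs(np.vdot(a_perp, x_perp)) ** 2 / (denom * noise_var))

def target_mask(x, a, interferers, threshold, noise_var=1.0):
    """Binary mask value for one bin.

    Assumption: the target is declared present only if the test passes
    under every single-interferer model (minimum over statistics).
    """
    stats = [glrt_statistic(x, a, b, noise_var) for b in interferers]
    return bool(min(stats) > threshold)

# Toy example with four microphones and mutually orthogonal steering vectors.
a = np.array([1, 1, 1, 1], dtype=complex)
b1 = np.array([1, -1, 1, -1], dtype=complex)
b2 = np.array([1, 1, -1, -1], dtype=complex)
print(target_mask(3 * a + 2 * b1, a, [b1, b2], threshold=1.0))
print(target_mask(2 * b1, a, [b1, b2], threshold=1.0))
```

Each test costs only a few inner products per bin, which is in line with the abstract's emphasis on low computational cost, although the paper's actual statistic and aggregation rule may differ from this sketch.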

Bibliographic record

Source
IEEE International Workshop on Machine Learning for Signal Processing, 2016, pp. 1-6 (6 pages)
Authors
Ryan M. Corey; Andrew C. Singer
Original format: PDF
Keywords
Speech; Time-frequency analysis; Signal to noise ratio; Microphones; Testing; Real-time systems; Interference;

Similar literature

1. Blind separation of underdetermined convolutive speech mixtures by time-frequency masking with the reduction of musical noise of separated signals [J]. Zohrevandi Mahbanou, Setayeshi Saeed, Rabiee Azam. Multimedia Tools and Applications, 2021, Issue 8
2. Impact of phase estimation on single-channel speech separation based on time-frequency masking [J]. Mayer Florian, Williamson Donald S., Mowlaee Pejman. The Journal of the Acoustical Society of America, 2017, Issue 6
3. Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking [J]. P. Pertila. Computer Speech and Language, 2013, Issue 3
4. A hypothesis testing approach for real-time multichannel speech separation using time-frequency masks [C]. Ryan M. Corey, Andrew C. Singer. IEEE International Workshop on Machine Learning for Signal Processing, 2016
5. Multichannel signal decomposition and separation in the time-frequency domain [D]. Shan, Zeyong. 2009
6. Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design [O]. DeLiang Wang. 2008
7. Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks [O]. Yang Yu, Wenwu Wang, Peng Han. 2016
8. Isolating the Energetic Component of Speech-on-Speech Masking With Ideal Time-Frequency Segregation [R]. Brungart, D. S., Chang, P. S., Simpson, B. D. 2006
