On generalization of classification based speech separation


Abstract

Monaural speech separation is a very challenging problem. Recent studies utilize supervised learning methods to estimate the ideal binary mask (IBM) to solve the problem. In a supervised learning framework, the issue of generalization to conditions different from those used in training is paramount. This paper describes methods that require only a small training corpus but can generalize to unseen conditions. The system utilizes support vector machines to learn classification cues and then employs a rethresholding method to estimate the IBM. A distribution fitting method is used to address unseen signal-to-noise ratio conditions and an iterative voice activity detection is used to address unseen noise conditions. Systematic evaluations show that the proposed approach produces high quality IBM estimates under unseen conditions.
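
The abstract names two building blocks without giving formulas: the ideal binary mask and a rethresholding step applied to classifier outputs. The sketch below illustrates both under stated assumptions. It uses the standard IBM definition (a time-frequency unit is labeled 1 when its local SNR exceeds a local criterion, here assumed to be 0 dB) and treats rethresholding as replacing the default decision threshold of 0 with another value. The function names, the toy energies, and the threshold of -0.5 are hypothetical. How the paper actually selects the threshold (its distribution fitting method) is not described in this record and is not reproduced here.

```python
import numpy as np


def ideal_binary_mask(speech_energy, noise_energy, lc_db=0.0):
    """Return 1 where the local SNR in dB exceeds lc_db, else 0.

    speech_energy and noise_energy hold premixed energies per
    time-frequency unit; lc_db is the assumed local criterion.
    """
    eps = np.finfo(float).eps  # guards against division by zero and log(0)
    local_snr_db = 10.0 * np.log10((speech_energy + eps) / (noise_energy + eps))
    return (local_snr_db > lc_db).astype(np.uint8)


def rethreshold(decision_values, threshold):
    """Label a unit 1 when the classifier output exceeds `threshold`.

    With threshold=0 this matches the usual SVM sign rule; any other
    value shifts the decision boundary (hypothetical helper).
    """
    return (np.asarray(decision_values) > threshold).astype(np.uint8)


# Toy 2x2 grid of time-frequency unit energies (made-up numbers).
speech = np.array([[4.0, 1.0], [0.5, 9.0]])
noise = np.array([[1.0, 2.0], [2.0, 1.0]])
mask = ideal_binary_mask(speech, noise)

# Toy SVM decision values relabeled with a shifted threshold.
labels = rethreshold([-0.8, -0.2, 0.3], threshold=-0.5)
```

Lowering the threshold below 0 labels more units as speech-dominant, which is the kind of adjustment a rethresholding step makes when the test SNR differs from the training SNR.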

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 4541-4544 (4 pages)
Conference location: Kyoto, Japan
Authors
Han, Kun
Author affiliation

Department of Computer Science and Engineering & Center for Cognitive Science, The Ohio State University, Columbus 43210-1277, USA

Original format: PDF
Date indexed: 2022-08-26 14:38:22

Similar documents

1. Deep neural networks based binary classification for single channel speaker independent multi-talker speech separation [J]. Saleem Nasir, Khattak Muhammad Irfan. Applied Acoustics, 2020, issue Octa [sic]
2. Towards Scaling Up Classification-Based Speech Separation [J]. Wang Y., Wang D. Audio, Speech, and Language Processing, IEEE Transactions on, 2013, no. 7
3. Long short-term memory for speaker generalization in supervised speech separation [J]. Chen Jitong, Wang DeLiang. The Journal of the Acoustical Society of America, 2017, no. 6
4. On generalization of classification based speech separation [C]. Han Kun. IEEE International Conference on Acoustics, Speech and Signal Processing, 2011
5. On Generalization of Supervised Speech Separation [D]. Chen, Jitong. 2017
6. Long short-term memory for speaker generalization in supervised speech separation [O]. Jitong Chen, DeLiang Wang
7. Classification and Separation Techniques based on Fundamental Frequency for Speech Enhancement [O]. Cabañas-Molero Pablo-Antonio. 2016
8. Speech Segregation based on Binary Classification [R]. Wang, D. 2016
