首页> 外文会议>2018 52nd Asilomar Conference on Signals, Systems, and Computers >A new feature set for masking-based monaural speech separation

【24h】

A new feature set for masking-based monaural speech separation

机译：基于蒙版的单声道语音分离的新功能集

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new feature based on a gammatone filter bank for improving monaural speech separation using neural networks. This new feature encodes not only the local information of cochleagram, and spectrotemporal context, similar to previous approaches, but also captures time-frequency dynamics in the spectrotemporal context using an image processing technique. Speech separation was achieved by computing optimal time-frequency masks using two types of neural networks (DNN and LSTM) to determine the interactions between feature and training model properties. The performance of our feature was evaluated in a variety of simulated environments having different non-stationary noises and reverberation times and quantified using three objective measures. Experimental results show that the proposed monaural feature set improves the objective speech intelligibility, speech quality and signal-to-noise ratio compared to prior feature sets in noisy and reverberant environments with particular benefit in speech intelligibility.

机译：我们提出了一个基于gammatone滤波器库的新功能，用于使用神经网络改善单声道语音分离。与以前的方法类似，此新功能不仅对耳蜗图和时空上下文进行本地编码，而且还使用图像处理技术捕获时空上下文中的时频动态。语音分离是通过使用两种类型的神经网络（DNN和LSTM）计算最佳时频掩码来确定特征与训练模型属性之间的相互作用来实现的。我们在各种具有不同非平稳噪声和混响时间的模拟环境中对我们功能的性能进行了评估，并使用三个客观指标对其进行了量化。实验结果表明，与嘈杂和混响环境中的现有特征集相比，所提出的单声道特征集提高了客观语音清晰度，语音质量和信噪比，特别有利于语音清晰度。

著录项

来源
《2018 52nd Asilomar Conference on Signals, Systems, and Computers 》|2018年|828-832|共5页
会议地点 Pacific Grove(US)
作者
Shadi Pirhosseinloo; Jonathan S. Brumberg;
展开▼
作者单位

Electrical Engineering Computer Science, University of Kansas, Lawrence, KS, USA;

Speech-Language-Hearing, University of Kansas, Lawrence, KS, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Training; Noise measurement; Neural networks; Speech processing; Time-frequency analysis; Reverberation; Feature extraction;

机译：训练;噪声测量;神经网络;语音处理;时频分析;混响;特征提取;;

相似文献

外文文献
中文文献
专利

1. Features for Masking-Based Monaural Speech Separation in Reverberant Conditions [J] . Masood Delfarah, DeLiang Wang Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017 ,第5期

机译：混响条件下基于蒙版的单声道语音分离的功能
2. Monaural speech separation based on MAXVQ and CASA for robust speech recognition [J] . Peng Li, Yong Guan, Shijin Wang, Computer speech and language . 2010 ,第1期

机译：基于MAXVQ和CASA的单声道语音分离可增强语音识别能力
3. Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech [J] . Li P., Guan Y., Xu B., IEEE transactions on audio, speech and language processing . 2006 ,第6期

机译：基于计算听觉场景分析和语音客观质量评估的单声道语音分离
4. A new feature set for masking-based monaural speech separation [C] . Shadi Pirhosseinloo, Jonathan S. Brumberg Asilomar Conference on Signals, Systems, and Computers . 2018

机译：用于基于掩蔽的单声道语音分离的新功能
5. Kinematic measurement and feature sets for automatic speech recognition. [D] . Fain, Daniel Clark. 2001

机译：运动学测量和功能集，用于自动语音识别。
6. Complex Ratio Masking for Monaural Speech Separation [O] . Donald S. Williamson, Yuxuan Wang, DeLiang Wang -1

机译：用于单声道语音分离的复数比率掩蔽
7. NMF based speech and music separation in monaural speech recordings with sparseness and temporal continuity constraints [O] . Tu Ming, Xie Xiang, Jiao Yishan 2013

机译：基于NMF的语音和音乐分离在单声道语音记录中，具有稀疏性和时间连续性约束
8. Deep Ensemble Learning for Monaural Speech Separation. [R] . Wang, D. 2015

机译：单声道语音分离的深度集成学习。

A new feature set for masking-based monaural speech separation

摘要

著录项

相似文献

相关主题

期刊订阅