Binaural Speech Separation Algorithm Based on Long and Short Time Memory Networks

Lin Zhou; Siyuan Lu; Qiuyue Zhong; Ying Chen; Yibin Tang; Yan Zhou

首页> 中文期刊> 《计算机、材料和连续体（英文）》 >Binaural Speech Separation Algorithm Based on Long and Short Time Memory Networks

Binaural Speech Separation Algorithm Based on Long and Short Time Memory Networks

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker separation in complex acoustic environment is one of challenging tasks in speech separation.In practice,speakers are very often unmoving or moving slowly in normal communication.In this case,the spatial features among the consecutive speech frames become highly correlated such that it is helpful for speaker separation by providing additional spatial information.To fully exploit this information,we design a separation system on Recurrent Neural Network(RNN)with long short-term memory(LSTM)which effectively learns the temporal dynamics of spatial features.In detail,a LSTM-based speaker separation algorithm is proposed to extract the spatial features in each time-frequency(TF)unit and form the corresponding feature vector.Then,we treat speaker separation as a supervised learning problem,where a modified ideal ratio mask(IRM)is defined as the training function during LSTM learning.Simulations show that the proposed system achieves attractive separation performance in noisy and reverberant environments.Specifically,during the untrained acoustic test with limited priors,e.g.,unmatched signal to noise ratio(SNR)and reverberation,the proposed LSTM based algorithm can still outperforms the existing DNN based method in the measures of PESQ and STOI.It indicates our method is more robust in untrained conditions.

著录项

来源
《计算机、材料和连续体（英文）》 |2020年第6期|1373-1386|共14页
作者
Lin Zhou; Siyuan Lu; Qiuyue Zhong; Ying Chen; Yibin Tang; Yan Zhou;
展开▼
作者单位

School of Information Science and Engineering;

Southeast University;

Nanjing;

210096;

China;

Department of Psychiatry;

Columbia University and NYSPI;

New York;

10032;

USA;

College of Internet of Things Engineering;

Hohai University;

Changzhou;

213022;

China;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TN9;
关键词
Binaural speech separation; long and short time memory networks; feature vectors; ideal ratio mask;

相似文献

中文文献
外文文献
专利

1. Speech Enhancement Algorithm Based on MMSE Short Time Spectral Amplitude in Whispered Speech [J] . Zhi-Heng Lu ,Huai-Zong Shao ,Tai-Liang Ju . 电子科技学刊 . 2009,第002期
2. A Novel Intrusion Detection Algorithm Based on Long Short Term Memory Network [J] . Xinda Hao ,Jianmin Zhou ,Xueqi Shen . 量子计算杂志(英文) . 2020,第2期
3. Short-term Load Forecasting of Regional Distribution Network Based on Generalized Regression Neural Network Optimized by Grey Wolf Optimization Algorithm [J] . Leijiao Ge ,Yiming Xian ,Zhongguan Wang . 中国电机工程学会电力与能源系统学报 . 2021,第005期
4. Solar radio filtering algorithm based on improved long short-term memory [J] . Qing-Fu Du ,Qiao-Man Zhang ,Xin Li . 天文和天体物理学研究 . 2021,第004期
5. NOISY SPEECH ENHANCEMENT ALGORITHM BASED ON SHOT TIME SPECTRUM AND ADDITIVE NOISE [C] . . 第六届全国人机语音通讯学术会议 . 2001
6. Research on Movie Recommendation Method Based on Convolution Neural Network and Long Short Term Memory Network [A] . Yiyuan Tang . 2019

Binaural Speech Separation Algorithm Based on Long and Short Time Memory Networks

摘要

著录项

相似文献

相关主题

期刊订阅