Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks

Abstract

This paper presents a low-latency, neural-network-based speech enhancement system. Low-latency operation is critical for speech communication applications. The system uses the time-frequency (TF) masking approach to retain speech and remove non-speech content from the observed signal. The ideal TF masks are obtained by supervised training of neural networks. As the main contribution, different neural network models are compared experimentally in terms of computational complexity and speech enhancement performance. The proposed system is trained and tested on noisy speech data with signal-to-noise ratios (SNR) ranging from -5 dB to +5 dB. The results show a significant reduction of non-speech content in the resulting signal while still meeting the low-latency criterion, defined here as less than 20 ms.
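
The TF masking idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' system: it uses an oracle ratio mask computed from known speech and noise signals instead of a trained neural network, a synthetic sine tone stands in for speech, and the 16 kHz sample rate, 256-sample frame, and the helper names `stft`, `ratio_mask`, and `algorithmic_latency_ms` are all assumptions made for illustration. Latency is approximated here by the analysis frame length alone.

```python
import numpy as np


def stft(x, frame_len, hop):
    """Windowed short-time Fourier transform, shape (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def ratio_mask(speech_spec, noise_spec, eps=1e-12):
    """Oracle ratio mask in [0, 1]: speech power over total power per TF bin."""
    s = np.abs(speech_spec) ** 2
    n = np.abs(noise_spec) ** 2
    return s / (s + n + eps)


def algorithmic_latency_ms(frame_len, fs):
    """Assumption: latency is dominated by buffering one analysis frame."""
    return 1000.0 * frame_len / fs


# Illustrative settings (assumed, not taken from the paper).
fs = 16000          # sample rate in Hz
frame_len = 256     # 16 ms frame at 16 kHz
hop = 128           # 50 % overlap

rng = np.random.default_rng(0)
t = np.arange(fs) / fs
speech = np.sin(2 * np.pi * 440.0 * t)             # stand-in for speech
noise = rng.normal(scale=np.sqrt(0.5), size=fs)    # roughly 0 dB SNR
mixture = speech + noise

S = stft(speech, frame_len, hop)
N = stft(noise, frame_len, hop)
Y = stft(mixture, frame_len, hop)

mask = ratio_mask(S, N)
enhanced = mask * Y          # masking: attenuate noise-dominated TF bins

# Spectral error against the clean signal, before and after masking.
err_noisy = np.sum(np.abs(Y - S) ** 2)
err_enhanced = np.sum(np.abs(enhanced - S) ** 2)
latency_ms = algorithmic_latency_ms(frame_len, fs)

print("latency (ms):", latency_ms)
print("error before masking:", err_noisy)
print("error after masking:", err_enhanced)
```

In a supervised setup such as the one the abstract describes, a network would be trained to predict a mask like this from the noisy spectrum alone, since the clean speech and noise are not available at run time.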

Bibliographic record

Source
International Workshop on Acoustic Signal Enhancement | 2018 | pp. 51-55 | 5 pages
Conference location: Tokyo (JP)
Authors
Mikko Parviainen; Pasi Pertilä; Tuomas Virtanen; Peter Grosche;
Author affiliations

Tampere University of Technology, Laboratory of Signal Processing, Tampere, Finland;

Hua;
Original format: PDF
Keywords
Speech enhancement; Noise measurement; Training; Time-frequency analysis; Biological neural networks; Signal to noise ratio;

Similar literature

1. Impact of phase estimation on single-channel speech separation based on time-frequency masking [J]. Mayer Florian, Williamson Donald S., Mowlaee Pejman. The Journal of the Acoustical Society of America. 2017, No. 6
2. Time-frequency masking based supervised speech enhancement framework using fuzzy deep belief network [J]. Samui Suman, Chakrabarti Indrajit, Ghosh Soumya K. Applied Soft Computing. 2019
3. A Time-Frequency Adaptation Based on Quantum Neural Networks for Speech Enhancement [J]. Kun-Ching Wang, Chiun-Li Chin. WSEAS Transactions on Information Science and Applications. 2010, No. 1a3
4. Time-Frequency Masking Strategies for Single-Channel Low-Latency Speech Enhancement Using Neural Networks [C]. Mikko Parviainen, Pasi Pertilä, Tuomas Virtanen. International Workshop on Acoustic Signal Enhancement. 2018
5. Speech enhancement algorithms using Kalman filtering and masking properties of human auditory systems [D]. Ma, Ning. 2005
6. Impact of phase estimation on single-channel speech separation based on time-frequency masking [O]. Florian Mayer, Donald S. Williamson, Pejman Mowlaee.
7. Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks [O]. Yang Yu, Wenwu Wang, Peng Han. 2016
8. Enhancing Listener Strategies Using a Payoff Matrix in Speech-on-speech Masking Experiments [R]. Thompson, E. R., Iyer, N., Simpson, B. D. 2015