Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks

机译：使用时间上下文和卷积神经网络改善双耳线索的回响语音分离

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Given binaural features as input, such as interaural level difference and interaural phase difference, Deep Neural Networks (DNNs) have been recently used to localize sound sources in a mixture of speech signals and/or noise, and to create time-frequency masks for the estimation of the sound sources in reverberant rooms. Here, we explore a more advanced system, where feed-forward DNNs are replaced by Convolutional Neural Networks (CNNs). In addition, the adjacent frames of each time frame (occurring before and after this frame) are used to exploit contextual information, thus improving the localization and separation for each source. The quality of the separation results is evaluated in terms of Signal to Distortion Ratio (SDR).

机译：给定双耳特征作为输入，例如耳间电平差和耳间相位差，最近已使用深度神经网络（DNN）在混合语音信号和/或噪声的情况下定位声源，并为音频信号创建时频掩码估计混响室内的声源。在这里，我们探索了一个更高级的系统，其中前馈DNN被卷积神经网络（CNN）取代。另外，每个时间帧的相邻帧（在该帧之前和之后发生）都用于利用上下文信息，从而改善了每个源的定位和分离。分离结果的质量根据信号失真比（SDR）进行评估。

著录项

来源
《International conference on latent variable analysis and signal separation》|2018年|361-371|共11页
会议地点
作者
Alfredo Zermini; Qiuqiang Kong; Yong Xu; Mark D. Plumbley; Wenwu Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Convolutional Neural Networks; Binaural cues; Reverberant rooms; Speech separation; Contextual information;

机译：卷积神经网络双耳提示;混响室;语音分离;上下文信息;

相似文献

外文文献
中文文献
专利

1. Speech separation based on reliable binaural cues with two-stage neural network in noisy-reverberant environments [J] . Li Ruwei, Li Tao, Sun Xiaoyue, Applied Acoustics . 2020,第Nova期

机译：基于可靠的双耳线路与双阶段神经网络在嘈杂的环境中的语音分离
2. Learning Deep Binaural Representations With Deep Convolutional Neural Networks for Spontaneous Speech Emotion Recognition [J] . Zhang Shiqing, Chen Aihua, Guo Wenping, Quality Control, Transactions . 2020,第期

机译：学习深层卷积神经网络的深层双耳陈述，用于自发言论情绪识别
3. Deep convolutional neural network-based speech enhancement to improve speech intelligibility and quality for hearing-impaired listeners (Retraction of 2018) [J] . Rahiman P. F. Khaleelur, Jayanthi V. S., Jayanthi A. N. Medical and Biological Engineering and Computing: Journal of the International Federation for Medical and Biological Engineering . 2019,第3期

机译：基于深度卷积神经网络的语言增强，提高听力障碍听众的语音清晰度和质量（2018年撤回）
4. Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks [C] . Alfredo Zermini, Qiuqiang Kong, Yong Xu, International Conference on Latent Variable Analysis and Signal Separation . 2018

机译：使用时间上下文和卷积神经网络改善与双耳卷的混响言语分离
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition. [D] . Belilovsky, Eugene. 2011

机译：用于与说话人无关的语音识别的卷积神经网络。
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. Comparison between the statistical cues in BSS techniques and Binaural cues in CASA approaches for reverberant speech separation [O] . Alinaghi, A, Jackson, PJB, Wang, W 2013

机译：BSS技术中的统计线索与CASA方法中的混响语音分离中的双耳线索之间的比较

Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅