Multichannel speech reinforcement based on binaural unmasking

Junhyeong Pak; Inyong Choi; Yu Gwang Jin; Jong Won Shin

首页> 外文期刊>Signal processing >Multichannel speech reinforcement based on binaural unmasking

【24h】

Multichannel speech reinforcement based on binaural unmasking

机译：基于双耳解掩的多通道语音增强

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforce- ment methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.

机译：语音增强或近端聆听增强是一种通常根据远端信号和近端噪声的功率谱来修改远端信号以减轻近端噪声影响的技术。心理声学实验表明，噪声源相对于信号源的位置会影响掩膜的数量。由于常规的语音增强方法仅基于功率谱获得频谱增益，因此在这些方法中未考虑这种称为双耳解掩的心理声学现象。在本文中，我们提出了一种新颖的语音增强算法，该算法基于功率谱和噪声的到达方向（DoA）来修改远端语音信号。具体来说，我们已经从观察到的噪声水平和估计的DoA计算了等效的正面噪声水平，并像传统的基于部分响度恢复的语音增强一样，将其用于计算频谱增益。实验结果表明，通过主观听觉测试，该方法优于基于部分响度恢复和语音清晰度指数（SII）优化的常规方法。

著录项

来源
《Signal processing》 |2017年第10期|165-172|共8页
作者
Junhyeong Pak; Inyong Choi; Yu Gwang Jin; Jong Won Shin;
展开▼
作者单位

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea;

Department of Communication Sciences and Disorders, University of Iowa, Iowa City, ÌA 52242, USA;

Corporate R&D Center, SK Telecom Co., Ltd., Seoul 04539, Korea;

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech reinforcement; Partial masking effect; Loudness perception; Binaural unmasking; Direction-of-arrival;

机译：言语增强;部分遮盖效果;响度感知;双耳掩盖;到达方向;

相似文献

外文文献
中文文献
专利

1. Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage [J] . Christopher F. Hauth, Simon C. Berning, Birger Kollmeier, Trends in Hearing . 2020,第1期

机译：使用盲双个加工阶段建模双耳揭露语音
2. Binaural speech unmasking and localization in noise with bilateral cochlear implants using envelope and fine-timing based strategies [J] . van Hoesel R, Bohm M, Pesch J, The Journal of the Acoustical Society of America . 2008,第4期

机译：双侧耳蜗植入物的双耳语音解掩和噪声定位，采用基于包络和精细定时的策略
3. Speech intelligibility improvements with hearing aids using bilateral and binaural adaptive multichannel Wiener filtering based noise reduction a [J] . Bram Cornelis, Marc Moonen, Jan Wouters The Journal of the Acoustical Society of America . 2012,第6期

机译：使用双边和双耳自适应多通道维纳滤波的降噪技术，借助助听器提高语音清晰度
4. Binaural loudness based speech reinforcement with a closed-form solution [C] . Shin, Ho Seon, Choi, Min-Seok, Kim, Taesu, IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 . 2010

机译：基于双耳响度的语音增强和封闭式解决方案
5. Robust Recognition of Binaural Speech Signals Using Techniques Based on Human Auditory Processing [D] . Menon, Anjali I. 2019

机译：基于人类听觉处理技术的双耳语音信号的稳健识别
6. Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage [O] . Christopher F. Hauth, Simon C. Berning, Birger Kollmeier, 2020

机译：使用盲双个加工阶段建模双耳揭露语音
7. Speech intelligibility prediction in reverberation: Towards an integrated model of speech transmission, spatial unmasking, and binaural de-reverberation [O] . Leclère Thibaud, Lavandier Mathieu, Culling John F. 2015

机译：混响中的语音清晰度预测：建立语音传输，空间解掩和双耳混响的集成模型

Multichannel speech reinforcement based on binaural unmasking

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅