首页> 外文OA文献 >Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environmentsudud

【2h】

Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environmentsudud

机译：在混响环境中利用深度神经网络和头部运动实现多源的鲁棒双耳定位 ud UD

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel machine-hearing system that exploits deep neural networks (DNNs) and head movements for robust binaural localisation of multiple sources in reverberant environments. DNNs are used to learn the relationship between the source azimuth and binaural cues, consisting of the complete cross-correlation function (CCF) and interaural level differences (ILDs). In contrast to many previous binaural hearing systems, the proposed approach is not restricted to localisation of sound sources in the frontal hemifield. Due to the similarity of binaural cues in the frontal and rear hemifields, front-back confusions often occur. To address this, a head movement strategy is incorporated in the localisation model to help reduce the front-back errors. The proposed DNN system is compared to a Gaussian mixture model (GMM) based system that employs interaural time differences (ITDs) and ILDs as localisation features. Our experiments show that the DNN is able to exploit information in the CCF that is not available in the ITD cue, which together with head movements substantially improves localisation accuracies under challenging acoustic scenarios in which multiple talkers and room reverberation are present.ud

机译：本文提出了一种新颖的机器听觉系统，该系统利用深度神经网络（DNN）和头部运动来在混响环境中对多个声源进行稳健的双耳定位。 DNN用于了解源方位角和双耳线索之间的关系，其中包括完整的互相关函数（CCF）和耳间电平差（ILD）。与许多以前的双耳听觉系统相反，所提出的方法不限于声波在额叶半场中的定位。由于在前半球和后半球中双耳提示的相似性，经常会发生前后混淆。为了解决这个问题，头部运动策略被纳入定位模型中，以帮助减少前后误差。将提出的DNN系统与基于高斯混合模型（GMM）的系统进行比较，该系统采用双耳时差（ITD）和ILD作为定位特征。我们的实验表明，DNN能够利用ITD提示中未提供的CCF中的信息，在存在多个讲话者和房间混响的挑战性声学场景下，头部运动可以大大提高定位精度。

著录项

作者
Ma N.; May T.; Brown G.J.;
展开▼
作者单位

展开▼
年度 100
总页数
原文格式 PDF
正文语种 en
中图分类

相似文献

外文文献
中文文献
专利

1. Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments [J] . Ning Ma, Tobias May, Guy J. Brown Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第12期

机译：利用深层神经网络和头部运动在混响环境中实现多种信号源的稳健双耳定位
2. Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks [J] . Ning Ma, Jose A. Gonzalez, Guy J. Brown Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2018,第11期

机译：结合频谱源模型和深层神经网络对目标声源进行稳健的双耳本地化
3. Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks [J] . Mariam Yiwere, Eun Joo Rhee International Journal of Applied Engineering Research . 2017,第22aPta5期

机译：深神经网络混响条件中声源的距离估计与定位
4. Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues [C] . May Tobias, Ma Ning, Brown Guy J. IEEE International Conference on Acoustics, Speech and Signal Processing . 2015

机译：利用头部运动和双条件提示的多条件训练对多个说话者进行稳健的定位
5. Deep Neural Network for Robust Multiple Object Tracking [D] . Chu, Peng. 2020

机译：用于鲁棒多对象跟踪的深神经网络
6. Multiple Source Localization in a Shallow Water Waveguide Exploiting Subarray Beamforming and Deep Neural Networks [O] . Zhaoqiong Huang, Ji Xu, Zaixiao Gong, 2019

机译：浅水波导中的多源定位利用子阵列波束形成和深层神经网络
7. Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localization of Multiple Sources in Reverberant Environments [O] . Ma Ning, May Tobias, Brown Guy J. 2017

机译：利用深度神经网络和头部运动实现混响环境中多源鲁棒双耳定位
8. Exploiting Hidden Layer Responses of Deep Neural Networks for Language Recognition. [R] . Li, R., Mallidi, S. H., Burget, L., 2016

机译：利用深层神经网络隐藏层响应进行语言识别。

Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environmentsudud

摘要

著录项

相似文献

相关主题

期刊订阅