首页> 外文会议>Audio Engineering Society international convention >A Machine Learning Approach to Detecting Sound-Source Elevation in Adverse Environments
【24h】

A Machine Learning Approach to Detecting Sound-Source Elevation in Adverse Environments

机译:一种在逆境中检测声源高程的机器学习方法

获取原文

摘要

Recent studies have shown that Deep Neural Networks (DNNs) are capable of detecting sound source azimuth direction in adverse environments to a high level of accuracy. This paper expands on these findings by presenting research which explores the use of DNNs in determining sound source elevation. A simple machine-learning system is presented which is capable of predicting source elevation to a relatively high degree of accuracy in both anechoic and reverberant environments. Speech signals spatialized across the front hemifield of the head arc used to train a feedforward neural network. The effectiveness of Gammatonc Filter Energies (GFEs) and the Cross-Correlation Function (CCF) in estimating elevation is investigated. Binaural cues such as Interaural Time Difference (ITD) and Interaural Level Difference (ILD) are also examined. Using a combination of these cues, it was found that source elevation to within 10° could be estimated to an accuracy of approximately 80% for both anechoic and reverberant environments.
机译:最近的研究表明,深度神经网络(DNN)能够以较高的精度检测不利环境中的声源方位方向。本文通过提出研究探索DNN在确定声源高程中的用途,扩大了这些发现。提出了一种简单的机器学习系统,该系统能够在无回声和混响环境中以相对较高的精度预测源高度。跨头部的前半场空间化的语音信号用于训练前馈神经网络。研究了伽玛通滤波能量(GFE)和互相关函数(CCF)在估计高程中的有效性。还检查了双耳提示,例如耳间时间差(ITD)和耳间水平差(ILD)。使用这些提示的组合,发现对于消声和混响环境,可以将源仰角提高到10°以内,其准确度约为80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号