首页> 美国卫生研究院文献>Computational Intelligence and Neuroscience >Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection
【2h】

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection

机译:基于深度递归神经网络的自动编码器用于声音新颖性检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short term frame are predicted from the previous frames by means of Long-Short Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as activation signal to detect novel events. There is no evidence of studies focused on comparing previous efforts to automatically recognize novel events from audio signals and giving a broad and in depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this white spot in the literature and provide insight by extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches up to an absolute improvement of 16.4% average F-measure over the three databases.
机译:在声学新颖性检测的新兴领域,大多数研究工作都致力于概率方法,例如混合模型或状态空间模型。仅最近的研究介绍了以自动编码器形式使用递归神经网络进行声学新奇检测的(伪)生成模型。在这些方法中,借助于长期-短期记忆递归去噪自动编码器,从先前的帧中预测下一短期帧的听觉频谱特征。自动编码器的输入和输出之间的重构误差被用作激活信号以检测新事件。没有证据表明研究专注于比较以往从音频信号中自动识别新颖事件的努力,并对基于递归神经网络的自动编码器进行了广泛而深入的评估。本文稿旨在持续评估我们最近的新颖方法以填补文献中的这一空白,并通过在三个数据库(A3Novelty,PASCAL CHiME和PROMETHEUS)上进行的广泛评估来提供见解。除了提供对新颖方法和最新技术的广泛分析之外,本文还介绍了基于RNN的自动编码器如何比统计方法优于三个数据库上的平均F值达到16.4%的绝对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号