首页> 外文会议>Iinternational conference on industrial, engineering and other applications of applied intelligence systems >Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification
【24h】

Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification

机译:使用两阶段深度神经网络进行降噪和声源识别训练可降低计算成本

获取原文

摘要

This paper addresses reduction of computational cost in training of a Deep Neural Network (DNN), in particular, for sound identification using highly noise-contaminated sound recorded with a microphone array embedded in an Unmanned Aerial Vehicle (UAV), aiming at people's voice detection quickly and widely in a disastrous situation. It is known that a DNN training method called end-to-end training shows high performance, since it uses a huge neural network with high non-linearity which is trained with a large amount of raw input signals without preprocessing. Its computational cost is, however, expensive due to the high complexity of the neural network. Therefore, we propose two-stage DNN training using two separately-trained networks; denoising of sound sources and sound source identification. Since the huge network is divided into two smaller networks, the complexity of the networks is expected to decrease and each of them can consider a specific model of denoising and identification. This results in faster convergence and computational cost reduction in DNN training. Preliminary results showed that only 71 % of training time was necessary with the proposed two staged network, while maintaining the accuracy of sound source identification, compared to end-to-end training using noisy acoustic signals recorded with an 8 ch circular microphone array embedded in a UAV.
机译:本文致力于降低训练深度神经网络(DNN)时的计算成本,尤其是针对使用高度噪声污染的声音进行声音识别的情况,这种声音被嵌入到无人飞行器(UAV)中的麦克风阵列记录下来,旨在实现人们的语音检测在灾难性的情况下迅速而广泛地发展。众所周知,一种称为端到端训练的DNN训练方法具有很高的性能,因为它使用了具有高非线性度的庞大神经网络,该网络通过大量原始输入信号进行训练而无需进行预处理。但是,由于神经网络的高度复杂性,其计算成本很高。因此,我们建议使用两个单独训练的网络进行两阶段的DNN训练;声源降噪和声源识别。由于将庞大的网络分为两个较小的网络,因此网络的复杂度有望降低,并且每个网络都可以考虑一种特定的降噪和识别模型。这样可以加快DNN训练的收敛速度并降低计算成本。初步结果表明,与使用嵌入8通道圆形麦克风阵列记录的嘈杂声信号进行的端到端训练相比,拟议的两阶段网络在保持声源识别准确性的同时,仅需要71%的训练时间。无人机。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号