Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification

机译：使用两阶段深度神经网络进行降噪和声源识别训练可降低计算成本

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses reduction of computational cost in training of a Deep Neural Network (DNN), in particular, for sound identification using highly noise-contaminated sound recorded with a microphone array embedded in an Unmanned Aerial Vehicle (UAV), aiming at people's voice detection quickly and widely in a disastrous situation. It is known that a DNN training method called end-to-end training shows high performance, since it uses a huge neural network with high non-linearity which is trained with a large amount of raw input signals without preprocessing. Its computational cost is, however, expensive due to the high complexity of the neural network. Therefore, we propose two-stage DNN training using two separately-trained networks; denoising of sound sources and sound source identification. Since the huge network is divided into two smaller networks, the complexity of the networks is expected to decrease and each of them can consider a specific model of denoising and identification. This results in faster convergence and computational cost reduction in DNN training. Preliminary results showed that only 71 % of training time was necessary with the proposed two staged network, while maintaining the accuracy of sound source identification, compared to end-to-end training using noisy acoustic signals recorded with an 8 ch circular microphone array embedded in a UAV.

机译：本文致力于降低训练深度神经网络（DNN）时的计算成本，尤其是针对使用高度噪声污染的声音进行声音识别的情况，这种声音被嵌入到无人飞行器（UAV）中的麦克风阵列记录下来，旨在实现人们的语音检测在灾难性的情况下迅速而广泛地发展。众所周知，一种称为端到端训练的DNN训练方法具有很高的性能，因为它使用了具有高非线性度的庞大神经网络，该网络通过大量原始输入信号进行训练而无需进行预处理。但是，由于神经网络的高度复杂性，其计算成本很高。因此，我们建议使用两个单独训练的网络进行两阶段的DNN训练;声源降噪和声源识别。由于将庞大的网络分为两个较小的网络，因此网络的复杂度有望降低，并且每个网络都可以考虑一种特定的降噪和识别模型。这样可以加快DNN训练的收敛速度并降低计算成本。初步结果表明，与使用嵌入8通道圆形麦克风阵列记录的嘈杂声信号进行的端到端训练相比，拟议的两阶段网络在保持声源识别准确性的同时，仅需要71％的训练时间。无人机。

著录项

来源
《Iinternational conference on industrial, engineering and other applications of applied intelligence systems》|2016年|562-573|共12页
会议地点
作者
Takayuki Morito; Osamu Sugiyama; Satoshi Uemura; Ryosuke Kojima; Kazuhiro Nakadai;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Environment understanding; Deep learning; Sound source identification;

机译：对环境的了解;深度学习;声源识别;
入库时间 2022-08-26 13:46:59

相似文献

外文文献
中文文献
专利

1. Robust Binaural Localization of a Target Sound Source by Combining Spectral Source Models and Deep Neural Networks [J] . Ning Ma, Jose A. Gonzalez, Guy J. Brown Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2018,第11期

机译：结合频谱源模型和深层神经网络对目标声源进行稳健的双耳本地化
2. Distance Estimation and Localization of Sound Sources in Reverberant Conditions using Deep Neural Networks [J] . Mariam Yiwere, Eun Joo Rhee International Journal of Applied Engineering Research . 2017,第22aPta5期

机译：深神经网络混响条件中声源的距离估计与定位
3. Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks [J] . Emad M. Grais, Gerard Roma, Andrew J. R. Simpson, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第9期

机译：使用深度神经网络的两阶段单通道音频源分离
4. Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification [C] . Takayuki Morito, Osamu Sugiyama, Satoshi Uemura, International Conference on Industrial Engineering and Other Applications of Applied Intelligence . 2016

机译：使用两阶段深神经网络减少计算成本进行去噪和声源识别
5. Comparison of Search Algorithms in Two-Stage Neural Network Training for Optical Character Recognition of Handwritten Digits [D] . Gilley, Patrik Wayne. 2020

机译：两级神经网络训练中搜索算法的比较，用于手写数字的光学字符识别
6. On the Reduction of Computational Complexity of Deep Convolutional Neural Networks [O] . Partha Maji, Robert Mullins 2018

机译：关于深卷积神经网络计算复杂性的降低
7. A Two-Stage Subspace Trust Region Approach for Deep Neural Network Training [O] . Dudar, Viacheslav, Chierchia, Giovanni, Chouzenoux, Emilie, 2017

机译：深度神经网络训练的两阶段子空间信任区域方法

Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification

摘要

著录项

相似文献

相关主题

期刊订阅