TIME-FREQUENCY MASKING AND DEEP NEURAL NETWORK-BASED SOUND SOURCE DIRECTION ESTIMATION METHOD
Abstract
A sound source direction estimation method and device based on time-frequency masking and a deep neural network, together with an electronic device and a storage medium, belonging to the field of computer technology. The method comprises: acquiring multi-channel sound signals (S110); performing framing, windowing and Fourier transformation on the sound signal of each channel, so as to form a short-time Fourier spectrum of the multi-channel sound signals (S120); performing an iterative operation on the short-time Fourier spectrum by means of a pre-trained neural network model, so as to calculate the ratio filters corresponding to the target signals in the multi-channel sound signals (S130); fusing the ratio filters into a single ratio filter (S140); and masking and weighting the multi-channel sound signals by means of the single ratio filter, so as to determine the direction of the target sound source (S150). The method and device are robust in environments with a low signal-to-noise ratio and strong reverberation, and improve the accuracy and stability of target sound source direction estimation.
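As a rough illustration of how steps S120 to S150 fit together, the following is a minimal two-microphone sketch in Python with NumPy. It is not the patented implementation, and several parts are assumptions not stated in the abstract: the pre-trained neural network of S130 is replaced by a hypothetical oracle ratio mask computed from known target and noise signals; one mask per channel is assumed; the masks are fused by an element-wise product; and the masking and weighting step is realized as a mask-weighted GCC-PHAT delay estimate. All function names and parameters are illustrative.

```python
import numpy as np

def stft(x, frame_len=512, hop=256):
    """S120: frame, Hann-window and Fourier-transform one channel -> (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] for i in range(n_frames)])
    return np.fft.rfft(frames * window, axis=1)

def oracle_ratio_mask(target_spec, noise_spec):
    """Stand-in for S130 (ASSUMPTION): an ideal ratio mask from known components.
    The method in the abstract estimates the ratio filters with a trained network."""
    t = np.abs(target_spec) ** 2
    n = np.abs(noise_spec) ** 2
    return t / (t + n + 1e-12)

def fuse_masks(masks):
    """S140 (ASSUMPTION): fuse per-channel masks by element-wise product."""
    return np.prod(masks, axis=0)

def masked_gcc_phat_delay(spec_a, spec_b, mask, frame_len, max_lag):
    """S150 (ASSUMPTION): mask-weighted GCC-PHAT.
    Returns the delay, in samples, of channel b relative to channel a."""
    cross = np.conj(spec_a) * spec_b
    cross /= np.abs(cross) + 1e-12           # PHAT: keep phase only
    weighted = (mask * cross).sum(axis=0)    # weight each T-F unit, sum over frames
    cc = np.fft.irfft(weighted, n=frame_len)
    lags = np.arange(-max_lag, max_lag + 1)
    return lags[np.argmax(cc[lags % frame_len])]  # negative lags wrap around

def delay_to_angle(delay_samples, fs, mic_distance, c=343.0):
    """Far-field angle in degrees from broadside for a two-microphone pair."""
    s = np.clip(c * delay_samples / fs / mic_distance, -1.0, 1.0)
    return np.degrees(np.arcsin(s))

def estimate_delay_demo(true_delay=5, seed=0, frame_len=512, hop=256):
    """Synthetic check: white-noise target, circularly delayed on channel 2,
    plus independent noise on each channel."""
    rng = np.random.default_rng(seed)
    target = rng.standard_normal(16000)
    delayed = np.roll(target, true_delay)
    noise = 0.5 * rng.standard_normal((2, 16000))
    clean = [stft(target, frame_len, hop), stft(delayed, frame_len, hop)]
    noise_spec = [stft(noise[0], frame_len, hop), stft(noise[1], frame_len, hop)]
    noisy = [c + n for c, n in zip(clean, noise_spec)]  # the STFT is linear
    masks = np.stack([oracle_ratio_mask(c, n) for c, n in zip(clean, noise_spec)])
    mask = fuse_masks(masks)
    return masked_gcc_phat_delay(noisy[0], noisy[1], mask, frame_len, max_lag=20)
```

Calling `estimate_delay_demo()` yields an inter-channel delay in samples, which `delay_to_angle(delay, fs, mic_distance)` converts to an angle under a far-field assumption. The fused mask gives more weight to time-frequency units dominated by the target, which is the general reason masking-based weighting helps under noise and reverberation.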