TIME-FREQUENCY MASKING AND DEEP NEURAL NETWORK-BASED SOUND SOURCE DIRECTION ESTIMATION METHOD

Abstract

A sound source direction estimation method and device based on time-frequency masking and a deep neural network, together with an electronic device and a storage medium, belonging to the field of computer technology. The method comprises: acquiring sound signals of multiple channels (S110); performing framing, windowing and Fourier transformation on the sound signal of each channel, so as to form short-time Fourier spectra of the multi-channel sound signals (S120); performing an iterative operation on the short-time Fourier spectra by means of a pre-trained neural network model, so as to calculate ratio filters corresponding to the target signals in the multi-channel sound signals (S130); fusing the plurality of ratio filters into a single ratio filter (S140); and masking and weighting the multi-channel sound signals by means of the single ratio filter, so as to determine the direction of a target sound source (S150). The method and device can be strongly robust in environments with a low signal-to-noise ratio and strong reverberation, and can improve the accuracy and stability of target sound source direction estimation.
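Steps S110 to S150 can be illustrated with a minimal sketch. It is not the patented implementation, and several pieces are assumptions that the abstract does not specify: a two-microphone far-field setup, a hypothetical `placeholder_mask` function standing in for the pre-trained neural network, fusion of the per-channel ratio filters by element-wise product, and a mask-weighted GCC-PHAT search as the masking-and-weighting step. All function and parameter names are illustrative.

```python
import numpy as np

def stft(x, frame_len=512, hop=256):
    """S120: frame, window (Hann) and Fourier-transform one channel."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)  # shape: (frames, frequency bins)

def placeholder_mask(spec, floor_db=-40.0):
    """S130 stand-in (assumption): NOT the trained network from the patent.
    Maps each time-frequency unit to [0, 1]; louder units get larger weights."""
    power_db = 20.0 * np.log10(np.abs(spec) + 1e-12)
    rel_db = power_db - power_db.max()          # always <= 0
    return np.clip(1.0 - rel_db / floor_db, 0.0, 1.0)

def fuse_masks(masks):
    """S140: fuse per-channel masks into one; the product rule is assumed."""
    return np.prod(np.stack(masks), axis=0)

def estimate_delay(x1, x2, fs, max_delay_s, frame_len=512, hop=256):
    """S150: mask-weighted GCC-PHAT estimate of the delay of x2 relative
    to x1, in seconds, found by a grid search over candidate delays."""
    s1 = stft(x1, frame_len, hop)
    s2 = stft(x2, frame_len, hop)
    mask = fuse_masks([placeholder_mask(s1), placeholder_mask(s2)])
    cross = s1 * np.conj(s2)
    phat = cross / (np.abs(cross) + 1e-12)      # keep phase only
    weighted = (mask * phat).sum(axis=0)        # pool over frames
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    candidates = np.linspace(-max_delay_s, max_delay_s, 201)
    steering = np.exp(-2j * np.pi * np.outer(candidates, freqs))
    scores = np.real(steering @ weighted)
    return candidates[np.argmax(scores)]

def delay_to_angle(delay_s, mic_distance_m, speed_of_sound=343.0):
    """Far-field conversion from inter-microphone delay to angle in degrees."""
    ratio = np.clip(delay_s * speed_of_sound / mic_distance_m, -1.0, 1.0)
    return np.degrees(np.arcsin(ratio))
```

To try it, pass two equal-length recordings, the sampling rate and the largest physically possible delay (microphone spacing divided by the speed of sound) to `estimate_delay`, then convert the result with `delay_to_angle`. Swapping `placeholder_mask` for a trained model's output is the point at which a real system would differ from this sketch.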

Bibliographic data

Publication number: WO2020042708A1
Publication date: 2020-03-05
Original format: PDF
Applicant/assignee: ELEVOC TECHNOLOGY CO. LTD.
Application number: CNCN2019/090531
Inventors: WANG ZHONGQIU; LI HAO
Filing date: 2019-06-10
Classification: G01S3/802
Country: WO
Indexed: 2022-08-21 11:13:15
