Microphone Array-Based Sound Source Localization Using Convolutional Residual Network

Ziyi Wang; Xiaoyan Zhao; Hongjun Rong; Ying Tong; Jingang Shi


Abstract

Microphone array-based sound source localization (SSL) is widely used in applications such as video conferencing, robotic hearing, speech enhancement, and speech recognition. Traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments. To improve localization performance, a novel SSL algorithm using a convolutional residual network (CRN) is proposed in this paper. The spatial features, including the time differences of arrival (TDOAs) between microphone pairs and the steered response power-phase transform (SRP-PHAT) spatial spectrum, are extracted in each Gammatone sub-band. The spatial features of the different sub-bands within a frame are combined into a feature matrix that serves as the input to the CRN. The proposed algorithm employs the CRN to fuse the spatial features. Because the CRN adds a residual structure to the convolutional network, it reduces the difficulty of training and accelerates the convergence of the model. A CRN model is learned from training data in various reverberation and noise environments to establish the mapping between the input features and the sound azimuth. Simulation results show that, compared with methods using a traditional deep neural network, the proposed algorithm achieves better localization performance in the SSL task and generalizes better to untrained noise and reverberation conditions.
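To make the TDOA feature mentioned in the abstract more concrete, the following is a minimal sketch of a PHAT-weighted generalized cross-correlation delay estimate for one microphone pair. It is not taken from the paper: the function name `gcc_phat_tdoa`, its parameters, the sampling rate, and the synthetic test signal are all assumptions made for illustration, and the sketch works on the full-band signal rather than on each Gammatone sub-band as the abstract describes.

```python
import numpy as np


def gcc_phat_tdoa(x1, x2, fs, max_tau=None):
    """Estimate the delay of x2 relative to x1 in seconds (hypothetical helper).

    A positive result means x2 lags behind x1.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(x1) + len(x2)
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)

    # Cross-power spectrum with PHAT weighting: keep the phase, drop the magnitude.
    cross = X2 * np.conj(X1)
    cross /= np.abs(cross) + 1e-12
    cc = np.fft.irfft(cross, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(cc)) - max_shift
    return lag / fs


# Synthetic example (assumed values): x2 is x1 delayed by 5 samples.
fs = 16000
delay_samples = 5
rng = np.random.default_rng(0)
x1 = rng.standard_normal(1024)
x2 = np.concatenate((np.zeros(delay_samples), x1[:-delay_samples]))

tau = gcc_phat_tdoa(x1, x2, fs, max_tau=0.001)
print("estimated delay in samples:", round(tau * fs))
```

In a pipeline like the one the abstract outlines, an estimate of this kind would presumably be computed per sub-band and per microphone pair, then stacked with the SRP-PHAT spectrum into the feature matrix. How the authors actually do this is not specified here.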

Bibliographic details

Source
《新媒体杂志（英文）》 (Journal of New Media) | 2022, No. 3 | pp. 145-153 | 9 pages
Authors
Ziyi Wang; Xiaoyan Zhao; Hongjun Rong; Ying Tong; Jingang Shi
Author affiliations

School of Information and Communication Engineering Institute of Technology;

University of Oulu;

Original format: PDF
Text language: chi
CLC classification: Computing technology, computer technology;
Keywords
Convolutional residual network; microphone array; spatial features; sound source localization;
