Multiple Speaker Localization in a Smart Room

Abstract

In recent years, interest in intelligent systems has grown. Human-machine interaction and the automatic analysis of meetings in smart rooms are emerging research fields. One of the most important tasks in a smart room is multi-speaker localization, which enables a wide range of applications. In this paper, we propose a new acoustic multi-speaker localization function, which we call the OPROD-PHAT function, that combines the hyperbolae produced by time delay estimation (TDE) between several microphone pairs with head orientation information. We implement a grid-based multiple-speaker localization method. For estimating the locations of multiple moving speakers, we propose a new approach that uses the power of the cross-correlation function to find the number of active sources in each time frame. After the loudest source is found by maximizing the energy of a steered beamformer, the remaining sources are localized by removing the contribution of the first source and repeating the process. Simulation results show the superior performance of the proposed system.
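
To make the grid-based idea concrete, the sketch below scores each candidate grid point by summing PHAT-weighted cross-correlations over all microphone pairs at the delay that point would produce. This is a generic SRP-PHAT-style search, not the OPROD-PHAT function itself. It does not model head orientation, source counting, or the removal of an already-located source. The names `gcc_phat` and `srp_phat_grid`, the speed of sound `C`, and the 2-D geometry are assumptions made for illustration.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air, m/s


def gcc_phat(x, y, n):
    """PHAT-weighted cross-correlation of x and y, zero-padded to length n.

    Index k of the result holds lag k (mod n). The peak sits at the delay
    of y relative to x, in samples.
    """
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12  # PHAT: keep the phase, drop the magnitude
    return np.fft.irfft(cross, n=n)


def srp_phat_grid(signals, mics, grid, fs):
    """Return the grid point with the highest summed GCC-PHAT score.

    signals: (num_mics, num_samples) array, one row per microphone
    mics:    (num_mics, dim) microphone positions in metres
    grid:    (num_points, dim) candidate source positions in metres
    fs:      sampling rate in Hz
    """
    n = 2 * signals.shape[1]  # zero-pad to limit circular wrap-around
    score = np.zeros(len(grid))
    for i in range(len(mics)):
        for j in range(i + 1, len(mics)):
            cc = gcc_phat(signals[i], signals[j], n)
            # Delay of mic j relative to mic i expected for every grid point.
            # Points with equal delay for one pair lie on a hyperbola.
            tdoa = (np.linalg.norm(grid - mics[j], axis=1)
                    - np.linalg.norm(grid - mics[i], axis=1)) / C
            lags = np.rint(tdoa * fs).astype(int)
            score += cc[lags % n]  # negative lags wrap to the end of cc
    return grid[np.argmax(score)], score
```

Calling `srp_phat_grid(signals, mics, grid, fs)` returns the best-scoring grid point and the score of every candidate. A multi-speaker variant along the lines the abstract describes would locate the strongest source first, suppress its contribution, and search again.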