Journal: Consumer Electronics, IEEE Transactions on

Speaker selection and tracking in a cluttered environment with audio and visual information

Abstract

Presented in this paper is a data association method using audio and visual data which localizes targets in a cluttered environment and detects who is speaking to a robot. A particle filter is applied to efficiently select the optimal association between the target and the measurements. State variables are composed of target positions and speaking states. To update the speaking state, we first evaluate the incoming sound signal based on cross-correlation and then calculate a likelihood from the audio information. The visual measurement is used to find an optimal association between the target and the observed objects. The number of targets that the robot should interact with is updated from the existence probabilities and associations. Experimental data were collected beforehand and simulated on a computer to verify the performance of the proposed method applied to the speaker selection problem in a cluttered environment. The algorithm was also implemented in a robotic system to demonstrate reliable interactions between the robot and speaking targets.
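
The audio step described in the abstract (cross-correlation of the incoming sound, then a likelihood computed from it) can be illustrated with a minimal sketch. This is not the authors' implementation: the two-microphone far-field geometry, the Gaussian likelihood, the speed-of-sound constant, and every function and parameter name below are assumptions made for illustration only.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def estimate_delay(sig_a, sig_b, fs):
    """Delay of sig_b relative to sig_a, in seconds, from the cross-correlation peak."""
    corr = np.correlate(sig_b, sig_a, mode="full")
    lag = int(np.argmax(corr)) - (len(sig_a) - 1)  # lag in samples
    return lag / fs


def delay_to_bearing(delay, mic_distance):
    """Far-field bearing in radians for a two-microphone pair (assumed geometry)."""
    ratio = np.clip(delay * SPEED_OF_SOUND / mic_distance, -1.0, 1.0)
    return float(np.arcsin(ratio))


def speaking_likelihood(target_bearings, sound_bearing, sigma=0.1):
    """Unnormalized Gaussian likelihood that each target is the sound source (assumed model)."""
    diff = np.asarray(target_bearings, dtype=float) - sound_bearing
    return np.exp(-0.5 * (diff / sigma) ** 2)


# Synthetic check: the second channel is the first one delayed by 5 samples.
rng = np.random.default_rng(0)
fs = 16000
sig_a = rng.standard_normal(1024)
sig_b = np.concatenate([np.zeros(5), sig_a[:-5]])

delay = estimate_delay(sig_a, sig_b, fs)
bearing = delay_to_bearing(delay, mic_distance=0.2)
# One hypothetical target sits at the estimated bearing, another far from it.
likelihoods = speaking_likelihood([bearing, -0.5], bearing)
```

In a particle filter of the kind the abstract describes, likelihood values like these would plausibly be used to weight each particle's speaking-state hypothesis. How the paper actually combines them with the visual association is not stated in the abstract.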
