首页> 外文期刊>IEEE transactions on multimedia >Audio–Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
【24h】

Audio–Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

机译:用于多扬声器跟踪的视听粒子流量SMC-PHD滤波

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Sequential Monte Carlo probability hypothesis density (SMC-PHD) filtering is a popular method used recently for audio-visual (AV) multi-speaker tracking. However, due to the weight degeneracy problem, the posterior distribution can be represented poorly by the estimated probability, when only a few particles are present around the peak of the likelihood density function. To address this issue, we propose a new framework where particle flow (PF) is used to migrate particles smoothly from the prior to the posterior probability density. We consider both zero and non-zero diffusion particle flows (ZPF/NPF), and developed two new algorithms, AV-ZPF-SMC-PHD and AV-NPF-SMC-PHD, where the speaker states from the previous frames are also considered for particle relocation. The proposed algorithms are compared systematically with several baseline tracking methods using the AV16.3, AVDIAR and CLEAR datasets, and are shown to offer improved tracking accuracy and average effective sample size (ESS).
机译:序贯蒙特卡罗概率假设密度(SMC-PHD)滤波是最近用于视听(AV)多扬声器跟踪的流行方法。然而,由于重量退化问题,后部分布可以通过估计的概率来表示,当围绕似然密度函数的峰值存在少数颗粒时,当仅存在少数颗粒时。为了解决这个问题,我们提出了一种新的框架,其中用于从后概率密度之前平滑地迁移颗粒流(PF)。我们考虑零和非零扩散粒子流量(ZPF / NPF),并开发了两个新的算法,AV-ZPF-SMC-PHD和AV-NPF-SMC-PHD,其中还考虑了前一帧的扬声器状态用于粒子搬迁。通过使用AV16.3,AVDIAR和CLEAR DataSets系统地系统地使用几个基线跟踪方法系统地进行系统地进行比较,并且被证明可以提高跟踪精度和平均有效样本大小(ESS)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号