Journal of Advanced Computational Intelligence and Intelligent Informatics

Joint Audio-Visual Tracking Based on Dynamically Weighted Linear Combination of Probability State Density

Abstract

This paper proposes a method for stable, continuous speaker tracking that uses both visual and audio information, even when the input is interrupted by disturbances or occlusion caused by noise or varying illumination. With this method, the speaker's position is expressed as a likelihood distribution obtained by integrating the visual and audio information. First, the two modalities are integrated as a weighted linear combination of the probability density distributions estimated from the visual and audio observations. The weight is treated as a variable that varies in proportion to the maximum value of the probability density distribution obtained for each type of information. Next, a weighted linear combination of this fused result and the distribution from the previous time step is computed, and the outcome is taken as the likelihood distribution of the speaker's position. By changing the weights dynamically, the method can freely select or emphasize either type of information and thus maintain stable, continuous tracking even when the speaker momentarily cannot be detected due to occlusion, voice interruption, or noise. We conducted a series of speaker-tracking experiments using a circular microphone array and an omni-directional camera, and confirmed that speakers can be tracked stably and continuously despite occlusion or voice interruption.
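The fusion scheme described above can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: the function names, the discretized azimuth grid, and the temporal blending factor `alpha` are all assumptions made for the sketch. It shows the two ideas the abstract states: observation weights proportional to each density's peak, and a weighted linear combination with the previous likelihood for temporal continuity.

```python
import numpy as np

def fuse_densities(p_vis, p_aud, p_prev, alpha=0.5):
    """Hypothetical sketch of dynamically weighted audio-visual fusion.

    p_vis, p_aud : observation probability densities over a discretized
                   position grid (e.g. azimuth bins), one per modality.
    p_prev       : likelihood distribution from the previous time step.
    alpha        : blending factor between the new observation and the
                   past distribution (an assumed parameter).
    """
    # Weight each modality in proportion to the peak of its density:
    # a flat (uninformative) density gets little influence.
    w_vis = p_vis.max()
    w_aud = p_aud.max()
    total = w_vis + w_aud
    if total == 0:
        # Both cues lost (occlusion AND silence): keep the past estimate.
        return p_prev
    w_vis, w_aud = w_vis / total, w_aud / total

    # Weighted linear combination of the two observation densities.
    p_obs = w_vis * p_vis + w_aud * p_aud

    # Blend with the previous likelihood for stable, continuous tracking.
    p_new = alpha * p_obs + (1 - alpha) * p_prev
    return p_new / p_new.sum()  # renormalize to a probability distribution
```

If, say, the camera is occluded, `p_vis` becomes flat or zero, its peak (and hence its weight) shrinks, and the audio density plus the previous estimate carry the track until the visual cue returns.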