Analysis of multimodal sequences using geometric video representations

Monaci G; Escoda OD; Vandergheynst P

首页> 外文期刊>Signal processing >Analysis of multimodal sequences using geometric video representations

【24h】

Analysis of multimodal sequences using geometric video representations

机译：使用几何视频表示法分析多峰序列

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel method to correlate audio and visual data generated by the same physical phenomenon, based on sparse geometric representation of video sequences. The video signal is modeled as a sum of geometric primitives evolving through time, that jointly describe the geometric and motion content of the scene. The displacement through time of relevant visual features, like the mouth of a speaker, can thus be compared with the evolution of an audio feature to assess the correspondence between acoustic and visual signals. Experiments show that the proposed approach allows to localize and track the speaker's mouth when several persons are present on the scene, in presence of distracting motion, and without prior face or mouth detection. (c) 2006 Elsevier B.V. All rights reserved.

机译：本文基于视频序列的稀疏几何表示，提出了一种将相同物理现象产生的音频和视频数据进行关联的新方法。视频信号被建模为随时间演变的几何图元的总和，它们共同描述了场景的几何和运动内容。因此，可以将相关视觉特征（如扬声器的嘴巴）随时间的位移与音频特征的演变进行比较，以评估声音和视觉信号之间的对应关系。实验表明，所提出的方法可以在场景中有几个人存在时，分散注意力的情况下并且无需事先进行面部或嘴部检测的情况下定位并跟踪说话者的嘴巴。（c）2006 Elsevier B.V.保留所有权利。

著录项

来源
《Signal processing》 |2006年第12期|p. 3534-3548|共15页
作者
Monaci G; Escoda OD; Vandergheynst P;
展开▼
作者单位

Ecole Polytech Fed Lausanne, Signal Proc Inst, CH-1015 Lausanne, Switzerland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类通信;
关键词
multimodal data processing; audiovisual association; cross-modal localization; geometric video representation; sparse redundant decomposition; DICTIONARIES;

机译：多模态数据处理视听关联跨模态定位几何视频表示稀疏冗余分解词典;

相似文献

外文文献
中文文献
专利

1. Orthotetrahedral crystal structures M-y(TO4)(z) (T = Si, Ge, P, As, S, Se, Cl, Br, I): geometrical-topological analysis, quasi-binary representation, and comparison with the A(y)X(z) compounds by the method of coordination sequences [J] . Ilyushin GD, Blatov VA, Zakutkin YA Zeitschrift fur Kristallographie: International Journal for Structural, Physical, and Chemical Aspects of Crystalline Materials . 2004,第8期

机译：四面体晶体结构My（TO4）（z）（T = Si，Ge，P，As，S，Se，Cl，Br，I）：几何拓扑分析，准二元表示以及与A（y）的比较X（z）化合物的配位序列方法
2. Mosaic representations of video sequences based on slice image analysis [J] . Shaolei Feng, Hanqing Lu, Songde Ma Pattern recognition letters . 2002,第5期

机译：基于切片图像分析的视频序列的马赛克表示
3. Multimodal medical image registration via common representations learning and differentiable geometric constraints [J] . Liu Cong, Ma Longhua, Lu Zheming, Electronics Letters . 2019,第6期

机译：通过通用表示学习和可区分的几何约束进行多峰医学图像配准
4. Orthogonal Polyhedra in 3D Time-Color Space as a Geometric Model for Representation of Video Sequences with Low or Inexistent Redundancy between Frames [C] . Ricardo Perez-Aguila 4th European conference on colour in graphics, imaging, and vision (and 10th international symposium on multispectral colour science) . 2008

机译：3D时空空间中的正交多面体作为表示帧之间冗余度低或不存在的视频序列的几何模型
5. Content-based video analysis, indexing and representation using multimodal information. [D] . Li, Ying. 2003

机译：使用多模式信息进行基于内容的视频分析，索引和表示。
6. Women’s Empowerment Agency and Self-Determination in Afrobeats Music Videos: A Multimodal Critical Discourse Analysis [O] . Simphiwe Emmanuel Rens 2021

机译：妇女赋予Afrobeats音乐视频的赋权代理和自我决定：多模式关键话语分析
7. Analysis of multimodal sequences using geometric video representations [O] . Gianluca Monaci, Òscar Divorra Escoda, Pierre Vandergheynst 2005

机译：使用几何视频表示法分析多峰序列

Analysis of multimodal sequences using geometric video representations

摘要

著录项

相似文献

相关主题

期刊订阅