Multimodal Semantics Extraction from User-Generated Videos

Francesco Cricri; Kostadin Dabov; Mikko J. Roininen; Sujeet Mate; Igor D. D. Curcio; Moncef Gabbouj

Abstract

User-generated video content has grown so quickly that it now outpaces professional content creation. In this work we develop methods that analyze contextual information from multiple user-generated videos in order to obtain semantic information about the public happenings (e.g., sport and live music events) recorded in these videos. One of the key contributions of this work is the joint use of different data modalities, including data captured by auxiliary sensors while each user records video. In particular, we analyze GPS data, magnetometer data, accelerometer data, and video- and audio-content data. We use these data modalities to infer information about the event being recorded, in terms of layout (e.g., stadium), genre, indoor versus outdoor scene, and the main area of interest of the event. Furthermore, we propose a method that automatically identifies the optimal set of cameras to be used in a multicamera video production. Finally, we detect the camera users who fall within the field of view of other cameras recording at the same public happening. We show that the proposed multimodal analysis methods perform well on various recordings obtained at real sport events and live music performances.
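The abstract does not say how the main area of interest is computed, so the following is only an illustrative sketch of one way GPS positions and magnetometer headings from several cameras could be combined. It assumes each camera is reduced to a 2D position in local east/north metres plus a compass heading in degrees clockwise from north, and it returns the least-squares point closest to all viewing lines. The function name `estimate_area_of_interest` and the input layout are hypothetical, and this is not presented as the authors' method.

```python
import math


def estimate_area_of_interest(cameras):
    """Return the (east, north) point closest, in the least-squares sense,
    to the viewing lines of all cameras.

    cameras: iterable of (east_m, north_m, heading_deg) tuples, where
    heading_deg is a compass heading measured clockwise from north.
    This input format is an assumption made for the sketch.
    """
    a11 = a12 = a22 = 0.0  # accumulated 2x2 normal matrix A = sum(N_i)
    b1 = b2 = 0.0          # accumulated right-hand side b = sum(N_i @ p_i)

    for east, north, heading_deg in cameras:
        theta = math.radians(heading_deg)
        # Unit viewing direction in (east, north) coordinates.
        dx, dy = math.sin(theta), math.cos(theta)
        # N_i = I - d d^T projects onto the normal of the viewing line.
        n11 = 1.0 - dx * dx
        n12 = -dx * dy
        n22 = 1.0 - dy * dy
        a11 += n11
        a12 += n12
        a22 += n22
        b1 += n11 * east + n12 * north
        b2 += n12 * east + n22 * north

    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-9:
        # All viewing lines are (nearly) parallel: no unique solution.
        raise ValueError("viewing directions are (nearly) parallel")

    # Solve the symmetric 2x2 system A x = b with Cramer's rule.
    x = (b1 * a22 - a12 * b2) / det
    y = (a11 * b2 - a12 * b1) / det
    return x, y


if __name__ == "__main__":
    # Three cameras around a stage, all pointing roughly at its centre.
    demo = [(0.0, -10.0, 0.0), (-10.0, 0.0, 90.0), (10.0, -10.0, 315.0)]
    print(estimate_area_of_interest(demo))
```

The sketch treats each heading as an infinite line rather than a one-sided ray and weights all cameras equally; real sensor data would likely need outlier rejection for noisy compass readings and GPS drift.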

Bibliographic record

Source
Advances in Multimedia, 2012, no. 2012, pp. 292064.1-292064.17 (17 pages)
Authors
Francesco Cricri; Kostadin Dabov; Mikko J. Roininen; Sujeet Mate; Igor D. D. Curcio; Moncef Gabbouj
Author affiliations

Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

Nokia Research Center, P.O. Box 1000, 33721 Tampere, Finland;

Nokia Research Center, P.O. Box 1000, 33721 Tampere, Finland;

Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

Indexing information
Original format: PDF
Language of text: English
Date added to database: 2022-08-18 00:37:29

