首页> 外文期刊>Advances in multimedia >Multimodal Semantics Extraction from User-Generated Videos
【24h】

Multimodal Semantics Extraction from User-Generated Videos

机译:从用户生成的视频中提取多模式语义

获取原文
获取原文并翻译 | 示例
       

摘要

User-generated video content has grown tremendously fast to the point of outpacing professional content creation. In this work we develop methods that analyze contextual information of multiple user-generated videos in order to obtain semantic information about public happenings (e.g., sport and live music events) being recorded in these videos. One of the key contributions of this work is a joint utilization of different data modalities, including such captured by auxiliary sensors during the video recording performed by each user. In particular, we analyze GPS data, magnetometer data, accelerometer data, video- and audio-content data. We use these data modalities to infer information about the event being recorded, in terms of layout (e.g., stadium), genre, indoor versus outdoor scene, and the main area of interest of the event. Furthermore we propose a method that automatically identifies the optimal set of cameras to be used in a multicamera video production. Finally, we detect the camera users which fall within the field of view of other cameras recording at the same public happening. We show that the proposed multimodal analysis methods perform well on various recordings obtained in real sport events and live music performances.
机译:用户生成的视频内容已迅速增长,超过了专业内容创建的速度。在这项工作中,我们开发了分析多个用户生成的视频的上下文信息的方法,以获取有关记录在这些视频中的公共事件(例如体育和现场音乐事件)的语义信息。这项工作的主要贡献之一是联合利用了不同的数据模式,包括在每个用户执行视频记录期间由辅助传感器捕获的数据。特别是,我们分析GPS数据,磁力计数据,加速度计数据,视频和音频内容数据。我们使用这些数据模式来推断有关正在记录的事件的信息,包括布局(例如体育场),类型,室内与室外场景以及事件的主要关注区域。此外,我们提出了一种方法,该方法可以自动识别将在多摄像机视频制作中使用的最佳摄像机集。最后,我们检测到属于同一场公开录制的其他摄像机视野范围内的摄像机用户。我们表明,提出的多峰分析方法在真实体育赛事和现场音乐表演中获得的各种录音效果良好。

著录项

  • 来源
    《Advances in multimedia》 |2012年第2012期|292064.1-292064.17|共17页
  • 作者单位

    Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

    Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

    Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

    Nokia Research Center, P.O. Box 1000, 33721 Tampere, Finland;

    Nokia Research Center, P.O. Box 1000, 33721 Tampere, Finland;

    Department of Signal Processing, Tampere University of Technology, P.O. Box 553, 33101 Tampere, Finland;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-18 00:37:29

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号