首页> 外文OA文献 >The use of the MPEG-7 AVDP profile in 3DTV audiovisual content description
【2h】

The use of the MPEG-7 AVDP profile in 3DTV audiovisual content description

机译:MPEG-7 AVDP配置文件在3DTV视听内容描述中的使用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A framework devised for the storage of metadata describing 3DTV content, derived from the application of several 3DTV media analysis tools such as shot/scene boundary detection, person detection/tracking/recognition, facial expression recognition, music/speech segmentation, speaker diarization and music genre/mood characterization, in an MPEG 7/AVDP compatible manner will be presented in this contribution. AVDP was designed by having mainly single channel videos in mind. Thus, in order to utilize it for the description of stereoscopic video and multichannel audio content, a number of implementation decisions, that cater to the particularities of such content (storage of stereoscopic quality information, relations between entities in the various channels etc) had to be taken and will be presented in this contribution. Examples of using AVDP to describe the results of analysis algorithms on stereo video and multichannel audio content will be presented. Additionally, several Classification Schemes used in the proposed framework will be discussed, since some terms may be useful in other applications. Finally, the contribution will include a discussion on possible extensions/modifications of the MPEG-7 standard or the AVDP profile to better cover the needs of stereoscopic and mutiview content description. The proposed framework was devised within 3DTVS (3DTV Content Search), a European FP7 project that aims at devising 3DTV audiovisual content analysis description, indexing, search and browsing methods and incorporating such functionalities in 3D audio-visual content archives.
机译:设计用于存储描述3DTV内容的元数据的框架,该框架源于几种3DTV媒体分析工具的应用,例如镜头/场景边界检测,人员检测/跟踪/识别,面部表情识别,音乐/语音分割,说话者区分和音乐本贡献将以MPEG 7 / AVDP兼容的方式呈现流派/情绪特征。 AVDP的设计主要考虑了单频道视频。因此,为了将其用于立体视频和多声道音频内容的描述,必须针对这些内容的特殊性(立体质量信息的存储,各个声道中实体之间的关系等)做出许多实施决策。并会在本文稿中介绍。将提供使用AVDP描述立体声视频和多通道音频内容的分析算法结果的示例。此外,由于某些术语在其他应用程序中可能有用,因此将讨论在提议的框架中使用的几种分类方案。最后,该文稿将包括对MPEG-7标准或AVDP配置文件的可能扩展/修改的讨论,以更好地满足立体和多视图内容描述的需求。拟议的框架是在欧洲FP7项目3DTVS(3DTV内容搜索)中设计的,该项目旨在设计3DTV视听内容分析描述,索引,搜索和浏览方法,并将此类功能纳入3D视听内容档案中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号