首页> 外文期刊>IEE Proceedings. Part K, Vision, image and signal processing >Content browsing and semantic context viewing through JPEG 2000-based scalable video summary
【24h】

Content browsing and semantic context viewing through JPEG 2000-based scalable video summary

机译:通过基于JPEG 2000的可伸缩视频摘要进行内容浏览和语义上下文查看

获取原文
获取原文并翻译 | 示例
       

摘要

The paper presents a novel method and software platform for remote and interactive browsing of a summary of long video sequences as well as revealing the semantic links between shots and scenes in their temporal context. The solution is based on interactive navigation in a scalable mega image resulting from a JPEG 2000 coded key-frame-based video summary. Each key-frame could represent an automatically detected shot, event or scene, which is then properly annotated using some semi-automatic tools or learning methods. The presented system is compliant with the new JPEG 2000 Part 9 'JPIP - JPEG 2000 interactivity, API and protocols,' which lends itself to working under varying transmission channel conditions such as GPRS or 3G wireless networks. While keeping the advantages of a single 2D video summary, like the limited storage cost, the flexibility offered by JPEG 2000 allows the application to highlight interactively key-frames corresponding to the desired content first within a low-quality and low-resolution version of the full video summary. It then offers fine grain scalability for a user to navigate and zoom into particular scenes or events represented by the key-frames. This possibility of visualising key-frames of interest and playing back the corresponding video shots within the context of the whole sequence (e.g. an episode of a media file) enables the user to understand the temporal relations between semantically related events/actions/physical settings, providing a new way to present and search for contents in video sequences.
机译:本文提出了一种新颖的方法和软件平台,用于对长视频序列的摘要进行远程和交互式浏览,并揭示镜头和场景在其时间上下文中的语义联系。该解决方案基于可伸缩的巨型图像中的交互式导航,该图像是基于JPEG 2000编码的关键帧的视频摘要而产生的。每个关键帧都可以代表一个自动检测的镜头,事件或场景,然后使用一些半自动工具或学习方法对其进行正确注释。提出的系统符合新的JPEG 2000第9部分“ JPIP-JPEG 2000交互性,API和协议”,从而使其能够在变化的传输信道条件下工作,例如GPRS或3G无线网络。在保持单个2D视频摘要的优点(如有限的存储成本)的同时,JPEG 2000提供的灵活性使应用程序可以在低质量和低分辨率版本的JPEG中首先突出显示与所需内容相对应的交互式关键帧。完整的视频摘要。然后,它为用户提供了细粒度的可伸缩性,以供用户导航和放大关键帧所代表的特定场景或事件。可视化感兴趣的关键帧并在整个序列(例如媒体文件的情节)的上下文中播放对应的视频镜头的这种可能性使用户能够理解语义相关事件/动作/物理设置之间的时间关系,提供了一种新的方式来呈现和搜索视频序列中的内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号