...
首页> 外文期刊>Multimedia Tools and Applications >Scene pathfinder: unsupervised clustering techniques for movie scenes extraction
【24h】

Scene pathfinder: unsupervised clustering techniques for movie scenes extraction

机译:场景探路者:用于电影场景提取的无监督聚类技术

获取原文
获取原文并翻译 | 示例
           

摘要

The need for watching movies is in perpetual increase due to the widespread of the internet and the increasing popularity of the video on demand service. The important mass of movies stored in the Internet or in VOD servers need to be structured to accelerate the browsing operation. In this paper, we propose a new system called "The Scene Pathfinder" that aims at segmenting the movies into scenes to give users the opportunity to have a non- sequential access and to watch particular scenes of the movie. This helps them to judge quickly the movie and decide if they have to buy or to download it and avoiding waste of time and money. The proposed approach is multimodal. We use both of visual and auditory information to accomplish the segmentation. We base on the assumption that every movie scene is either action or non- action scene. Non-action scenes are generally characterized by static backgrounds and occur in the same place. For this reason, we base on the content information and on the Kohonen map to extract these kinds of scenes (shots agglomerations). Action scenes are characterized by high tempo and motion. For this reason, we base on tempo features and on the Fuzzy CMeans to classify shots and to localize the action zones. The two processes are complementary. Indeed, the over segmentation that may occur in the extraction of action scenes by basing on the content information is repaired by the Fuzzy clustering. Our system is tested on a varied database and obtained results show the merit of our approach and that our assumptions are well-founded.
机译:由于互联网的普及和视频点播服务的日益普及,观看电影的需求不断增长。需要对存储在Internet或VOD服务器中的电影的重要部分进行结构设计,以加速浏览操作。在本文中,我们提出了一个名为“场景探路者”的新系统,该系统旨在将电影分割成场景,以使用户有机会进行非顺序访问并观看电影的特定场景。这有助于他们快速判断电影并确定是否必须购买或下载电影,从而避免浪费时间和金钱。提议的方法是多模式的。我们使用视觉和听觉信息来完成分割。我们基于每个电影场景都是动作或非动作场景的假设。非动作场景通常以静态背景为特征,并在同一位置发生。因此,我们基于内容信息和Kohonen地图来提取此类场景(镜头聚集)。动作场景的特点是节奏快,动作快。因此,我们基于速度特征和模糊CMeans对镜头进行分类并定位动作区域。这两个过程是相辅相成的。实际上,通过模糊聚类可以修复基于内容信息提取动作场景时可能发生的过度分割。我们的系统在不同的数据库上进行了测试,获得的结果表明了我们方法的优点,并且我们的假设是有根据的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号