首页> 外文会议>IEEE International Conference on Data Engineering Workshop >Finding Important People in Large News Video Databases Using Multimodal and Clustering Analysis
【24h】

Finding Important People in Large News Video Databases Using Multimodal and Clustering Analysis

机译:使用多媒体和聚类分析在大型新闻视频数据库中找到重要人物

获取原文

摘要

The wide availability of large scale databases requires more efficient and scalable tools for data understanding and knowledge discovery. In this paper, we present a method to find important people who have appeared repeatedly in a certain time period from large news video databases. Specifically, we investigate two issues: how to group similar faces to find dominant groups and how to label these groups by the corresponding names for identification. These are challenging problems because firstly people can appear with large appearance variations such as hair styles, illumination conditions and poses that make comparing between similar faces more difficu secondly, the number of people and their occurrence frequencies that are unknown make finding dominant and useful groups more complicated; and finally, the fact that in news video faces and names usually do not appear together can make troubles in aligning faces and names. To handle above problems, we propose using the relevant set correlation based clustering model which can efficiently handle dataset of millions of objects represented in thousands or even millions of dimensions to find groups of similar faces from the large and noisy face dataset. Then in order to identify faces in clusters, names extracted from the transcripts are filtered and used to find the best correspondences by using methods developed in the statistical machine translation literature. Experiments on large video datasets containing hundreds of hours showed that our system can efficiently find out important people by not only their appearance but also their identification.
机译:大规模数据库的广泛可用性需要更高效和可扩展的工具,用于数据理解和知识发现。在本文中,我们提出了一种方法来找到在大型新闻视频数据库的一定时间内反复出现的重要人物。具体来说,我们调查了两个问题:如何将类似的面部分组找到主导群体以及如何通过相应的名称来标识这些组以进行识别。这些都是挑战性问题,因为首先人们可以出现大的外观变化,如发型,照明条件和姿势,使得类似面孔更加困难;其次,人数及其发生频率未知,发现占主导地位和有用的群体更复杂;最后,在新闻视频面和名称中通常不会出现在一起的事实可以在对齐面和名称方面发出麻烦。为了处理上述问题,我们建议使用基于相关的集合相关的群集模型,可以有效地处理数百万甚至数百万尺寸以数百万个对象的数据集,以找到来自大型和嘈杂的面部数据集的类似面孔。然后为了识别集群中的面,从转录物中提取的名称被过滤并用于通过使用统计机器翻译文献中开发的方法来找到最佳的对应关系。含有数百小时的大型视频数据集的实验表明,我们的系统可以通过它们的外观有效地找到重要人物,而且还可以看出重要人物,也可以识别它们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号