首页> 外文会议>2018 IEEE Conference on Multimedia Information Processing and Retrieval >MMH: Multi-Modal Hash for Instant Mobile Video Search
【24h】

MMH: Multi-Modal Hash for Instant Mobile Video Search

机译:MMH:用于即时移动视频搜索的多模式哈希

获取原文
获取原文并翻译 | 示例

摘要

Mobile devices have been an indispensable part of human life, which enable people to search and browse what they want on the move. Mobile video search, as one of the most important services for users, still faces great challenges under mobile internet scenario, such as the limitation of computation ability, memory, and bandwidth. Therefore, this paper proposes a multi-modal hash based framework for instant mobile video search. In particular, we adopt a efficient deep convolutional neural network, MobileNet, with the hash layer to learn discriminative and compact visual features from videos. Moreover, we also consider hand-crafted local visual descriptor and audio fingerprint to build a multi-modal hash representation of videos. With the multi-modal hash code, two types of hash indexes are built on the server to achieve efficient video search. At last, the multi-modal hash codes are extracted on the mobile devices and transferred in a three- step progressive procedure during the online search stage. The experiments on the real-world dataset show that the proposed framework not only achieves the state-of-the-art accuracy but also obtains excellent efficiency.
机译:移动设备已成为人类生活中不可或缺的一部分,使人们能够搜索和浏览他们在旅途中想要的东西。移动视频搜索作为用户最重要的服务之一,在移动互联网环境下仍然面临着巨大的挑战,例如计算能力,内存和带宽的限制。因此,本文提出了一种基于多模式哈希的即时移动视频搜索框架。尤其是,我们采用具有哈希层的高效深度卷积神经网络MobileNet,以从视频中学习区分性和紧凑的视觉特征。此外,我们还考虑了手工制作的本地视觉描述符和音频指纹,以构建视频的多模式哈希表示。使用多模式哈希码,可以在服务器上构建两种类型的哈希索引,以实现有效的视频搜索。最后,在在线搜索阶段,多模式哈希码在移动设备上提取并以三步渐进过程进行传输。在真实数据集上的实验表明,提出的框架不仅达到了最新的准确性,而且获得了出色的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号