...
首页> 外文期刊>Multimedia, IEEE Transactions on >Interactive Multimodal Visual Search on Mobile Device
【24h】

Interactive Multimodal Visual Search on Mobile Device

机译:移动设备上的交互式多模式视觉搜索

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a novel multimodal interactive image search system on mobile devices. The system, the Joint search with ImaGe, Speech, And Word Plus (JIGSAW${+}$ ), takes full advantage of the multimodal input and natural user interactions of mobile devices. It is designed for users who already have pictures in their minds but have no precise descriptions or names to address them. By describing it using speech and then refining the recognized query by interactively composing a visual query using exemplary images, the user can easily find the desired images through a few natural multimodal interactions with his/her mobile device. Compared with our previous work JIGSAW, the algorithm has been significantly improved in three aspects: 1) segmentation-based image representation is adopted to remove the artificial block partitions; 2) relative position checking replaces the fixed position penalty; and 3) inverted index is constructed instead of brute force matching. The proposed JIGSAW${+}$ is able to achieve 5% gain in terms of search performance and is ten times faster.
机译:本文介绍了一种新颖的移动设备上的多模式交互式图像搜索系统。与ImaGe,语音和Word Plus(JIGSAW $ {+} $)联合搜索的系统充分利用了移动设备的多模式输入和自然的用户交互作用。它是为那些已经想到了图片但没有精确描述或名称的用户而设计的。通过使用语音进行描述,然后通过使用示例性图像交互组成视觉查询来完善识别的查询,用户可以通过与他/她的移动设备进行的几次自然多模式交互轻松找到所需的图像。与我们先前的工作JIGSAW相比,该算法在三个方面进行了显着改进:1)采用基于分割的图像表示来去除人工块分区; 2)相对位置检查代替固定位置罚款; 3)构造倒排索引而不是蛮力匹配。拟议中的JIGSAW $ {+} $可以在搜索性能方面实现5%的增长,并且快十倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号