首页> 外文会议>2010 IEEE Conference on Computer Vision and Pattern Recognition >Reading between the lines: Object localization using implicit cues from image tags
【24h】

Reading between the lines: Object localization using implicit cues from image tags

机译:读线之间:使用来自图像标签的隐式线索进行对象定位

获取原文

摘要

Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image''s tags — the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline.
机译:标记图像的当前使用通常仅利用最明确的信息:命名的名词与图像中某处存在的对象之间的链接。我们建议利用存在于图像标签的有序列表中的“无声”提示,以改善对象定位。我们从图像的标签中定义了三个新颖的隐式特征:每个对象的相对突出度(由其提及顺序表示),未命名对象所隐含的比例约束以及列表上名称的接近性暗示了松散的空间联系。通过根据给定的提示在定位参数(位置和比例)上学习条件密度,我们展示了如何在检测标记的对象时提高准确性和效率。我们使用PASCAL VOC和LabelMe数据集中的25个对象类别验证了我们的方法,并证明了其相对于传统滑动窗口以及视觉上下文基线的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号