首页> 外文会议>International Conference on Computer Vision Theory and Applications >From text vocabularies to visual vocabularies what basis?
【24h】

From text vocabularies to visual vocabularies what basis?

机译:从文字词汇到视觉词汇有什么依据?

获取原文

摘要

The popular “bag-of-visual-words” approach for representing and searching visual documents consists in describing images (or video keyframes) using a set of descriptors, that correspond to quantized low-level features. Most of existing approaches for visual words are inspired from works in text indexing, based on the implicit assumption that visual words can be handled the same way as text words. More specifically, these techniques implicitly rely on the same postulate as in text information retrieval, stating that the words distribution for a natural language globally follows Zipf's law - that is to say, words from a natural language appear in a corpus with a frequency inversely proportional to their rank. However, our study shows that the visual words distribution depends on the choice of low-level features, and also especially on the choice of the clustering method. We also show that when the visual words distribution is close to this of text words, the results of an image retrieval system are increased. To the best of our knowledge, no prior study has yet been carried out to compare the distributions of text words and visual words, with the objective of establishing the theoretical foundations of visual vocabularies.
机译:用于表示和搜索视觉文档的流行“视觉词袋”方法包括使用一组描述符描述图像(或视频关键帧),这些描述符对应于量化的低级特征。现有的大多数视觉单词方法都是基于文本索引中的工作启发而来的,这是基于隐含的假设,即视觉单词可以与文本单词以相同的方式进行处理。更具体地说,这些技术隐含地依赖与文本信息检索中相同的假设,指出自然语言的单词分布在全球范围内遵循齐普夫定律-也就是说,自然语言中的单词出现在语料库中的频率成反比达到他们的等级。然而,我们的研究表明,视觉单词的分布取决于底层特征的选择,尤其取决于聚类方法的选择。我们还表明,当视觉单词分布与文本单词的分布接近时,图像检索系统的结果会增加。据我们所知,尚未进行过任何比较研究文字词和视觉词的分布的研究,目的是建立视觉词汇的理论基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号