首页> 外国专利> Representation and retrieval of images using content vectors derived from image information elements

Representation and retrieval of images using content vectors derived from image information elements

机译:使用从图像信息元素派生的内容向量表示和检索图像

摘要

Image features are generated by performing wavelet transformations at sample points on images stored in electronic form. Multiple wavelet transformations at a point are combined to form an image feature vector. A prototypical set of feature vectors, or atoms, is derived from the set of feature vectors to form an “atomic vocabulary.” The prototypical feature vectors are derived using a vector quantization method (e.g., using neural network self-organization techniques) in which a vector quantization network is also generated. The atomic vocabulary is used to define new images. Meaning is established between atoms in the atomic vocabulary. High-dimensional context vectors are assigned to each atom. The context vectors are then trained as a function of the proximity and co-occurrence of each atom to other atoms in the image. After training, the context vectors associated with the atoms that comprise an image are combined to form a summary vector for the image. Images are retrieved using a number of query methods (e.g., images, image portions, vocabulary atoms, index terms). The user's query is converted into a query context vector. A dot product is calculated between the query vector and the summary vectors to locate images having the closest meaning. The invention is also applicable to video or temporally related images, and can also be used in conjunction with other context vector data domains such as text or audio, thereby linking images to such data domains.
机译:通过在以电子形式存储的图像上的采样点执行小波变换,可以生成图像特征。一个点上的多个小波变换被组合以形成图像特征向量。从特征向量的集合中导出一组典型的特征向量或原子,以形成“原子词汇”。使用向量量化方法(例如,使用神经网络自组织技术)导出原型特征向量,其中还生成向量量化网络。原子词汇用于定义新图像。在原子词汇中的原子之间建立了意义。高维上下文向量被分配给每个原子。然后根据图像中每个原子与其他原子的接近度和共现度来训练上下文向量。训练后,与构成图像的原子关联的上下文向量被合并以形成图像的摘要向量。使用多种查询方法(例如,图像,图像部分,词汇原子,索引词)来检索图像。用户的查询将转换为查询上下文向量。在查询向量和摘要向量之间计算点积,以定位具有最接近含义的图像。本发明还适用于视频或时间相关的图像,并且还可以与诸如文本或音频的其他上下文向量数据域结合使用,从而将图像链接到这样的数据域。

著录项

  • 公开/公告号US6760714B1

    专利类型

  • 公开/公告日2004-07-06

    原文格式PDF

  • 申请/专利权人 FAIR ISSAC CORPORATION;

    申请/专利号US20000675867

  • 发明设计人 WILLIAM R. CAID;ROBERT HECHT-NEILSEN;

    申请日2000-09-29

  • 分类号G06F151/80;

  • 国家 US

  • 入库时间 2022-08-21 23:15:40

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号