首页> 外国专利> System and method for efficiently finding near-similar images in massive databases

System and method for efficiently finding near-similar images in massive databases

机译:在海量数据库中有效查找近似图像的系统和方法

摘要

Massive amounts of multimedia data are stored in databases supporting web pages and servers, including text, graphics, video and audio. Searching and finding matching multimedia images can be time and computationally intensive. A method for storing and retrieving image data includes computing a descriptor, such an a Fourier-Mellin Transform (FMT), corresponding to a multidimensional space indicative of each of the stored images and organizing each of the descriptors according to a set similarity metric. The set similarity metric is based on Locality-Sensitive Hashing (LSH), and orders descriptors near to other descriptors in the database. The set similarity metric employs set theory which allows distance between descriptors to be computed consistent with LSH. A target image for which a match is sought is then received, and a descriptor indicative of the target image is computed. The database is referenced, or mapped, to determine close matches in the database. Mapping includes selecting a candidate match descriptor from among the descriptors in the database and employing a distance metric derived from the similarity metric to determine if the candidate match descriptor is a match to the target descriptor.
机译:大量的多媒体数据存储在支持网页和服务器的数据库中,包括文本,图形,视频和音频。搜索和找到匹配的多媒体图像可能是时间和计算密集型的。一种用于存储和检索图像数据的方法,包括:计算与诸如表示每个存储的图像的多维空间相对应的描述符,例如傅立叶-麦林变换(FMT),并根据一组相似性度量来组织每个描述符。设置的相似性度量基于局部敏感哈希(LSH),并对描述符进行排序,使其接近数据库中的其他描述符。集合相似性度量采用集合理论,该理论允许根据LSH来计算描述符之间的距离。然后接收为其寻求匹配的目标图像,并且计算指示该目标图像的描述符。引用或映射数据库以确定数据库中的紧密匹配项。映射包括从数据库中的描述符中选择候选匹配描述符,并采用从相似性度量得出的距离度量来确定候选匹配描述符是否与目标描述符匹配。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号