首页> 外文学位 >A CUDA Based Parallel Decoding Algorithm for the Leech Lattice Locality Sensitive Hash Family.
【24h】

A CUDA Based Parallel Decoding Algorithm for the Leech Lattice Locality Sensitive Hash Family.

机译:一种基于CUDA的水Lee格局部敏感哈希家族的并行解码算法。

获取原文
获取原文并翻译 | 示例

摘要

Nearest neighbor search is a fundamental requirement of many machine learning algorithms and is essential to fuzzy information retrieval. The utility of efficient database search and construction has broad utility in a variety of computing fields. Applications such as coding theory and compression for electronic communication systems as well as use in artificial intelligence for pattern and object recognition. In this thesis, a particular subset of nearest neighbors is consider, referred to as c-approximate k-nearest neighbors search. This particular variation relaxes the constraints of exact nearest neighbors by introducing a probability of finding the correct nearest neighbor c, which offers considerable advantages to the computational complexity of the search algorithm and the database overhead requirements. Furthermore, it extends the original nearest neighbors algorithm by returning a set of k candidate nearest neighbors, from which expert or exact distance calculations can be considered. Furthermore thesis extends the implementation of c-approximate k-nearest neighbors search so that it is able to utilize the burgeoning GPGPU computing field. The specific form of c-approximate k-nearest neighbors search implemented is based on the locality sensitive hash search from the E2LSH package of Indyk and Andoni [1]. In this paper, the authors utilize the exceptional properties of the Leech Lattice [2], as a subspace quantizer for the locality sensitive hash families. The Leech Lattice is unique in that it provides the closest lattice packing of equal sized spheres in 24 dimensional space. In addition, advances from coding theory provide a very favorable decoding algorithm for finding the nearest lattice center to a query point in euclidean 24 dimensional space [3] [4]. The multilevel construction of the Leech Lattice provides an excellent opportunity for parallelization as it contains the minimization of many independent sub-lattice decodings resulting from the lattices exceptional symmetry among lattices. These decodings are additionally highly floating point computationally intensive, and because of which suggest a favorable implementation on GPGPU architectures such as NVIDIAs CUDA based framework. Furthermore, the overall construction of a locality sensitive hash based, nearest neighbors search algorithm, is able to be parallelized fairly efficiently as the hash decodings are completely independent of one another. The goal of this thesis is to present a CUDA optimized parallel implementation of a bounded distance Leech Lattice decoder [4] for use in query optimized c-approximate k-nearest neighbors using the locality sensitive hash framework of E2LSH. The system will be applied to the approximate image retrieval of SIFT transformed [5] image vectors.;[1] A. Andoni, Nearest Neighbor Search: the Old, the New, and the Impossible. PhD thesis, MASSACHUSETTS INSTITUTE OF TECHNOLOGY, September 2009. [2] J. Leech, "Notes on sphere packings," Canadian Journal of Mathematics, 1967. [3] J. Forney, G. D., "A bounded-distance decoding algorithm for the leech lattice, with generalizations," Information Theory, IEEE Transactions on, vol. 35, pp. 906–909, Jul 1989. [4] O. Amrani and Y. Beery, "Efficient bounded-distance decoding of the hexacode and associated decoders for the leech lattice and the golay code," Communications, IEEE Transactions on, vol. 44, pp. 534–537, May 1996. [5] D. G. Lowe, "Object recognition from local scale-invariant features," in Computer Vision, 1999. The Proceedings of the Seventh IEEE International Conference on, vol. 2, pp. 1150–1157, 1999.
机译:最近邻居搜索是许多机器学习算法的基本要求,并且对于模糊信息检索至关重要。有效的数据库搜索和构建的实用性在各种计算领域中具有广泛的实用性。电子通信系统的编码理论和压缩等应用以及在人工智能中用于模式和对象识别的应用。在本文中,考虑了最近邻居的特定子集,称为c近似k最近邻居搜索。通过引入找到正确的最近邻居c的概率,该特定变化放松了精确最近邻居的约束,这为搜索算法的计算复杂性和数据库开销要求提供了相当大的优势。此外,它通过返回一组k个候选最近邻居来扩展原始最近邻居算法,从中可以考虑专家或精确的距离计算。此外,本文扩展了c近似k近邻搜索的实现,使其能够利用新兴的GPGPU计算领域。实现的c近似k近邻搜索的特定形式基于Indyk和Andoni的E2LSH软件包中的局部敏感哈希搜索[1]。在本文中,作者利用Leech Lattice [2]的特殊属性作为局部敏感哈希族的子空间量化器。水ch格子的独特之处在于,它在24维空间中提供了大小相等的球体的最接近的晶格堆积。此外,编码理论的进步为找到欧氏24维空间中的查询点最近的晶格中心提供了一种非常有利的解码算法[3] [4]。 Leech格的多层结构为并行化提供了极好的机会,因为它包含了由于晶格之间异常的对称性而导致的许多独立子晶格解码的最小化。这些解码还具有很高的浮点计算强度,因此建议在GPGPU架构(如基于NVIDIA CUDA的框架)上实现良好的实现。此外,由于散列解码完全彼此独立,因此可以相当有效地并行化基于位置敏感的散列的最近邻居搜索算法的整体构造。本文的目的是提出一种CUDA优化的有限距离Leech Lattice解码器的并行实现[4],该解码器用于使用E2LSH的局域敏感哈希框架在查询优化的c近似k近邻中使用。该系统将应用于SIFT转换后的[5]个图像矢量的近似图像检索。[1] A. Andoni,最近邻居搜索:旧的,新的和不可能的。博士学位论文,麻省理工学院,2009年9月。[2] J. Leech,“关于球堆积的注意事项”,加拿大数学杂志,1967年。[3] J. Forney,GD,“针对球体的有界距离解码算法水ech格,具有概括性”,信息理论,IEEE Transactions 35,第906–909页,1989年7月。[4] O. Amrani和Y. Beery,“水code格和golay码的六码及相关解码器的有效有界距离解码”,通信,IEEE事务,卷[5] D. G. Lowe,第44页,第534-537页,1996年5月。[5] D. G. Lowe,“来自局部尺度不变特征的对象识别”,计算机视觉,1999年。 2,第1150–1157页,1999年。

著录项

  • 作者

    Carraher, Lee A.;

  • 作者单位

    University of Cincinnati.;

  • 授予单位 University of Cincinnati.;
  • 学科 Engineering Computer.;Computer Science.
  • 学位 M.S.
  • 年度 2012
  • 页码 104 p.
  • 总页数 104
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号