首页> 外文会议>2018 IEEE Data Science Workshop >VECTOR COMPRESSION FOR SIMILARITY SEARCH USING MULTI-LAYER SPARSE TERNARY CODES
【24h】

VECTOR COMPRESSION FOR SIMILARITY SEARCH USING MULTI-LAYER SPARSE TERNARY CODES

机译:使用多层稀疏三元编码进行相似性搜索的矢量压缩

获取原文
获取原文并翻译 | 示例

摘要

It was shown recently that Sparse Ternary Codes (STC) posses superior “coding gain” compared to the classical binary hashing framework and can successfully be used for large-scale search applications. This work extends the STC for compression and proposes a rate-distortion efficient design. We first study a single-layer setup where we show that binary encoding intrinsically suffers from poor compression quality while STC, thanks to the flexibility in design, can have near-optimal rate allocation. We further show that single-layer codes should be limited to very low rates. Therefore, in order to target arbitrarily high rates, we adopt a multi-layer solution inspired by the classical idea of residual quantization. The proposed architecture, while STC in nature and hence suitable for similarity search, can add the “list-refinement” technique as a useful element to the similarity search setup. This can be achieved thanks to the excellent rate-distortion performance of this scheme which we validate on synthetic, as well as large-scale public databases.
机译:最近显示,与经典的二进制散列框架相比,稀疏的三进制代码(STC)具有更好的“编码增益”,并且可以成功地用于大规模搜索应用程序。这项工作扩展了用于压缩的STC,并提出了一种高效的速率失真设计。我们首先研究了一个单层设置,其中我们证明二进制编码本质上受压缩质量差的影响,而STC由于设计的灵活性而可以具有接近最佳的速率分配。我们进一步表明,单层代码应限制在非常低的速率上。因此,为了以任意高的速率为目标,我们采用了经典的残余量化思想启发的多层解决方案。所提出的体系结构虽然具有STC的性质,因此适合于相似性搜索,但可以将“列表细化”技术添加为相似性搜索设置的有用元素。这可以通过我们在合成以及大型公共数据库上验证的该方案出色的速率失真性能来实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号