首页> 外文会议>International conference on very large data bases;VLDB 2008 >Lee Wave: Level-Wise Distribution of Wavelet Coefficients for Processing κNN Queries over Distributed Streams
【24h】

Lee Wave: Level-Wise Distribution of Wavelet Coefficients for Processing κNN Queries over Distributed Streams

机译:Lee Wave:用于处理分布式流上的κNN查询的小波系数的逐级分布

获取原文

摘要

We present LeeWave - a bandwidth-efficient approach to searching range-specified κ-nearest neighbors among distributed streams by LEvEl-wise distribution of WAVElet coefficients. To find the κ most similar streams to a range-specified reference one, the relevant wavelet coefficients of the reference stream can be sent to the peer sites to compute the similarities. However, bandwidth can be unnecessarily wasted if the entire relevant coefficients are sent simultaneously. Instead, we present a level-wise approach by leveraging the multi-resolution property of the wavelet coefficients. Starting from the top and moving down one level at a time, the query initiator sends only the single-level coefficients to a progressively shrinking set of candidates. However, there is one difficult challenge in LeeWave: how does the query initiator prune the candidates without knowing all the relevant coefficients? To overcome this challenge, we derive and maintain a similarity range for each candidate and gradually tighten the bounds of this range as we move from one level to the next. The increasingly tightened similarity ranges enable the query initiator to effectively prune the candidates without causing any false dismissal. Extensive experiments with real and synthetic data show that, when compared with prior approaches, LeeWave uses significantly less bandwidth under a wide range of conditions.
机译:我们提出LeeWave-一种带宽有效的方法,通过WAVElet系数的LEvEl分布在分布式流中搜索范围指定的κ最近邻居。为了找到与范围指定的参考流最相似的κ流,可以将参考流的相关小波系数发送到对等站点以计算相似度。但是,如果同时发送所有相关系数,则会不必要地浪费带宽。相反,我们利用小波系数的多分辨率特性提出了一种逐级方法。从顶部开始,一次向下移动一个级别,查询启动器仅将单级系数发送给逐渐缩小的候选集。但是,LeeWave面临一个难题:查询发起者如何在不知道所有相关系数的情况下删减候选者?为了克服这一挑战,我们为每个候选人推导并维持相似范围,并随着我们从一个级别移至下一个级别,逐渐收紧该范围的界限。越来越严格的相似性范围使查询启动器可以有效地修剪候选者,而不会引起任何误解。与真实数据和合成数据进行的大量实验表明,与以前的方法相比,LeeWave在各种条件下使用的带宽要少得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号