【24h】

FIDS: Monitoring Frequent Items over Distributed Data Streams

机译:FIDS:监视分布式数据流上的频繁项目

获取原文
获取原文并翻译 | 示例

摘要

Many applications require the discovery of items which have occur frequently within multiple distributed data streams. Past solutions for this problem either require a high degree of error tolerance or can only provide results periodically. In this paper we introduce a new algorithm designed for continuously tracking frequent items over distributed data streams providing either exact or approximate answers. We tested the efficiency of our method using two real-world data sets. The results indicated significant reduction in communication cost when compared to naive approaches and an existing efficient algorithm called Top-K Monitoring. Since our method does not rely upon approximations to reduce communication overhead and is explicitly designed for tracking frequent items, our method also shows increased quality in its tracking results.
机译:许多应用程序要求发现在多个分布式数据流中经常发生的项目。过去针对该问题的解决方案要么需要高度的容错能力,要么只能定期提供结果。在本文中,我们介绍了一种新算法,该算法旨在连续跟踪分布式数据流中的频繁项,从而提供准确或近似的答案。我们使用两个实际数据集测试了我们方法的效率。结果表明,与朴素的方法和现有的称为Top-K Monitoring的高效算法相比,通信成本显着降低。由于我们的方法不依赖于近似方法来减少通信开销,并且明确地设计用于跟踪频繁的物品,因此我们的方法在跟踪结果中也显示出更高的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号