...
首页> 外文期刊>ACM transactions on database systems >A Geometric Approach to Monitoring Threshold Functions over Distributed Data Streams
【24h】

A Geometric Approach to Monitoring Threshold Functions over Distributed Data Streams

机译:一种监视分布式数据流阈值函数的几何方法

获取原文
获取原文并翻译 | 示例

摘要

Monitoring data streams in a distributed system is the focus of much research in recent years. Most of the proposed schemes, however, deal with monitoring simple aggregated values, such as the frequency of appearance of items in the streams. More involved challenges, such as the important task of feature selection (e.g., by monitoring the information gain of various features), still require very high communication overhead using naive, centralized algorithms. We present a novel geometric approach which reduces monitoring the value of a function (visa-vis a threshold) to a set of constraints applied locally on each of the streams. The constraints are used to locally filter out data increments that do not affect the monitoring outcome, thus avoiding unnecessary communication. As a result, our approach enables monitoring of arbitrary threshold functions over distributed data streams in an efficient manner. We present experimental results on real-world data which demonstrate that our algorithms are highly scalable, and considerably reduce communication load in comparison to centralized algorithms.
机译:监视分布式系统中的数据流是近年来许多研究的重点。但是,大多数提议的方案都涉及监视简单的聚合值,例如流中项目出现的频率。诸如特征选择的重要任务(例如,通过监视各种特征的信息增益)之类的更复杂的挑战仍然需要使用幼稚的集中式算法的非常高的通信开销。我们提出了一种新颖的几何方法,该方法可将对函数值的监视(相对于阈值)减少为局部应用在每个流上的一组约束。约束用于局部过滤不影响监视结果的数据增量,从而避免不必要的通信。结果,我们的方法能够以有效的方式监视分布式数据流上的任意阈值函数。我们在现实世界的数据上展示了实验结果,这些结果表明我们的算法具有高度可扩展性,并且与集中式算法相比,可大大减少通信负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号