首页> 外文期刊>The VLDB journal >Conditional heavy hitters: detecting interesting correlations in data streams
【24h】

Conditional heavy hitters: detecting interesting correlations in data streams

机译:有条件的沉重打击者:检测数据流中有趣的关联

获取原文
获取原文并翻译 | 示例
           

摘要

The notion of heavy hitters-items that make up a large fraction of the population-has been successfully used in a variety of applications across sensor and RFID monitoring, network data analysis, event mining, and more. Yet this notion often fails to capture the semantics we desire when we observe data in the form of correlated pairs. Here, we are interested in items that are conditionally frequent: when a particular item is frequent within the context of its parent item. In this work, we introduce and formalize the notion of conditional heavy hitters to identify such items, with applications in network monitoring and Markov chain modeling. We explore the relationship between conditional heavy hitters and other related notions in the literature, and show analytically and experimentally the usefulness of our approach. We introduce several algorithm variations that allow us to efficiently find conditional heavy hitters for input data with very different characteristics, and provide analytical results for their performance. Finally, we perform experimental evaluations with several synthetic and real datasets to demonstrate the efficacy of our methods and to study the behavior of the proposed algorithms for different types of data.
机译:占人口大部分的沉重打击者概念已经成功用于传感器和RFID监视,网络数据分析,事件挖掘等各种应用中。但是,当我们以相关对的形式观察数据时,此概念通常无法捕获我们想要的语义。在这里,我们对条件频繁的项目感兴趣:当某个特定项目在其父项目的上下文中频繁发生时。在这项工作中,我们引入并正式化了条件重击手的概念,以识别此类项目,并将其应用于网络监控和马尔可夫链建模中。我们探讨了有条件的沉重击球手与文献中其他相关概念之间的关系,并通过分析和实验证明了我们方法的有效性。我们介绍了几种算法变体,使我们可以为具有非常不同特征的输入数据有效地找到条件重击手,并为其性能提供分析结果。最后,我们使用几个合成的和真实的数据集进行实验评估,以证明我们的方法的有效性并研究针对不同类型数据的拟议算法的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号