首页> 外文会议>International conference on very large data bases;VLDB 2010 >Conditioning and Aggregating Uncertain Data Streams:Going Beyond Expectations
【24h】

Conditioning and Aggregating Uncertain Data Streams:Going Beyond Expectations

机译:调节和聚合不确定的数据流:超出预期

获取原文

摘要

Uncertain data streams are increasingly common in real-world deployments and monitoring applications require the evaluation of complex queries on such streams. In this paper, we consider complex queries involving conditioning (e.g., selections and group by's) and aggregation operations on uncertain data streams. To characterize the uncertainty of answers to these queries, one generally has to compute the full probability distribution of each operation used in the query. Computing distributions of aggregates given conditioned tuple distributions is a hard, unsolved problem. Our work employs a new evaluation framework that includes a general data model, approximation metrics, and approximate representations. Within this framework we design fast data-stream algorithms, both deterministic and randomized, for returning approximate distributions with bounded errors as answers to those complex queries. Our experimental results demonstrate the accuracy and efficiency of our approximation techniques and offer insights into the strengths and limitations of deterministic and randomized algorithms.
机译:不确定的数据流在实际部署中越来越普遍,监视应用程序需要评估此类流上的复杂查询。在本文中,我们考虑了涉及条件(例如选择和分组依据)和对不确定数据流进行聚合操作的复杂查询。为了表征这些查询答案的不确定性,通常必须计算查询中使用的每个操作的全部概率分布。给定条件元组分布,计算聚集的分布是一个难题,尚未解决。我们的工作采用了新的评估框架,其中包括通用数据模型,近似指标和近似表示。在此框架内,我们设计了确定性和随机性的快速数据流算法,用于返回带有有限误差的近似分布作为那些复杂查询的答案。我们的实验结果证明了逼近技术的准确性和效率,并为确定性和随机算法的优势和局限性提供了见识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号