Conditioning and Aggregating Uncertain Data Streams:Going Beyond Expectations

机译：调节和聚合不确定的数据流：超出预期

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Uncertain data streams are increasingly common in real-world deployments and monitoring applications require the evaluation of complex queries on such streams. In this paper, we consider complex queries involving conditioning (e.g., selections and group by's) and aggregation operations on uncertain data streams. To characterize the uncertainty of answers to these queries, one generally has to compute the full probability distribution of each operation used in the query. Computing distributions of aggregates given conditioned tuple distributions is a hard, unsolved problem. Our work employs a new evaluation framework that includes a general data model, approximation metrics, and approximate representations. Within this framework we design fast data-stream algorithms, both deterministic and randomized, for returning approximate distributions with bounded errors as answers to those complex queries. Our experimental results demonstrate the accuracy and efficiency of our approximation techniques and offer insights into the strengths and limitations of deterministic and randomized algorithms.

机译：不确定的数据流在实际部署中越来越普遍，监视应用程序需要评估此类流上的复杂查询。在本文中，我们考虑了涉及条件（例如选择和分组依据）和对不确定数据流进行聚合操作的复杂查询。为了表征这些查询答案的不确定性，通常必须计算查询中使用的每个操作的全部概率分布。给定条件元组分布，计算聚集的分布是一个难题，尚未解决。我们的工作采用了新的评估框架，其中包括通用数据模型，近似指标和近似表示。在此框架内，我们设计了确定性和随机性的快速数据流算法，用于返回带有有限误差的近似分布作为那些复杂查询的答案。我们的实验结果证明了逼近技术的准确性和效率，并为确定性和随机算法的优势和局限性提供了见识。

著录项

来源
《International conference on very large data bases;VLDB 2010》|2011年|p.1302-1313|共12页
会议地点
作者
Thanh T. L. Tran; Andrew McGregor; Yanlei Diao; Liping Peng; Anna Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Sliding-Window Probabilistic Threshold Aggregate Queries on Uncertain Data Streams [J] . Information Sciences: An International Journal . 2020,第期

机译：滑动窗口概率阈值在不确定数据流上的汇总查询
2. Uncertain canonical correlation analysis for multi-view feature extraction from uncertain data streams [J] . Wen-Ping Li, Jing Yang, Jian-Pei Zhang Neurocomputing . 2015,第ptac期

机译：从不确定数据流中提取多视图特征的不确定规范相关性分析
3. Uncertain One-Class Learning and Concept Summarization Learning on Uncertain Data Streams [J] . Liu Bo, Xiao Yanshan, Yu Philip S., IEEE Transactions on Knowledge and Data Engineering . 2014,第2期

机译：不确定数据流上的不确定一类学习和概念总结学习
4. Conditioning and Aggregating Uncertain Data Streams:Going Beyond Expectations [C] . Thanh T. L. Tran, Andrew McGregor, Yanlei Diao, International conference on very large data bases . 2010

机译：调节和聚合不确定数据流：超越预期
5. Frequent Pattern Mining of Uncertain Data Streams. [D] . Jiang, Fan. 2012

机译：不确定数据流的频繁模式挖掘。
6. Hyper-structure mining of frequent patterns in uncertain data streams [O] . Chandima HewaNadungodage, Yuni Xia, Jaehwan John Lee, -1

机译：不确定数据流中频繁模式的超结构挖掘
7. Performance Evaluation in Aggregate Production Planning Using Integrated RED-SWARA Method under Uncertain Condition [O] . Javad Khalili, Alireza Alinezhad 2020

机译：在不确定条件下使用集成红甘蓝法的综合生产规划性能评估
8. Final Data Report: Temporal Changes in the Ecological Condition of Non-Tidal Streams in Upper Pocomoke and Western Branch Watersheds. [R] . Kilian, J., Stranko, S. 2003

机译：最终数据报告：上部pocomoke和西部分支流域非潮汐流生态状况的时间变化。

Conditioning and Aggregating Uncertain Data Streams:Going Beyond Expectations

摘要

著录项

相似文献

相关主题

期刊订阅