首页> 外文会议>Algorithms -ESA 2003 >Estimating Dominance Norms of Multiple Data Streams
【24h】

Estimating Dominance Norms of Multiple Data Streams

机译:估计多个数据流的优势范数

获取原文
获取原文并翻译 | 示例

摘要

There is much focus in the algorithms and database communities on designing tools to manage and mine data streams. Typically, data streams consist of multiple signals. Formally, a stream of multiple signals is (I, a_(I, j)) where I's correspond to the domain, j's index the different signals and a_(I, j) > 0 give the value of the jth signal at point I. We study the problem of finding norms that are cumulative of the multiple signals in the data stream. For example, consider the max-dominance norm, defined as Σ_I max_j {a_(I, j)}. It may be thought as estimating the norm of the "upper envelope" of the multiple signals, or alternatively, as estimating the norm of the "marginal" distribution of tabular data streams. It is used in applications to estimate the "worst case influence" of multiple processes, for example in IP traffic analysis, electrical grid monitoring and financial domain. In addition, it is a natural measure, generalizing the union of data streams or counting distinct elements in data streams. We present the first known data stream algorithms for estimating max-dominance of multiple signals. In particular, we use workspace and time-per-item that are both sublinear (in fact, poly-logarithmic) in the input size. In contrast other notions of dominance on streams a, b ― min-dominance (Σ_I min_j{a_(I,j)}), count-dominance (|{I|a_I > b_I}|) or relative-dominance (Σ_I a_I/max{1, b_I})― are all impossible to estimate accurately with sublinear space.
机译:算法和数据库社区非常关注设计工具来管理和挖掘数据流。通常,数据流由多个信号组成。形式上,多个信号流是(I,a_(I,j)),其中I对应于域,j索引不同信号,并且a_(I,j)> 0给出点i处第j个信号的值。我们研究寻找数据流中多个信号累积的规范的问题。例如,考虑定义为Σ_Imax_j {a_(I,j)}的max-dominance范数。可以将其视为估计多个信号的“上包络”的范数,或者将其视为表格式数据流的“边际”分布的范数。它在应用程序中用于估计多个过程的“最坏情况影响”,例如在IP流量分析,电网监视和金融领域。另外,这是一种自然的方法,可以概括数据流的并集或对数据流中的不同元素进行计数。我们提出了第一个已知的数据流算法,用于估计多个信号的最大支配性。特别是,我们使用输入大小均为亚线性(实际上是对数)的工作空间和每项时间。相反,在流a,b上的其他支配概念―最小支配(Σ_Imin_j {a_(I,j)}),计数支配(| {I | a_I> b_I} |)或相对支配(Σ_Ia_I / max {1,b_I})―不可能用亚线性空间精确估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号