首页> 外文会议>21st annual symposium on parallelism in algorithms and architectures 2009 >Scheduling to minimize staleness and stretch in real-time data warehouses
【24h】

Scheduling to minimize staleness and stretch in real-time data warehouses

机译:进行调度以最大程度地减少陈旧性并扩展实时数据仓库

获取原文
获取外文期刊封面目录资料

摘要

We study scheduling algorithms for loading data feeds into real time data warehouses, which are used in applications such as IP network monitoring, online financial trading, and credit card fraud detection. In these applications, the warehouse collects a large number of streaming data feeds that are generated by external sources and arrive asynchronously. Data for each table are generated at a constant rate, different tables possibly at different rates. For each data feed, the arrival of new data triggers an update that seeks to append the new data to the corresponding table; if multiple updates are pending for the same table, they are batched together before being loaded. At time τ, if a table has been updated with information up to time r≤τ, its staleness is defined as τ--r.
机译:我们研究了用于将数据馈送加载到实时数据仓库中的调度算法,该算法用于IP网络监控,在线金融交易和信用卡欺诈检测等应用中。在这些应用程序中,仓库收集了大量由外部源生成并异步到达的流数据提要。每个表的数据以恒定的速率生成,不同的表可能以不同的速率生成。对于每个数据馈送,新数据的到达都会触发一个更新,该更新试图将新数据附加到相应的表中。如果同一张表有多个更新待处理,则在加载之前将它们分批处理。在时间τ处,如果某个表已更新了直到时间r≤τ的信息,则其陈旧性定义为τ--r。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号