首页> 外文会议>International conference on management of data >E-Cube: Multi-Dimensional Event Sequence Analysis Using Hierarchical Pattern Query Sharing
【24h】

E-Cube: Multi-Dimensional Event Sequence Analysis Using Hierarchical Pattern Query Sharing

机译:电子多维数据集:使用分层模式查询共享的多维事件序列分析

获取原文
获取外文期刊封面目录资料

摘要

Many modern applications, including online financial feeds, tag-based mass transit systems and RFID-based supply chain management systems transmit real-time data streams. There is a need for event stream processing technology to analyze this vast amount of sequential data to enable online operational decision making. Existing techniques such as traditional online analytical processing (OLAP) systems are not designed for real-time pattern-based operations, while state-of-the-art Complex Event Processing (CEP; systems designed for sequence detection do not support OLAP operations. We propose a novel E-Cube model which combines CEP and OLAP techniques for efficient multi-dimensional event pattern analysis at different abstraction levels. Our analysis of the interrelationships in both concept abstraction and pattern refinement among queries facilitates the composition of these queries into an integrated E-Cube hierarchy. Based on this E-Cube hierarchy, strategies of drill-down (refinement from abstract to more specific patterns) and of roll-up (generalization from specific to more abstract patterns) are developed for the efficient workload evaluation. Our proposed execution strategies reuse intermediate results along both the concept and the pattern refinement relationships between queries. Based on this foundation, we design a cost-driven adaptive optimizer called Chase, that exploits the above reuse strategies for optimal E-Cube hierarchy execution. Our experimental studies comparing alternate strategies on a real world financial data stream under different workload conditions demonstrate the superiority of the Chase method. In particular, our Chise execution in many cases performs ten fold faster than the state-of-the-art strategy for real stock market query workloads.
机译:许多现代应用程序,包括在线财务提要,基于标签的大众运输系统和基于RFID的供应链管理系统,都可以传输实时数据流。需要一种事件流处理技术来分析大量的顺序数据,以实现在线操作决策。现有技术(例如传统的在线分析处理(OLAP)系统)不是为基于实时模式的操作而设计的,而最新的复杂事件处理(CEP;为序列检测而设计的系统)则不支持OLAP操作。提出了一个新的E-Cube模型,该模型结合了CEP和OLAP技术,可以在不同的抽象级别上进行有效的多维事件模式分析,我们对查询中概念抽象和模式优化之间的相互关系的分析有助于将这些查询组成一个集成的E -多维数据集层次结构:在此E-Cube层次结构的基础上,开发了向下钻取(从抽象模式扩展到更具体的模式)和向上滚动(从特定模式抽象到更抽象的模式)的策略,以进行有效的工作负载评估。执行策略沿着查询之间的概念和模式优化关系重用中间结果。作为基础,我们设计了一种称为Chase的成本驱动型自适应优化器,该优化器利用上述重用策略来实现最佳E-Cube层次结构执行。我们的实验研究比较了在不同工作负载条件下的真实财务数据流上的替代策略,这证明了Chase方法的优越性。特别是,我们的Chise执行在许多情况下的执行速度比用于实际股市查询工作负载的最新策略快十倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号