【24h】

Data Stream Analytics as Cloud Service for Mobile Applications

机译:数据流分析作为移动应用程序的云服务

获取原文

摘要

Many mobile applications are based on cloud services such as loca-tion service, messaging service, etc. Currently most cloud services are based on statically prepared information rather than the real-time analytics results of dynamically captured events. A paradigm shift is to take Continuous Stream Analytics (CSA) as a cloud service, which, however, poses several specific challenges in scalability, latency, time-window semantics and transaction control.In this work we extend the SQL query engine to unify the processing of static relations and dynamic streams for providing the platform support of CSA service. This platform is significantly differentiated from the current generation of stream processing systems which are in general built separately from the database engine thus unable to take advantage of the functionalities already of-fered by the existing data management technology, and suffer from the over-head of inter-platform data access and movement.To capture the window semantics in CSA, we introduce the cycle-based query model and support it in terms of the cut-and-rewind query execution mechanism. This mechanism allows a SQL query to run cycle by cycle for processing the unbounded stream data chunk by chunk, but without shutting the query instance down between chunks for continuously maintaining the applica-tion state across the execution cycles, as required by sliding-window oriented operations. We also propose the cycle-based transaction model with cycle-based isolation and visibility. To scale-up analytics computation, we introduce the parallel infrastructure with multi-engines cooperated and synchronized based the common data chunking criteria without centralized coordination. To scale-up service provisioning, we investigate how to stage the continuously generated analytics results efficiently through metadata manipulation without physical data moving and copying.We have prototyped our approach by extending the PostgreSQL, resulting in a new kind of tightly integrated, highly efficient platform for providing CSA service. We tested the throughput and latency of this service using a well-known stream processing benchmark and with WebOS based Palm phones. The test results show that the proposed approach is highly competitive. Providing CSA cloud service using HP Neoview parallel database engine is currently explored.The proposed approach has been integrated into the PostgreSQL engine, resulting in a new kind of tightly integrated, highly efficient platform for CSA service provi-sioning. We tested the throughput and latency of this service using a popular stream processing benchmark and with WebOS based Palm phones. Our preliminary experi-ments reveal its merit in "CSA as a Service". Providing CSA cloud service using HP Neoview parallel database engine is currently explored.
机译:许多移动应用程序都基于云服务,例如位置服务,消息传递服务等。当前,大多数云服务基于静态准备的信息,而不是动态捕获的事件的实时分析结果。范式转变是将连续流分析(CSA)作为云服务,但是,这在可伸缩性,延迟,时间窗语义和事务控制方面提出了一些特定的挑战。 在这项工作中,我们扩展了SQL查询引擎,以统一静态关系和动态流的处理,以提供CSA服务的平台支持。该平台与当前的流处理系统有很大的不同,后者通常与数据库引擎分开构建,因此无法利用现有数据管理技术已经提供的功能,并且存在开销大的缺点。跨平台数据访问和移动。 为了捕获CSA中的窗口语义,我们介绍了基于循环的查询模型,并根据剪切和倒带查询执行机制提供了支持。这种机制允许SQL查询逐周期运行,以逐块处理无限制的流数据,但无需关闭块之间的查询实例,从而无需按滑动窗口的要求在整个执行周期内连续保持应用程序状态。操作。我们还提出了基于周期的事务模型,该模型具有基于周期的隔离性和可见性。为了扩大分析计算的规模,我们引入了并行基础架构,其中多引擎基于通用数据分块标准进行协作和同步,而无需集中协调。为了扩大服务供应,我们研究了如何通过元数据操作有效地逐步生成连续生成的分析结果,而无需移动和复制物理数据。 我们通过扩展PostgreSQL来原型化我们的方法,从而产生了一种新型的紧密集成的高效平台,用于提供CSA服务。我们使用众所周知的流处理基准测试和基于WebOS的Palm电话测试了该服务的吞吐量和延迟。测试结果表明,该方法具有很高的竞争力。目前正在探索使用HP Neoview并行数据库引擎提供CSA云服务。该提议的方法已集成到PostgreSQL引擎中,从而形成了一种紧密集成的高效CSA服务提供平台。我们使用流行的流处理基准测试和基于WebOS的Palm电话测试了此服务的吞吐量和延迟。我们的初步实验揭示了其在“ CSA即服务”中的优点。当前正在探索使用HP Neoview并行数据库引擎提供CSA云服务。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号