首页> 外文会议>IEEE International Conference on Big Data Computing Service and Applications >AntsBOA: A New Time Series Pipeline for Big Data Processing, Analyzing and Querying in Online Advertising Application
【24h】

AntsBOA: A New Time Series Pipeline for Big Data Processing, Analyzing and Querying in Online Advertising Application

机译:Antsboa:在线广告应用中的大数据处理,分析和查询新的时序序列管道

获取原文

摘要

This paper presents a new pipeline AntsBOA for big data analyzing, processing and querying. This pipeline is initially designed for online advertising application. However, it is easy to extend to other big data applications. The main idea is that AntsBOA is based on time series technology. The data processing of AntsBOA includes three levels, aggregation, time series and cache. Time series data and cache data are loading to a distributed database system, named Kodiak. Query server then queries these data in Kodiak and replies the result. This pipeline has been run in production for half a year. In our production, prior 16 months performance data is able to populate in less than half an hour. The response time of querying the 16 months performance data is less than several milliseconds in average. In addition, from our production results, cache level speeds up tens of times than aggregation level in term of query time. Time series cache level has a speedup 50% than cache level in term of Hadoop resource. And Time series loading performance speeds up about 10 times than traditional loading. Also our production system is monitored to guarantee in a healthy and stable state. In summary, AntsBOA is an efficient, accurate, recoverable, scalable and fault tolerant pipeline for big data processing, analyzing and querying.
机译:本文介绍了一个新的管道抗身,用于大数据分析,处理和查询。该管道最初是为在线广告应用而设计的。但是,很容易扩展到其他大数据应用程序。主要思想是抗斯波亚基于时间序列技术。 Antsboa的数据处理包括三个级别,聚合,时间序列和缓存。时间序列数据和缓存数据是加载到分布式数据库系统,名为Kodiak。然后查询服务器在Kodiak中查询这些数据并回复结果。该管道已在生产中运行半年。在我们的生产中,前16个月的性能数据能够在不到半小时内填充。查询16个月性能数据的响应时间平均小于几毫秒。此外,从我们的生产结果中,高速缓存级别比查询时间期间的聚合级别升高了几倍。时间序列缓存级别的加速度在Hadoop资源中的高速缓存级别比缓存级别。时间序列加载性能比传统装载的10次加速约10次。此外,我们的生产系统被监控以保证健康稳定的状态。总之,Antsboa是一种有效,准确,可恢复,可扩展,可持续的和容错管道,用于大数据处理,分析和查询。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号