Published in: International Conference on Very Large Data Bases

AnalyticDB: Real-time OLAP Database System at Alibaba Cloud

Abstract

With data exploding in scale and variety, OLAP databases play an increasingly important role in serving real-time analysis with low latency (e.g., hundreds of milliseconds), especially when incoming queries are complex and ad hoc in nature. Moreover, these systems are expected to provide high query concurrency and write throughput, and to support queries over structured and complex data types (e.g., JSON, vector, and text). In this paper, we introduce AnalyticDB, a real-time OLAP database system developed at Alibaba. AnalyticDB maintains all-column indexes in an asynchronous manner with acceptable overhead, which provides low latency for complex ad hoc queries. Its storage engine extends a hybrid row-column layout for fast retrieval of both structured data and data of complex types. To handle large-scale data with high query concurrency and write throughput, AnalyticDB decouples read and write access paths. To further reduce query latency, a novel storage-aware SQL optimizer and execution engine are developed to fully utilize the advantages of the underlying storage and indexes. AnalyticDB has been successfully deployed on Alibaba Cloud to serve numerous customers (both large and small). It is capable of holding 100 trillion rows of records, i.e., 10 PB+ in size. At the same time, it is able to serve 10M+ writes and 100K+ queries per second, while completing complex queries within hundreds of milliseconds.
