首页> 外文会议>International conference on very large data bases >Instant Loading for Main Memory Databases
【24h】

Instant Loading for Main Memory Databases

机译:主内存数据库的即时加载

获取原文
获取外文期刊封面目录资料

摘要

eScience and big data analytics applications are facing the challenge of efficiently evaluating complex queries over vast amounts of structured text data archived in network storage solutions. To analyze such data in traditional disk-based database systems, it needs to be bulk loaded, an operation whose performance largely depends on the wire speed of the data source and the speed of the data sink, i.e., the disk. As the speed of network adapters and disks has stagnated in the past, loading has become a major bottleneck. The delays it is causing are now ubiquitous as text formats are a preferred storage format for reasons of portability. But the game has changed: Ever increasing main memory capacities have fostered the development of in-memory database systems and very fast network infrastructures are on the verge of becoming economical. While hardware limitations for fast loading have disappeared, current approaches for main memory databases fail to saturate the now available wire speeds of tens of Gbit/s. With Instant Loading, we contribute a novel CSV loading approach that allows scalable bulk loading at wire speed. This is achieved by optimizing all phases of loading for modern super-scalar multi-core CPUs. Large main memory capacities and Instant Loading thereby facilitate a very efficient data staging processing model consisting of instantaneous toad-work-unload cycles across data archives on a single node. Once data is loaded, updates and queries are efficiently processed with the flexibility, security, and high performance of relational main memory databases.
机译:源极和大数据分析应用程序面临有效地评估在网络存储解决方案中归档的大量结构化文本数据中复杂查询的挑战。为了分析传统的基于磁盘的数据库系统中的这些数据,需要装载批量,其性能在很大程度上取决于数据源的线速和数据接收器的速度,即磁盘。随着网络适配器和磁盘的速度过去已经停滞不前,装载已成为一个主要的瓶颈。导致的延迟现在与文本格式是优选的存储格式,因为便携性的原因是优选的存储格式。但是,游戏已经改变:越来越多的主要内存能力促进了内存数据库系统的开发,并且非常快的网络基础设施正在成为经济的边缘。虽然快速加载的硬件限制已经消失,但主内存数据库的当前方法无法使现在的数十的电线速度饱和。随着即时加载,我们有助于一种新型CSV加载方法,允许在线速度进行可扩展的散装加载。这是通过优化现代超标量多核CPU的所有阶段来实现的。因此,大主存储器容量和即时加载,从而促进了一个非常有效的数据暂存处理模型,这些处理模型由单个节点上的数据档案组成的瞬时蟾蜍 - 工作 - 卸载循环。一旦加载数据,有效地处理更新和查询,以具有关系主存储器数据库的灵活性,安全性和高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号