首页> 外文OA文献 >Optimization of Data Accesses for Database Applications
【2h】

Optimization of Data Accesses for Database Applications

机译:优化数据库应用程序的数据访问

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

As the speed of microprocessors increases according to Moore's law, access speeds of the main memory and disks lag far behind. As a result, disk accesses and memory accesses pose significant performance bottlenecks for a wide range of applications. Specifically the database server performance in a data center is often limited, relying on the workload, by disk I/Os or memory accesses. This dissertation investigates techniques that improve the effectiveness of buffer caches and processor caches to bridge these two speed gaps for database servers in a data center environment.To address the disk I/O bottleneck, this dissertation proposes the global management of the database-storage buffer cache hierarchy, which delivers the performance comparable to that of the aggregate cache size of the hierarchy. To manage buffer caches globally, this dissertation answers two challenging questions: 1) without the modification of the existing I/O interface (namely hierarchy-aware), how to collaborate database and storage caches to achieve a global cache; 2) with the extension of the I/O interface (namely aggressively-collaborative), whether the benefit of the consequent performance improvement is worthwhile.To answer the first question, this dissertation proposes an hierarchy-aware method based on the eviction information. The storage cache uses a Client Content Tracking table to obtain the eviction information transparently. Upon the eviction of the database server, the storage server fetches selectively the corresponding block from the disk. Both simulation and implementation results show that the hierarchy-aware method improves the storage cache hit ratio significantly, cache hit ratios increasing by a factor of 5 in simulations and database transaction rate by 22% in real system results.To answer the second question, this dissertation adopts an empirical evaluation approach to explore the large design space for the database-storage collaborative caching. This design space has three dimensions: collaboration approaches (hierarchy-aware and aggressively-collaborative), replacement algorithms and workload specific optimizations. Through both trace-driven simulation and real system implementation, this dissertation evaluates 248 combinations in the design space, which include all the previous proposed solutions and many new approaches. The results indicate that the aggressively-collaborative caching only provides less than on average 2.5% performance improvement in simulation and 1.0% in real system experiments over the hierarchy-aware caching in all the tested cases. In other words, the hierarchy-aware caching, without requiring the modification to the existing I/O interface, can deliver the performance similar to that of the aggressively-collaborative caching.To address the memory access bottleneck for database servers, this dissertation proposes a technique, Hanuman, to improve the processor cache performance. Hanuman reformats data in database buffer caches dynamically to improve the data spatial locality. By adapting data layout to changing database queries, Hanuman improves the spatial locality of data and accordingly the processor cache hit ratio is increased. To determine the best data layout for the occurring workload with multiple queries, Hanuman conducts the heuristic cost analysis for candidate layouts and chooses the best layout that optimizes the estimated number of cache misses. Our result indicates that Hanuman reduces processor cache misses by 63-80% and query execution time by 16-24% for decision support queries.
机译:随着微处理器按照摩尔定律的增加,主存储器和磁盘的访问速度远远落后。结果,磁盘访问和内存访问构成了广泛应用程序的重大性能瓶颈。特别是,数据中心中数据库服务器的性能通常受磁盘I / O或内存访问的限制,具体取决于工作量。本文研究了提高缓冲区高速缓存和处理器高速缓存的效率的技术,以弥合数据中心环境中数据库服务器的这两个速度差距。为解决磁盘I / O瓶颈,本文提出了数据库存储缓冲区的全局管理缓存层次结构,其性能可与该层次结构的总缓存大小相媲美。为了全局管理缓冲区缓存,本文回答了两个具有挑战性的问题:1)在不修改现有I / O接口(即层次结构感知)的情况下,如何协同数据库和存储缓存来实现全局缓存; 2)随着I / O接口的扩展(积极协作)的发展,提高性能的好处是否值得。为了回答第一个问题,本文提出了一种基于逐出信息的层次感知方法。存储缓存使用客户端内容跟踪表来透明地获取收回信息。驱逐数据库服务器后,存储服务器会从磁盘有选择地获取相应的块。仿真结果和实现结果均表明,基于层次的方法可以显着提高存储缓存命中率,在仿真中,缓存命中率提高5倍,而在实际系统结果中数据库事务处理率提高22%。本文采用实证评估的方法,探索了数据库存储协同缓存的大型设计空间。该设计空间具有三个维度:协作方法(层次结构感知和积极协作),替换算法和针对工作负载的优化。通过跟踪驱动仿真和实际系统实现,本文评估了设计空间中的248种组合,包括所有先前提出的解决方案和许多新方法。结果表明,在所有测试案例中,相比于层次结构感知的缓存,积极协作式缓存在模拟中的性能提升仅不到平均水平的2.5%,而在实际系统实验中的性能提升不到1.0%。换句话说,层次结构感知缓存无需修改现有的I / O接口就可以提供与主动协作缓存类似的性能。为了解决数据库服务器的内存访问瓶颈,本文提出了一种解决方案。 Hanuman技术,以提高处理器缓存性能。 Hanuman动态地重新格式化数据库缓冲区高速缓存中的数据,以改善数据的空间局部性。通过使数据布局适应不断变化的数据库查询,Hanuman改善了数据的空间局部性,并因此提高了处理器高速缓存命中率。为了通过多个查询确定正在发生的工作负载的最佳数据布局,Hanuman对候选布局进行启发式成本分析,然后选择最佳布局以优化估计的高速缓存未命中次数。我们的结果表明,对于决策支持查询,Hanuman将处理器高速缓存未命中率降低了63-80%,查询执行时间减少了16-24%。

著录项

  • 作者

    Chen Zhifeng;

  • 作者单位
  • 年度 2005
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号