首页> 外文期刊>Database Systems Journal >Efficient Partitioning of Large Databases without Query Statistics
【24h】

Efficient Partitioning of Large Databases without Query Statistics

机译:没有查询统计信息的大型数据库的有效分区

获取原文
           

摘要

An efficient way of improving the performance of a database management system is distributed processing. Distribution of data involves fragmentation or partitioning, replication, and allocation process. Previous research works provided partitioning based on empirical data about the type and frequency of the queries. These solutions are not suitable at the initial stage of a distributed database as query statistics are not available then. In this paper, I have presented a fragmentation technique, Matrix based Fragmentation (MMF), which can be applied at the initial stage as well as at later stages of distributed databases. Instead of using empirical data, I have developed a matrix, Modified Create, Read, Update and Delete (MCRUD), to partition a large database properly. Allocation of fragments is done simultaneously in my proposed technique. So using MMF, no additional complexity is added for allocating the fragments to the sites of a distributed database as fragmentation is synchronized with allocation. The performance of a DDBMS can be improved significantly by avoiding frequent remote access and high data transfer among the sites. Results show that proposed technique can solve the initial partitioning problem of large distributed databases.
机译:分布式处理是提高数据库管理系统性能的有效方法。数据的分发涉及碎片或分区,复制和分配过程。先前的研究工作是基于有关查询类型和查询频率的经验数据进行分区的。这些解决方案不适用于分布式数据库的初始阶段,因为那时查询统计信息不可用。在本文中,我介绍了一种片段化技术,即基于矩阵的片段化(MMF),该技术可以在分布式数据库的初始阶段和后期应用。我没有使用经验数据,而是开发了一个矩阵,即修改后的创建,读取,更新和删除(MCRUD),可以对大型数据库进行适当的分区。在我提出的技术中,片段的分配是同时完成的。因此,使用MMF不会增加额外的复杂性,因为分片与分配是同步的,因此无法将分片分配到分布式数据库的站点。通过避免站点之间的频繁远程访问和高数据传输,可以显着提高DDBMS的性能。结果表明,该技术可以解决大型分布式数据库的初始分区问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号