Journal: Distributed and Parallel Databases

Efficiently repairing and measuring replica consistency in distributed databases


Abstract

In a distributed database, maintaining large table replicas with frequent asynchronous insertions is a challenging problem that requires carefully managing a trade-off between consistency and availability. With that motivation in mind, we propose efficient algorithms to repair and measure replica consistency. Specifically, we adapt, extend and optimize distributed set reconciliation algorithms to efficiently compute the symmetric difference between replicated tables in a distributed relational database. Our novel algorithms enable fast synchronization of replicas that are updated with small sets of new records, measure the obsolescence of replicas that have received many insertions, and help decide when to update a replica, as each table replica is continuously updated in an asynchronous manner. We first present an algorithm to repair and measure distributed consistency on a large table that is continuously updated with new records at several sites when the number of insertions is small. We then present a complementary algorithm that enables fast synchronization of a summarization table based on foreign keys when the number of insertions is large but concentrated on a few foreign key values. From a distributed systems perspective, the first algorithm reconciles the large table holding the data, whereas the second reconciles its summarization table. Both distributed database algorithms have linear communication complexity and cubic time complexity in the size of the symmetric difference between the respective table replicas they work on. That is, they are effective when the network speed is lower than the CPU speed at each site. An experimental performance evaluation with synthetic and real databases shows that our algorithms are faster than a previous state-of-the-art algorithm and more efficient than transferring complete tables, assuming large replicated tables and sporadic asynchronous insertions.
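
To make the target quantity concrete, the sketch below computes the symmetric difference between two insert-only table replicas and uses it both to measure divergence and to repair each side. It is a naive baseline and not the algorithms proposed in the paper: it compares one digest per row, so its communication cost would grow with the table size rather than with the size of the difference. The function names, the row encoding, and the sample rows are all hypothetical.

```python
import hashlib


def row_digest(row):
    """Stable digest of a row (a tuple of values). The repr-based encoding is an assumption."""
    return hashlib.sha256(repr(row).encode("utf-8")).hexdigest()


def symmetric_difference(replica_a, replica_b):
    """Return (rows only in A, rows only in B), comparing rows by digest."""
    digests_a = {row_digest(r): r for r in replica_a}
    digests_b = {row_digest(r): r for r in replica_b}
    only_a = [r for d, r in digests_a.items() if d not in digests_b]
    only_b = [r for d, r in digests_b.items() if d not in digests_a]
    return only_a, only_b


def repair(replica_a, replica_b):
    """Insert-only repair: each replica receives the rows it is missing."""
    only_a, only_b = symmetric_difference(replica_a, replica_b)
    return replica_a + only_b, replica_b + only_a


# Hypothetical replicas that diverged after asynchronous insertions.
site_1 = [(1, "alice"), (2, "bob"), (3, "carol")]
site_2 = [(1, "alice"), (2, "bob"), (4, "dave")]

only_1, only_2 = symmetric_difference(site_1, site_2)
divergence = len(only_1) + len(only_2)  # size of the symmetric difference
fixed_1, fixed_2 = repair(site_1, site_2)
```

Here `divergence` plays the role of a consistency measure: a site could compare it against a threshold to decide when a repair is worthwhile. After `repair`, both replicas hold the same set of rows, although possibly in a different order.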