首页> 外文期刊>Algorithmica >A Faster Algorithm for Cuckoo Insertion and Bipartite Matching in Large Graphs
【24h】

A Faster Algorithm for Cuckoo Insertion and Bipartite Matching in Large Graphs

机译:大图中布谷鸟插入和二部匹配的更快算法

获取原文
获取原文并翻译 | 示例
           

摘要

Hash tables are ubiquitous in computer science for efficient access to large datasets. However, there is always a need for approaches that offer compact memory utilisation without substantial degradation of lookup performance. Cuckoo hashing is an efficient technique of creating hash tables with high space utilisation and offer a guaranteed constant access time. We are given n locations and m items. Each item has to be placed in one of the locations chosen by k random hash functions. By allowing more than one choice for a single item, cuckoo hashing resembles multiple choice allocations schemes. In addition it supports dynamically changing the location of an item among its possible locations. We propose and analyse an insertion algorithm for cuckoo hashing that runs in linear time with high probability and in expectation. Previous work on total allocation time has analysed breadth first search, and it was shown to be linear only in expectation. Our algorithm finds an assignment (with probability 1) whenever it exists. In contrast, the other known insertion method, known as random walk insertion, may run indefinitely even for a solvable instance. We also present experimental results comparing the performance of our algorithm with the random walk method, also for the case when each location can hold more than one item. As a corollary we obtain a linear time algorithm (with high probability and in expectation) for finding perfect matchings in a special class of sparse random bipartite graphs. We support this by performing experiments on a real world large dataset for finding maximum matchings in general large bipartite graphs. We report an order of magnitude improvement in the running time as compared to the Hopkraft-Karp matching algorithm.
机译:哈希表在计算机科学中无处不在,可以有效地访问大型数据集。但是,始终需要在不显着降低查找性能的情况下提供紧凑的存储器利用率的方法。布谷鸟哈希是一种创建具有高空间利用率的哈希表的有效技术,并且可提供恒定的访问时间。我们给了n个地点和m个项目。每个项目都必须放置在由k个随机哈希函数选择的位置之一中。通过允许单个项目有多个选择,布谷鸟哈希类似于多项选择分配方案。另外,它支持在项目可能的位置之间动态更改项目的位置。我们提出并分析了布谷鸟哈希的插入算法,该算法在线性时间中以很高的概率和期望运行。先前有关总分配时间的工作分析了广度优先搜索,并且仅在期望方面显示线性。只要存在,我们的算法就会找到一个分配(概率为1)。相反,另一种已知的插入方法,即随机游走插入,即使对于可解决的实例,也可以无限期地运行。我们还提供了实验结果,将我们的算法与随机游走方法的性能进行了比较,也适用于每个位置可以容纳多个物品的情况。作为推论,我们获得了线性时间算法(具有很高的概率,并且期望很高),用于在特殊类别的稀疏随机二部图中找到完美匹配。我们通过在现实世界的大型数据集上进行实验以在一般的大型二部图中找到最大匹配来支持这一点。与Hopkraft-Karp匹配算法相比,我们报告了运行时间改善了一个数量级。

著录项

  • 来源
    《Algorithmica》 |2019年第9期|3707-3724|共18页
  • 作者

    Khosla Megha; Anand Avishek;

  • 作者单位

    Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany;

    Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Cuckoo hashing; Bipartite matching; Load balancing;

    机译:Cuckoo Hashing;二分匹配;负载平衡;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号