...
首页> 外文期刊>The Journal of Artificial Intelligence Research >Solving Large Problems with Heuristic Search: General-Purpose Parallel External-Memory Search
【24h】

Solving Large Problems with Heuristic Search: General-Purpose Parallel External-Memory Search

机译:使用启发式搜索解决大问题:通用并行外部存储器搜索

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Classic best-first heuristic search algorithms, like A*, record every unique state they encounter in RAM, making them infeasible for solving large problems. In this paper, we demonstrate how best-first search can be scaled to solve much larger problems by exploiting disk storage and parallel processing and, in some cases, slightly relaxing the strict best-first node expansion order. Some previous disk-based search algorithms abandon best-first search order in an attempt to increase efficiency. We present two case studies showing that A*, when augmented with Delayed Duplicate Detection, can actually be more efficient than these non-best-first search orders. First, we present a straightforward external variant of A*, called PEDAL, that slightly relaxes best-first order in order to be I/O efficient in both theory and practice, even on problems featuring real-valued node costs. Because it is easy to parallelize, PEDAL can be faster than in-memory IDA* even on domains with few duplicate states, such as the sliding-tile puzzle. Second, we present a variant of PEDAL, called PE2A*, that uses partial expansion to handle problems that have large branching factors. When tested on the problem of Multiple Sequence Alignment, PE2A* is the first algorithm capable of solving the entire Reference Set 1 of the standard BAliBASE benchmark using a biologically accurate cost function. This work shows that classic best-first algorithms like A* can be applied to large real-world problems. We also provide a detailed implementation guide with source code both for generic parallel disk-based best-first search and for Multiple Sequence Alignment with a biologically accurate cost function. Given its effectiveness as a general-purpose problem-solving method, we hope that this makes parallel and disk-based search accessible to a wider audience.
机译:经典的最佳优先启发式搜索算法(例如A *)会记录它们在RAM中遇到的每个唯一状态,从而使它们无法解决大问题。在本文中,我们演示了如何利用磁盘存储和并行处理以及在某些情况下稍微放松严格的最佳第一节点扩展顺序来扩展最佳优先搜索以解决更大的问题。某些以前的基于磁盘的搜索算法放弃了最佳优先搜索顺序,以提高效率。我们提供了两个案例研究,这些研究表明A *当使用延迟重复检测进行增强时,实际上比这些非最佳优先搜索顺序更有效。首先,我们提出了一个简单的A *外部变量,称为PEDAL,它稍微放宽了最佳优先顺序,以便在理论和实践上都具有I / O效率,即使是在具有实值节点成本的问题上也是如此。因为它很容易并行化,所以即使在具有很少重复状态的域(例如滑动拼贴难题)上,PEDAL也可以比内存中的IDA *更快。其次,我们介绍了一种称为PE2A *的PEDAL变体,该变体使用部分扩展来处理具有较大分支因子的问题。在对多序列比对问题进行测试时,PE2A *是第一种能够使用生物学上精确的成本函数来解决标准BAliBASE基准的整个参考集1的算法。这项工作表明,经典的最佳优先算法(如A *)可以应用于大型现实问题。我们还提供了详细的实施指南,其中包含源代码,用于基于并行磁盘的通用最佳优先搜索以及具有生物学上准确的成本函数的多序列比对。鉴于其作为通用问题解决方法的有效性,我们希望这使得更广泛的受众可以访问基于磁盘的并行搜索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号