【24h】

Local Search in Histogram Construction

机译:直方图构造中的本地搜索

获取原文

摘要

The problem of dividing a sequence of values into segments occurs in database systems, information retrieval, and knowledge management. The challenge is to select a finite number of boundaries for the segments so as to optimize an objective error function defined over those segments. Although this optimization problem can be solved in polynomial time, the algorithm which achieves the minimum error does not scale well, hence it is not practical for applications with massive data sets. There is considerable research with numerous approximation and heuristic algorithms. Still, none of those approaches has resolved the quality-efficiency tradeoff in a satisfactory manner. In (Halim, Karras, and Yap 2009), we obtain near linear time algorithms which achieve both the desired scalability and near-optimal quality, thus dominating earlier approaches. In this paper, we show how two ideas from artificial intelligence, an efficient local search and recombination of multiple solutions reminiscent of genetic algorithms, are combined in a novel way to obtain state of the art histogram construction algorithms.
机译:将值序列划分为段的问题出现在数据库系统,信息检索和知识管理中。挑战在于为分段选择有限数量的边界,以优化在这些分段上定义的客观误差函数。尽管可以在多项式时间内解决此优化问题,但是实现最小误差的算法无法很好地扩展,因此对于具有大量数据集的应用程序不切实际。有大量的近似和启发式算法研究。但是,这些方法都没有以令人满意的方式解决质量效率折衷的问题。在(Halim,Karras,and Yap 2009)中,我们获得了接近线性的时间算法,该算法同时实现了所需的可伸缩性和接近最佳的质量,因此在早期方法中占主导地位。在本文中,我们展示了如何以一种新颖的方式将人工智能中的两个想法,即有效的局部搜索和多种解决方案的重组(让人联想到遗传算法)组合在一起,以获取最先进的直方图构造算法。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号