首页> 外文会议>World Congress on Nature and Biologically Inspired Computing >Scaling Genetic Programming for data classification using MapReduce methodology
【24h】

Scaling Genetic Programming for data classification using MapReduce methodology

机译:使用MapReduce方法进行数据分类的扩展遗传编程

获取原文

摘要

Genetic Programming (GP) is an optimization method that has proved to achieve good results. It solves problems by generating programs and applying natural operations on these programs until a good solution is found. GP has been used to solve many classifications problems, however, its drawback is the long execution time. When GP is applied on the classification task, the execution time proportionally increases with the dataset size. Therefore, to manage the long execution time, the GP algorithm is parallelized in order to speed up the classification process. Our GP is implemented based on the MapReduce methodology (abbreviated as MRGP), in order to benefit from the MapReduce concept in terms of fault tolerance, load balancing, and data locality. MRGP does not only accelerate the execution time of GP for large datasets, it also provides the ability to use large population sizes, thus finding the best result in fewer numbers of generations. MRGP is evaluated using different population sizes ranging from 1,000 to 100,000 measuring the accuracy, scalability, and speedup.
机译:遗传编程(GP)是一种优化方法,已被证明可以实现良好的效果。它通过生成程序并在这些程序上应用自然操作来解决问题,直到找到良好的解决方案。 GP已被用来解决许多分类问题,但是,它的缺点是长时间的执行时间。当在分类任务上应用GP时,执行时间与数据集大小成比例地增加。因此,为了管理长执行时间,GP算法并行化以加速分类过程。我们的GP是基于MapReduce方法(缩写为MRGP)来实现的,以便在容错,负载平衡和数据位置中受益MapReduce概念。 MRGP不仅加快GP为大型数据集的执行时间,它还提供了使用大群尺寸的能力,从而找到最佳导致较少的几代人。 MRGP使用不同的群体尺寸评估,范围从1,000到100,000测量准确性,可扩展性和加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号