首页> 外文会议>Information Technology, Networking, Electronic and Automation Control Conference >A Spark-based Ant Lion Algorithm for Parameters Optimization of Random Forest in Credit Classification
【24h】

A Spark-based Ant Lion Algorithm for Parameters Optimization of Random Forest in Credit Classification

机译:信用分类中基于火花的蚁群算法优化随机森林参数

获取原文

摘要

According to the fact that in large scale credit classification there is no suitable method for determining the parameter combination when random forest algorithm is multi-calss classification to get the optimal classification result. Based on Spark, this paper proposed a parallel Ant Lion Optimizer(ALO) algorithm to seek optimal parameter combination. Compared with the traditional stand-alone ALO algorithm, the Spark-based parallel strategy initially divided the whole ant population into several sub-populations, then assigned one sub-population to one partition in the RDD, and specified independent evolution in one partition. Finally it utilized the migration operator to exchange information between sub-populations. In the experiment, the lending club loan data was used to classify the borrower's loan classification. It concluded that the Spark-based parallel Ant Lion Optimizer (SALO) algorithm, compared with traditional Ant Lion Optimizer algorithm, can efficiently find the better parameter combinations to improve the classification accuracy in the random forest parameter tuning process. In addition, under the big data distributed Spark platform, the SALO whose speed is so fast and acceleration effect is so obvious will be used as the next-generation parameter optimizer of the cloud computing platform.
机译:针对大规模信用分类中,随机森林算法为多等级分类时,没有合适的方法来确定参数组合的问题,从而得到最优的分类结果。基于Spark,本文提出了一种并行的Ant Lion Optimizer(ALO)算法,以寻求最优的参数组合。与传统的独立ALO算法相比,基于Spark的并行策略首先将整个蚂蚁种群划分为几个子种群,然后将一个子种群分配给RDD中的一个分区,并在一个分区中指定独立的演化。最后,它利用迁移运算符在子种群之间交换信息。在实验中,借贷俱乐部贷款数据用于对借款人的贷款分类。结论是,与传统的Ant Lion Optimizer算法相比,基于Spark的并行Ant Lion Optimizer(SALO)算法可以有效地找到更好的参数组合,从而提高随机森林参数调整过程中的分类精度。此外,在大数据分布式Spark平台下,速度快,加速效果显着的SALO将被用作云计算平台的下一代参数优化器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号