首页> 外文期刊>Neural computing & applications >Q-learning-based simulated annealing algorithm for constrained engineering design problems
【24h】

Q-learning-based simulated annealing algorithm for constrained engineering design problems

机译:基于Q基于Q学习的制定工程设计问题的模拟退火算法

获取原文
获取原文并翻译 | 示例
       

摘要

Simulated annealing (SA) was recognized as an effective local search optimizer, and it showed a great success in many real-world optimization problems. However, it has slow convergence rate and its performance is widely affected by the settings of its parameters, namely the annealing factor and the mutation rate. To mitigate these limitations, this study presents an enhanced optimizer that integrates Q-learning algorithm with SA in a single optimization model, named QLSA. In particular, the Q-learning algorithm is embedded into SA to enhance its performances by controlling its parameters adaptively at run time. The main characteristics of Q-learning are that it applies reward/penalty technique to keep track of the best performing values of these parameters, i.e., annealing factor and the mutation rate. To evaluate the effectiveness of the proposed QLSA algorithm, a total of seven constrained engineering design problems were used in this study. The outcomes show that QLSA was able to report a mean fitness value of 1.33 on cantilever beam design, 263.60 on three-bar truss design, 1.72 on welded beam design, 5905.42 on pressure vessel design, 0.0126 on compression coil spring design, 0.25 on multiple disk clutch brake design, and 2994.47 on speed reducer design problem. Further analysis was conducted by comparing QLSA with the state-of-the-art population optimization algorithms including PSO, GWO, CLPSO, harmony, and ABC. The reported results show that QLSA significantly (i.e., 95% confidence level) outperforms other studied algorithms.
机译:模拟退火(SA)被认为是一个有效的本地搜索优化器,它在许多真实的优化问题中表现出巨大的成功。然而,它具有缓慢的收敛速度,其性能受到其参数设置的广泛影响,即退火因子和突变率。为了缓解这些限制,本研究提高了一个增强的优化器,它将Q学习算法与SA集成在一个名为QLSA的单个优化模型中。特别地,Q学习算法嵌入到SA中以通过在运行时自适应地控制其参数来增强其性能。 Q学习的主要特征是它应用奖励/惩罚技术,以跟踪这些参数的最佳性能,即退火因子和突变率。为了评估所提出的QLSA算法的有效性,本研究共使用了七种约束的工程设计问题。结果表明,QLSA能够在悬臂梁设计上报告1.33的平均适应值,263.60在三条桁架设计上,1.72焊接梁设计,5905.42对压力容器设计,0.0126对压缩螺旋弹簧设计,0.25磁盘离合器制动器设计和2994.47减速器设计问题。通过将QLSA与最先进的人口优化算法进行比较,进行进一步分析,包括PSO,GWO,CLPSO,和谐和ABC。据报道的结果表明,QLSA显着(即95%的置信水平)优于其他研究的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号