【24h】

GPGPU-Based Parallel Algorithms for Scheduling Against Due Date

机译:基于GPGPU的到期日调度并行算法

获取原文

摘要

We present an in-depth analysis and implementation of parallel programming on two NP-hard combinatorial optimization problems, namely, the Common Due-Date (CDD) problem and the Unrestricted CDD with Controllable Processing Times (UCDDCP). The CDD and UCDDCP require scheduling and sequencing a certain number of jobs with different processing times on a single machine against a common due-date. The goal is to minimize the total weighted penalty incurred due to earliness or tardiness of the jobs and the penalty due to the compression of the processing times of the jobs. In the UCDDCP, the processing time of a job can be reduced by letting the machine work at a faster pace, which, however, comes at a (compression penalty) cost per time unit. Optimization for both is carried out by hybrid algorithms, composed of a metaheuristic that creates good job sequences and an (n) algorithm which finds the optimal completion times for the all the jobs in such sequences created by the metaheuristic algorithms. We investigate both Simulated Annealing (SA) and a Discrete Particle Swarm Algorithm (DPSO) for this purpose. Parallel versions of both algorithms are implemented based on CUDA™. Experiments are carried out on the benchmark instances provided in the OR-library and executed on a Nvidia® graphics processing unit. We find that the parallel SA algorithm performs very well while obtaining speedups of 100× within a deviation of two percent compared to the best known solutions. Furthermore, our parallel algorithms also improve the best known solution values for several benchmark instances.
机译:我们对两个NP困难的组合优化问题(即公共到期日(CDD)问题和具有可控制处理时间的无限制CDD)进行了深入的分析和并行编程的实现。 CDD和UCDDCP要求在同一台计算机上根据共同的截止日期对具有一定处理时间的一定数量的作业进行调度和排序。目的是最小化由于作业的过早或拖延而产生的总加权惩罚以及由于压缩作业的处理时间而导致的惩罚。在UCDDCP中,可以通过使机器以更快的速度工作来减少作业的处理时间,但是,这会增加(压缩损失)每时间单位的成本。两者的优化是由混合算法执行的,混合算法由创建良好作业序列的元启发式算法和(n)算法组成,该算法为由元启发式算法创建的此类序列中的所有作业找到最佳完成时间。为此,我们研究了模拟退火(SA)和离散粒子群算法(DPSO)。两种算法的并行版本均基于CUDA™实现。实验是在OR库中提供的基准实例上进行的,并在Nvidia®图形处理单元上执行。我们发现,与最著名的解决方案相比,并行SA算法在获得100倍加速(误差不超过2%)的同时性能非常好。此外,我们的并行算法还为多个基准实例提高了最著名的解决方案值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号