首页> 外文学位 >Running Real-time Tasks on Embedded Systems Using GPU Computing.
【24h】

Running Real-time Tasks on Embedded Systems Using GPU Computing.

机译:使用GPU计算在嵌入式系统上运行实时任务。

获取原文
获取原文并翻译 | 示例

摘要

The advent of GPU computing allows us to leverage the computation power of GPUs in different domains beyond graphics. Since GPUs provide significant speedups in runtime and superior cost/power-per-performance, they are a good fit for embedded domains running real-time data-parallel tasks. However, meeting real-time constraints with the limited embedded resources on the parallel architecture of GPUs is a nontrivial research challenge. An even more challenging research question is how to develop approaches that allow GPUs to concurrently run multiple real-time embedded tasks. In this dissertation, I aim to address these challenges by first presenting my study which exemplifies the suitability of GPUs for performing embedded real-time tasks. Then, I go one step further and present my research demonstrating that GPUs can be used for running multiple disparate real-time embedded tasks concurrently.;In my first study, I perform template-based speed-limit-sign recognition task in real-time using a low-end GPU with limited resources. To meet real-time performance requirements using constrained resources, I use computer vision techniques well-suited for the parallel GPU architecture, allowing me to achieve a very fast runtime, and I build my pipeline from parametrized modules, allowing the best use of the limited resources by fine-tuning the parameters based on a trade-off between runtime and success rate. This recognition system I developed achieves 90% accuracy over 120 EU speed-limit signs on 45 minutes of video footage, and is superior to the 75% accuracy of a non-real-time SIFT-based recognition system implemented on the same GPU.;In my second study, I take into consideration the specific characteristics of GPU workloads, and I survey a wide spectrum of scheduling strategies for multitasking among real-time embedded tasks. Based on this investigation, I design several schedulers each using a combination of alternative approaches. Then, I compare the performance of schedulers on several workloads with different properties. Based on these comparisons, I determine which scheduling approach is more effective for a given workload and why. Some of the important conclusions of this study are as follows: (a) We should use the approach that runs kernels concurrently if we have small kernels. (b) If we have small kernels and kernel runtimes of higher-priority tasks are generally longer than those of lower-priority tasks, we should use the approach that changes the issue order dynamically to improve results of CPU schedulers running on the Fermi architecture. (c) Due to the limitations of the existing GPU architectures, we should at this time use the approach that performs CPU scheduling instead of the one that performs GPU scheduling. In this second study, I also highlight the shortcomings of current GPU architectures with regard to running multiple real-time tasks, and I recommend new features that, when added to the upcoming architectures, would allow better schedulers to be designed.
机译:GPU计算的出现使我们能够利用图形以外的其他领域的GPU的计算能力。由于GPU可以显着提高运行时的速度,并具有卓越的性能成本/性能,因此它们非常适合运行实时数据并行任务的嵌入式域。但是,在GPU的并行体系结构上使用有限的嵌入式资源来满足实时约束是一项艰巨的研究挑战。更具挑战性的研究问题是如何开发允许GPU同时运行多个实时嵌入式任务的方法。在本文中,我旨在通过首先介绍我的研究来应对这些挑战,该研究例证了GPU在执行嵌入式实时任务方面的适用性。然后,我进一步走了一步,介绍了我的研究,表明GPU可用于同时运行多个不同的实时嵌入式任务。;在我的第一个研究中,我实时执行了基于模板的限速标志识别任务。使用资源有限的低端GPU。为了使用有限的资源满足实时性能要求,我使用了非常适合并行GPU架构的计算机视觉技术,从而使我可以实现非常快的运行时间,并且我从参数化模块中构建了管道,从而可以充分利用有限的通过基于运行时间和成功率之间的折衷来微调参数,从而节省资源。我开发的这种识别系统在45分钟的视频片段上通过120个EU限速标志达到了90%的精度,并且优于在同一GPU上实现的基于SIFT的非实时识别系统的75%的精度。在我的第二项研究中,我考虑了GPU工作负载的特定特征,并且调查了用于实时嵌入式任务中多任务的多种调度策略。基于此调查,我设计了几种调度程序,每种调度程序都使用了多种替代方法。然后,我比较了具有不同属性的多个工作负载上调度程序的性能。根据这些比较,我确定对于给定的工作负载哪种调度方法更有效,以及为什么。这项研究的一些重要结论如下:(a)如果我们有小的内核,则应该使用同时运行内核的方法。 (b)如果我们的内核较小,并且优先级较高的任务的内核运行时间通常比优先级较低的任务的内核运行时间更长,则应使用动态更改发行顺序的方法来改善在Fermi架构上运行的CPU调度程序的结果。 (c)由于现有GPU架构的限制,我们此时应使用执行CPU调度的方法,而不是执行GPU调度的方法。在第二项研究中,我还将重点介绍当前GPU架构在运行多个实时任务方面的不足,并建议将新功能添加到即将推出的架构中,以设计出更好的调度程序。

著录项

  • 作者

    Muyan-Ozcelik, Pinar.;

  • 作者单位

    University of California, Davis.;

  • 授予单位 University of California, Davis.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 196 p.
  • 总页数 196
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号