首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Mapping Streaming Applications on Commodity Multi-CPU and GPU On-Chip Processors
【24h】

Mapping Streaming Applications on Commodity Multi-CPU and GPU On-Chip Processors

机译:在商品多CPU和GPU片上处理器上映射流应用程序

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In this paper, we consider the problem of efficiently executing streaming applications on commodity processors composed of several cores and an on-chip GPU. Streaming applications, such as those in vision and video analytic, consist of a pipeline of stages and are good candidates to take advantage of this type of platforms. We also consider that characteristics of the input may change while the application is running. Therefore, we propose a framework that adaptively finds the optimal mapping of the pipeline stages. The core of the framework is an analytical model coupled with information collected at runtime used to dynamically map each pipeline stage to the most efficient device, taking into consideration both performance and energy. Our experimental results show that for the evaluated applications running on two different architectures, our model always predicts the best configuration among the evaluated alternatives, and significantly reduces the amount of information that needs to be collected at runtime. This best configuration has, on the average, 20 percent higher throughput than the configuration recommended by a baseline state of the art approach, while the ratio throughput/energy is 43 percent higher. We have measured improvements in throughput and throughput/energy of up-to 81 and 204 percent, respectively, when the model is used to adapt to a video that changes from low to high definition.
机译:在本文中,我们考虑了在由多个内核和一个片上GPU组成的商用处理器上高效执行流应用程序的问题。流应用程序,例如视觉和视频分析中的应用程序,由阶段的流水线组成,是利用此类平台的很好的候选者。我们还认为,在应用程序运行时,输入的特征可能会更改。因此,我们提出了一个框架,可以自适应地找到流水线阶段的最佳映射。该框架的核心是一个分析模型,结合了运行时收集的信息,这些信息用于将每个管道阶段动态映射到最高效的设备,同时考虑到性能和能源。我们的实验结果表明,对于在两种不同体系结构上运行的评估应用程序,我们的模型始终可以预测评估替代方案中的最佳配置,并显着减少了运行时需要收集的信息量。这种最佳配置的吞吐量平均比最先进的基线方法建议的配置高20%,而吞吐量/能量之比则高43%。当模型用于适应从低清晰度到高清晰度的视频时,我们测得的吞吐量和吞吐量/能量的改善分别高达81%和204%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号