首页> 外文会议>Asia and South Pacific Design Automation Conference >Throughput optimization for streaming applications on CPU-FPGA heterogeneous systems
【24h】

Throughput optimization for streaming applications on CPU-FPGA heterogeneous systems

机译:CPU-FPGA异构系统上流应用程序的吞吐量优化

获取原文

摘要

Streaming processing is an important technology that finds applications in networking, multimedia, signal processing, etc. However, it is very challenging to design and implement streaming applications as they impose complex constraints. First, the tasks involved in the streaming applications must complete the computation under a latency constraint. Second, streaming systems are built under more and more stringent power budget. Hence, power capping technique is employed to manage the power consumption for streaming systems. To accommodate these needs, heterogeneous systems that consist of CPUs and FPGAs are becoming increasingly popular due to their performance and power benefits. In this paper, we optimize the throughput for streaming applications on CPU-FPGA heterogeneous system under latency and power constraints. We develop two algorithms to map the tasks onto the heterogeneous system and order their execution by exploiting the heterogeneity in architectural capabilities and task characteristics. We also employ pipelining to improve the throughput by overlapping the execution of different frames and use frequency scaling to adjust the execution of tasks for power saving. Experiments using a variety of streaming applications show that our heterogeneous solution can successfully meet the latency and power constraints for the cases where the CPU implementation fails. Furthermore, our technique can improve the throughput by 37.32% on average.
机译:流处理是在网络,多媒体,信号处理等中找到应用程序的重要技术。但是,由于流应用程序施加了复杂的约束,因此设计和实现流应用程序非常具有挑战性。首先,流应用程序中涉及的任务必须在延迟约束下完成计算。其次,流系统是在越来越严格的功率预算下构建的。因此,功率封顶技术被用来管理流系统的功率消耗。为了满足这些需求,由CPU和FPGA组成的异构系统由于其性能和功耗优势而变得越来越受欢迎。在本文中,我们优化了在等待时间和功耗约束下CPU-FPGA异构系统上流应用程序的吞吐量。我们开发了两种算法来将任务映射到异构系统上,并通过利用体系结构功能和任务特征中的异构性来命令它们的执行。我们还采用流水线技术,通过重叠不同帧的执行来提高吞吐量,并使用频率缩放调整任务的执行以节省功耗。使用各种流应用程序进行的实验表明,对于CPU实施失败的情况,我们的异构解决方案可以成功满足延迟和功耗方面的限制。此外,我们的技术平均可以将吞吐量提高37.32%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号