首页> 外文会议>2011 17th IEEE International Conference on Parallel and Distributed Systems >Automatic Extraction of Pipeline Parallelism for Embedded Software Using Linear Programming
【24h】

Automatic Extraction of Pipeline Parallelism for Embedded Software Using Linear Programming

机译:使用线性编程自动提取嵌入式软件的管道并行性

获取原文

摘要

The complexity and performance requirements of embedded software are continuously increasing, making Multiprocessor System-on-Chip (MPSoC) architectures more and more important in the domain of embedded and cyber-physical systems. Using multiple cores in a single system reduces problems concerning energy consumption, heat dissipation, and increases performance. Nevertheless, these benefits do not come for free. Porting existing, mostly sequential, applications to MPSoCs requires extracting efficient parallelism to utilize all available cores. Many embedded applications, like network services and multimedia tasks for voice-, image- and video processing, are operating on data streams and thus have a streaming-based structure. Despite the abundance of parallelism in streaming applications, it is a non-trivial task to split and efficiently map sequential applications to MPSoCs. Therefore, we present an algorithm which automatically extracts pipeline parallelism from sequential ANSI-C applications. The presented tool employs an integer linear programming (ILP) based approach enriched with an adequate cost model to automatically control the granularity of the parallelization. By applying our tool to real-life applications, it can be shown that our approach is able to speed up applications by a factor of up to 3.9x on a four-core MPSoC architecture, compared to a sequential execution.
机译:嵌入式软件的复杂性和性能要求不断提高,使得多处理器片上系统(MPSoC)架构在嵌入式和网络物理系统领域中越来越重要。在单个系统中使用多个内核可减少与能耗,散热有关的问题,并提高性能。但是,这些好处并不是免费提供的。将现有的(通常是顺序的)应用程序移植到MPSoC时,需要提取有效的并行性以利用所有可用的内核。许多嵌入式应用程序,例如网络服务和用于语音,图像和视频处理的多媒体任务,都在数据流上运行,因此具有基于流的结构。尽管流应用程序中存在大量并行性,但拆分并有效地将顺序应用程序映射到MPSoC并不是一项艰巨的任务。因此,我们提出了一种从顺序ANSI-C应用程序中自动提取管道并行性的算法。提出的工具采用了基于整数线性规划(ILP)的方法,该方法丰富了适当的成本模型,可以自动控制并行化的粒度。通过将我们的工具应用于现实生活中的应用,可以证明,与顺序执行相比,我们的方法能够在四核MPSoC架构上将应用程序加速高达3.9倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号