首页> 外文学位 >Tiling exploration and memory access optimization for embedded multimedia systems.
【24h】

Tiling exploration and memory access optimization for embedded multimedia systems.

机译:嵌入式多媒体系统的平铺浏览和内存访问优化。

获取原文
获取原文并翻译 | 示例

摘要

To continue the performance improvement in designing real-time portable embedded systems, three issues must be solved: the gap between the increasing processing power and the available battery-energy; the predictability required by the real-time behavior of applications; and the speed gap between processors and memories. It has been widely observed that 50%-75% of the power consumption in embedded multimedia systems is the consequence of memory accesses. Therefore, the responses to the three issues, the battery gap, the predictability and the memory wall, could actually converge into one answer---the efficient utilization of software-managed memories.; Although there is no "ideal" polynomial-time solution to general memory management problems, most embedded applications are dedicated to voice, image and video processing, exhibiting very regular computations for possible analytical means. In the dissertation, we focus on the memory access optimization for embedded multimedia applications. We notice that those applications are data flow intensive, whereas data streams cannot be abstracted in traditional loop models concerned only about execution and data dependence. To overcome the shortcoming, a novel loop representation model, xMDFG (eXtended Multi-dimensional Data Flow Graph), is proposed where data streams are aligned and connected with the computation domain, instead of being abandoned.; In order to take advantage of data locality without violating execution legality, we propose a stream-conscious tiling exploration technique based on the subscript function. The approach exhibits notable improvements in reducing memory access and minimizing communication cost. Since memory access, especially external memory access, makes a prominent contribution towards system level power consumption, this idea is further extended for the purpose of energy reduction with moderate modifications. Both of the two approaches are verified on TI's low power C55X DSP.; Embedded multimedia systems are often designed as SoCs, where either the custom IP designs (as hardware cores), or the CPU coding (as software core), significantly impact data localities and communication demands. To tackle this issue, we improve traditional loop transformation techniques by rescheduling multiple loop nests oriented on data streams. The case study on JPEG 2000 exhibits remarkable improvements in memory access efficiency, overall throughput, and system scalability. The ideas in this dissertation can be incorporated into a preprocessor or optimizing compiler for embedded multimedia systems.
机译:为了在设计实时便携式嵌入式系统中继续提高性能,必须解决三个问题:处理能力的提高和可用电池能量之间的差距;应用程序实时行为所需的可预测性;以及处理器和内存之间的速度差距。广泛观察到,嵌入式多媒体系统中50%-75%的功耗是存储器访问的结果。因此,对电池电量,可预测性和内存壁这三个问题的回答实际上可以融合为一个答案-软件管理内存的有效利用。尽管没有针对一般内存管理问题的“理想”多项式时间解决方案,但是大多数嵌入式应用程序专用于语音,图像和视频处理,并针对可能的分析手段展示出非常规则的计算。本文主要研究嵌入式多媒体应用的内存访问优化。我们注意到这些应用程序是数据流密集型的,而在仅关注执行和数据依赖性的传统循环模型中不能抽象数据流。为了克服该缺点,提出了一种新颖的循环表示模型xMDFG(扩展的多维数据流图),该模型将数据流对齐并与计算域连接,而不是被丢弃。为了在不违反执行合法性的情况下利用数据局部性,我们提出了一种基于下标函数的流意识分块探索技术。该方法在减少内存访问和最小化通信成本方面显示出显着的改进。由于内存访问,特别是外部内存访问,对系统级功耗做出了突出贡献,因此,此思想被进一步扩展,目的是通过适当的修改来降低能耗。两种方法都在TI的低功耗C55X DSP上进行了验证。嵌入式多媒体系统通常被设计为SoC,其中自定义IP设计(作为硬件核心)或CPU编码(作为软件核心)都会极大地影响数据的位置和通信需求。为了解决这个问题,我们通过重新安排面向数据流的多个循环嵌套来改进传统的循环转换技术。 JPEG 2000的案例研究在内存访问效率,整体吞吐量和系统可伸缩性方面显示出显着的改进。本文的思想可以结合到嵌入式多媒体系统的预处理器或优化编译器中。

著录项

  • 作者

    Zhang, Chunhui.;

  • 作者单位

    University of California, Irvine.;

  • 授予单位 University of California, Irvine.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 131 p.
  • 总页数 131
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号