A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck

Dasygenis M.; Brockmeyer E.; Durinck B.; Catthoor F.; Soudris D.; Thanailakis A.

首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck

【24h】

A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck

机译：结合DMA和特定于应用程序的预取方法来解决内存延迟瓶颈

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Memory latency has always been a major issue in embedded systems that execute memory-intensive applications. This is even more true as the gap between processor and memory speed continues to grow. Hardware and software prefetching have been shown to be effective in tolerating the large memory latencies inherit in large off-chip memories; however, both types of prefetching have their shortcomings. Hardware schemes are more complex and require extra circuitry to compute data access strides, while software schemes generate prefetch instructions, which if not computed carefully may hamper performance. On the other hand, some applications domains (such as multimedia) have a uniform and known a priori memory access pattern, that if exploited, could yield significant application performance improvement. With this characteristic in mind, we present our findings on hiding memory latency using the direct memory access (DMA) mode, which is present in all modern systems, combined with a software prefetch mechanism, and a customized on-chip memory hierarchy mapping. Compared to previous approaches, we are able to estimate the performance and power metrics, without actually implementing the embedded system. Experimental results on nine well known multimedia and imaging applications prove the efficiency of our technique. Finally, we verify the performance estimations by implementing and simulating the algorithms on the TI C6201 processor.

机译：内存延迟一直是执行内存密集型应用程序的嵌入式系统中的主要问题。随着处理器和内存速度之间的差距不断扩大，这一点更加真实。硬件和软件预取已被证明可以有效地容忍大片外存储器中继承的大存储延迟。但是，两种类型的预取都有其缺点。硬件方案更加复杂，需要额外的电路来计算数据访问步幅，而软件方案会生成预取指令，如果不仔细计算可能会影响性能。另一方面，某些应用程序域（例如多媒体）具有统一的已知先验内存访问模式，如果被利用，则可以显着提高应用程序性能。考虑到这一特征，我们介绍了使用直接内存访问（DMA）模式隐藏内存延迟的发现，该模式在所有现代系统中都存在，并结合了软件预取机制和定制的片上内存层次结构映射。与以前的方法相比，我们能够估计性能和功耗指标，而无需实际实现嵌入式系统。在九种众所周知的多媒体和影像应用程序上的实验结果证明了我们技术的效率。最后，我们通过在TI C6201处理器上实施和仿真算法来验证性能估计。

著录项

来源
《IEEE transactions on very large scale integration (VLSI) systems》 |2006年第3期|p.279-291|共13页
作者
Dasygenis M.; Brockmeyer E.; Durinck B.; Catthoor F.; Soudris D.; Thanailakis A.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类微电子学、集成电路（IC）;
关键词
embedded systems; file organisation; integrated memory circuits; memory architecture; storage management; TI C6201 processor; application-specific prefetching; computer-aided analysis; data access; direct memory access; embedded systems; memory access pattern; memor;

机译：嵌入式系统;文件组织;集成内存电路;内存体系结构;存储管理;TI C6201处理器;专用预取;计算机辅助分析;数据访问;直接内存访问;嵌入式系统;内存访问模式;内存;

相似文献

外文文献
中文文献
专利

1. A Novel Prefetching Approach for Effective Web Latency Reduction [J] . Xiangrui Yang, Zhigang Sun International journal of computational intelligence research . 2018,第3期

机译：一种有效减少Web延迟的新颖预取方法
2. An Efficient Approach For Optimal Prefetching To Reduce Web Access Latency. [J] . Dinesh Kumar, Reena Patel International Journal of Scientific & Technology Research . 2014,第7期

机译：最佳预取以减少Web访问延迟的有效方法。
3. Prefetching the means for document transfer: a new approach for reducing Web latency [J] . Edith Cohen, Haim Kaplan Computer networks . 2002,第4期

机译：预取文档传输方式：减少Web延迟的新方法
4. A Memory Hierarchical Layer Assigning and Prefetching Technique to Overcome the Memory Performance/Energy Bottleneck [C] . Minas Dasygenis, Erik Brockmeyer, Bart Durinck, International Conference on Advances in Computer Enterntainment Technology . 2009

机译：内存分层层分配和预取技术可克服内存性能/能耗瓶颈
5. A Power Conservation Methodology for Hard Drives by Combining Prefetching Algorithms and Flash Memory [D] . Halper, Raymond J. 2013

机译：结合预取算法和闪存的硬盘驱动器节能方法
6. Research comment: The case for adopting a combined comparative medicine and One Health approach to tackle emerging diseases [O] . Margaret J Hosie, Seema Jasim -1

机译：研究评论：采用合并比较医学和一种健康方法来解决新兴疾病的情况
7. Scalable and Efficient Virtual Memory Sharing in Heterogeneous SoCs with TLB Prefetching and MMU-Aware DMA Engine [O] . Andreas Kurth, Pirmin Vogel, Andrea Marongiu, 2018

机译：具有TLB预取和MMU感知DMA引擎的异构SoC中的可扩展和高效的虚拟内存共享

A combined DMA and application-specific prefetching approach for tackling the memory latency bottleneck

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅