Using Managed Runtime Systems to Tolerate Holes in Wearable Memories

机译：使用托管运行时系统将孔洞放在可穿戴存储器中的孔

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

New memory technologies, such as phase-change memory (PCM), promise denser and cheaper main memory, and are expected to displace DRAM. However, many of them experience permanent failures far more quickly than DRAM. DRAM mechanisms that handle permanent failures rely on very low failure rates and, if directly applied to PCM, are extremely inefficient: Discarding a page when the first line fails wastes 98% of the memory. This paper proposes low complexity cooperative software and hardware that handle failure rates as high as 50%. Our approach makes error handling transparent to the application by using the memory abstraction offered by managed languages. Once hardware error correction for a memory line is exhausted, rather than discarding the entire page, the hardware communicates the failed line to a failure-aware OS and runtime. The runtime ensures memory allocations never use failed lines and moves data when lines fail during program execution. This paper describes minimal extensions to an Immix mark-region garbage collector, which correctly utilizes pages with failed physical lines by skipping over failures. This paper also proposes hardware support that clusters failed lines at one end of a memory region to reduce fragmentation and improve performance under failures. Contrary to accepted hardware wisdom that advocates for wear-leveling, we show that with software support non-uniform failures delay the impact of memory failure. Together, these mechanisms incur no performance overhead when there are no failures and at failure levels of 10% to 50% suffer only an average overhead of 4% and 12%, respectively. These results indicate that hardware and software cooperation can greatly extend the life of wearable memories.

机译：新的内存技术，如相变内存（PCM），承诺密集和更便宜的主内存，并预计将取代DRAM。然而，其中许多人经历了比DRAM更快的永久性失败。处理永久故障的DRAM机制依赖于非常低的故障率，并且如果直接应用于PCM，则非常低效：当第一行失败时丢弃页面浪费98％的内存。本文提出了低复杂性协作软件和硬件，该硬件处理高达50％的故障率。我们的方法通过使用托管语言提供的内存抽象来处理对应用程序透明的错误。一旦内存行的硬件纠错耗尽，而不是丢弃整个页面，硬件将失败的行传送到故障感知的操作系统和运行时。运行时确保内存分配永远不会使用失败的线路，并在程序执行期间执行行失败时移动数据。本文介绍了对Immix Mark-Region垃圾收集器的最小扩展，通过跳过故障，正确利用具有失败物理线路的页面。本文还提出了硬件支持，该硬件支持将在内存区域的一端群集的群集失败，以减少碎片化并在故障下提高性能。与接受的硬件智慧相反，倡导佩戴佩戴的硬件智慧，我们展示了软件支持不均匀的故障延迟内存故障的影响。在一起，这些机制在没有失败的情况下，在10％至50％的失败水平下，这些机制均无效率开销，分别仅遭受4％和12％的平均开销。这些结果表明，硬件和软件合作可以大大延长可穿戴记忆的寿命。

著录项

来源
《ACM SIGPLAN Conference on Programming Language Design and Implementation》|2013年||共12页
会议地点
作者
Tiejun Gao; Karin Strauss; Stephen M. Blackburn; Kathryn S. McKinley; Doug Burger; James Larus;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP312-53;
关键词
Failure tolerance; Memory management; Phase-change memory; Reliability;

机译：失败耐受;内存管理;相变内存;可靠性;

相似文献

外文文献
中文文献
专利

1. Using Managed Runtime Systems to Tolerate Holes in Wearable Memories [J] . Tiejun Gao, Karin Strauss, Stephen M. Blackburn, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2013,第6期

机译：使用托管运行时系统容忍可穿戴内存中的漏洞
2. Leveraging Managed Runtime Systems to Build, Analyze, and Optimize Memory Graphs [J] . Rebecca Smith, Scott Rixner ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2016,第7期

机译：利用托管运行时系统构建，分析和优化内存图
3. Static Allocation of Basic Blocks Based on Runtime and Memory Requirements in Embedded Real-Time Systems with Hierarchical Memory Layout [J] . Philipp Jungklass, Mladen Berekovic OASIcs : OpenAccess Series in Informatics . 2021,第a期

机译：基于运行时和内存要求的基本块的静态分配，具有分层内存布局的嵌入式实时系统
4. Using Managed Runtime Systems to Tolerate Holes in Wearable Memories [C] . Tiejun Gao, Karin Strauss, Stephen M. Blackburn, Proceedings of the 2013 ACM SIGPLAN conference on programming language design and implementation . 2013

机译：使用托管运行时系统容忍可穿戴内存中的漏洞
5. Compiler and Runtime for Memory Management on Software Managed Manycore Processors. [D] . Bai, Ke. 2014

机译：在软件管理的Manycore处理器上进行内存管理的编译器和运行时。
6. Theory of minds: managing mental state inferences in working memory is associated with the dorsomedial subsystem of the default network and social integration [O] . Meghan L Meyer, Eleanor Collier 2020

机译：心智理论：管理工作记忆中的心理状态推论与默认网络和社会整合的背子系统相关
7. Using Managed Runtime Systems to Tolerate Holes in Wearable Memories [O] . Tiejun Gao, Karin Strauss, Stephen M. Blackburn, 2013

机译：使用托管运行时系统容忍可穿戴内存中的漏洞

Using Managed Runtime Systems to Tolerate Holes in Wearable Memories

摘要

著录项

相似文献

相关主题

期刊订阅