首页> 外文期刊>IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems >Fast and Correct Load-Link/Store-Conditional Instruction Handling in DBT Systems
【24h】

Fast and Correct Load-Link/Store-Conditional Instruction Handling in DBT Systems

机译:DBT系统中快速和正确的负载链路/商店条件指令处理

获取原文
获取原文并翻译 | 示例
           

摘要

Dynamic binary translation (DBT) requires the implementation of load-link/store-conditional (LL/SC) primitives for guest systems that rely on this form of synchronization. When targeting, e.g., x86 host systems, LL/SC guest instructions are typically emulated using atomic compare-and-swap (CAS) instructions on the host. Whilst this direct mapping is efficient, this approach is problematic due to subtle differences between LL/SC and CAS semantics. In this article, we demonstrate that this is a real problem, and we provide code examples that fail to execute correctly on QEMU and a commercial DBT system, which both use the CAS approach to LL/SC emulation. We then develop two novel and provably correct LL/SC emulation schemes: 1) a purely software-based scheme, which uses the DBT system's page translation cache for correctly selecting between fast, but unsynchronized, and slow, but fully synchronized memory accesses and 2) a hardware-accelerated scheme that leverages hardware transactional memory (HTM) provided by the host. We have implemented these two schemes in the Synopsys DesignWare ARC nSIM DBT system, and we evaluate our implementations against full applications, and targeted microbenchmarks. We demonstrate that our novel schemes are not only correct but also deliver competitive performance on-par or better than the widely used, but broken CAS scheme.
机译:动态二进制转换(DBT)需要为依赖这种同步形式的客户系统执行Load-Link / Store-Confenceal(LL / SC)原语。瞄准时,例如,X86主机系统,LL / SC客户的指示通常使用主机上的原子比较和交换(CAS)指令模拟。虽然这种直接映射是有效的,但由于LL / SC和CAS语义之间的微妙差异,这种方法存在问题。在本文中,我们证明这是一个真正的问题,我们提供了无法在QEMU和商业DBT系统上正确执行的代码示例,这两者都使用CAS方法LL / SC仿真。然后,我们开发两种新颖且可提供正确的LL / SC仿真计划:1)一种基于软件的方案,它使用DBT系统的页面转换缓存进行正确选择快速但不同步,慢速,但完全同步的内存访问和2 )一种硬件加速方案,它利用主机提供的硬件事务存储器(HTM)。我们在Synopsys Designware Arc NSIM DBT系统中实施了这两种方案,我们评估了我们针对完整应用程序和有针对性的Microbenchmark的实现。我们证明我们的小说计划不仅是正确的,而且还可以在普遍使用的情况下提供竞争性能,而不是被广泛使用的CAS方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号