首页> 外文会议>Parallel and distributed computing and networks >AN ENERGY EFFICIENT SMT PROCESSOR WITH HETEROGENEOUS INSTRUCTION SET ARCHITECTURES
【24h】

AN ENERGY EFFICIENT SMT PROCESSOR WITH HETEROGENEOUS INSTRUCTION SET ARCHITECTURES

机译:具有异构指令集架构的高效节能SMT处理器

获取原文
获取原文并翻译 | 示例

摘要

Recently, it has become popular to employ a multi-core processor that heterogeneously includes a conventional core and several VLIW/DSP cores to achieve high performance. This approach achieves not only quick integration of embedded OS and multimedia programs but also maintaining Quality of Service (QoS) on multimedia applications such as stereo matching. However, from the viewpoint of cost, multi-core processors that increase the chip area by incorporating discrete cores straightforwardly are not the best solution.For an energy efficient processor in embedded systems, we propose an SMT processor named OROCHI, which contains two heterogeneous front-end pipelines, which correspond to ARM ISA for conventional OS programs and FR-V ISA for VLIW multimedia applications, and a common back-end pipeline based on a VLIW processor. In this paper, we propose an instruction scheduling and issue mechanism for SMT execution of ARM and FRV instructions with a VLIW instruction queue. We also propose an asymmetric QoS mechanism to improve performance drop by cache miss stalls.Based on an ASIC implementation with a 0.25/um cell library, we compared our design with a traditional multi-core processor heterogeneously containing ARM and FRV cores. Regarding QoS on the VLIW side, the evaluation results show that the energy delay product is 6.0% better than that without the mechanism on SMT execution with high frequency of cache misses.
机译:近来,采用多核处理器变得越来越流行,该处理器异构地包括常规核和几个VLIW / DSP核以实现高性能。这种方法不仅可以实现嵌入式OS和多媒体程序的快速集成,而且可以在多媒体应用程序(如立体声匹配)上维持服务质量(QoS)。然而,从成本的角度来看,通过直接整合分立内核来增加芯片面积的多核处理器并不是最佳解决方案。对于嵌入式系统中的节能处理器,我们提出了一种名为OROCHI的SMT处理器,该处理器包含两个异构前端端管线,对应于常规OS程序的ARM ISA和VLIW多媒体应用程序的FR-V ISA,以及基于VLIW处理器的通用后端管线。在本文中,我们提出了使用VLIW指令队列对ARM和FRV指令进行SMT执行的指令调度和发出机制。我们还提出了一种非对称QoS机制,以改善由于缓存未命中而导致的性能下降。基于具有0.25 / um单元库的ASIC实现,我们将我们的设计与异构包含ARM和FRV内核的传统多核处理器进行了比较。关于VLIW端的QoS,评估结果表明,与没有SMT执行机制且高速缓存未命中率较高的机制相比,能量延迟乘积要好6.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号