Efficient Job Offloading in Heterogeneous Systems Through Hardware-Assisted Packet-Based Dispatching and User-Level Runtime Infrastructure

Tomoutzoglou Othon; Mbakoyiannis Dimitris; Kornaros George; Coppola Marcello

首页> 外文期刊>IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems >Efficient Job Offloading in Heterogeneous Systems Through Hardware-Assisted Packet-Based Dispatching and User-Level Runtime Infrastructure

【24h】

Efficient Job Offloading in Heterogeneous Systems Through Hardware-Assisted Packet-Based Dispatching and User-Level Runtime Infrastructure

机译：通过基于硬件辅助数据包的调度和用户级运行时基础架构在异构系统中卸载异构系统的高效工作

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Emerging heterogeneous systems architectures increasingly integrate general-purpose processors, GPUs, and other specialized computational units to provide both power and performance benefits. While the motivations for developing systems with accelerators are clear, it is important to design efficient dispatching mechanisms in terms of performance and energy while leveraging programmability and orchestration of the diverse computational components. In this paper, we present an infrastructure composed of a hardware, general, packet-based processing-dispatching unit, named generic packet processing unit (GPPU), and of an associated runtime that facilitates user-level access to GPPU objects, such as packets, queues, and contexts. Hence, we remove drawbacks of traditional costly user-to-kernel-level operations, low-level accelerator subtleties that hinder programming productivity, along with architectural obstacles such as handling accelerators' unified virtual address space. We present the design and evaluation of our framework by integrating the GPPU infrastructure with data streaming type accelerators, image filtering, and matrix multiplication, tightly coupled to ARMv8 architecture via unified virtual memory. Under scaling workload our proposed dispatching methods can deliver $3.7{imes }$ performance improvement over baseline offloading, and up to $4.7{imes }$ better energy efficiency.

机译：新兴异构系统架构越来越集成了通用处理器，GPU和其他专用计算单元，以提供功率和性能效益。虽然具有加速器的开发系统的动机很清楚，但对于在性能和能量方面设计有效的调度机制非常重要，同时利用各种计算组件的可编程性和编排。在本文中，我们介绍了一个由硬件，一般数据包的处理调度单元组成的基础架构，名为通用分组处理单元（GPPU），以及促进对GPPU对象（例如数据包）的用户级访问的关联运行时，队列和上下文。因此，我们删除了传统的昂贵的用户到内核级操作，妨碍编程生产力的低级加速器微妙之处，以及诸如处理加速器统一虚拟地址空间的架构障碍。我们通过将GPPU基础架构与数据流型加速器，图像过滤和矩阵乘法集成，通过统一虚拟内存紧密地耦合到ARMv8架构来介绍我们的框架的设计和评估。在缩放工作负载下，我们建议的调度方法可以通过基线卸载提供3.7 { times} $性能改进，最高可达4.7美元{ times}。

著录项

来源
《IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems》 |2020年第5期|1017-1030|共14页
作者
Tomoutzoglou Othon; Mbakoyiannis Dimitris; Kornaros George; Coppola Marcello;
展开▼
作者单位

Technol Educ Inst Crete Informat Engn Dept Iraklion 71410 Greece;

Technol Educ Inst Crete Informat Engn Dept Iraklion 71410 Greece;

Technol Educ Inst Crete Informat Engn Dept Iraklion 71410 Greece;

STMicroelect Dept MCD Grenoble 38000 France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
CPU-accelerators unified address space; hardware-assisted offloading; packet-based dispatching; user-level embedded systems dispatching;

机译：CPU-Accelerators统一地址空间;硬件辅助卸载;基于数据包的调度;用户级嵌入式系统调度;

相似文献

外文文献
中文文献
专利

1. Using Runtime Systems Tools to Implement Efficient Preconditioners for Heterogeneous Architectures [J] . Roussel Adrien, Gratien Jean-Marc, Gautier Thierry Oil & gas science and technology . 2016,第6期

机译：使用运行时系统工具为异构体系结构实现高效的预处理器
2. Using Runtime Systems Tools to Implement Efficient Preconditioners for Heterogeneous Architectures [J] . Adrien Roussel, Jean-Marc Gratien, Thierry Gautier Oil & gas science and technology . 2016,第6期

机译：使用运行时系统工具为异构体系结构实现高效的预处理器
3. Mobile device power models for energy efficient dynamic offloading at runtime [J] . Farhan Azmat Ali, Pieter Simoens, Tim Verbelen, The Journal of Systems and Software . 2016,第mara期

机译：移动设备功耗模型，可在运行时实现节能高效的动态卸载
4. Enabling Efficient Job Dispatching in Accelerator-Extended Heterogeneous Systems with Unified Address Space [C] . Georgios Kornaros, Marcello Coppola International Symposium on Computer Architecture and High Performance Computing . 2018

机译：使用统一地址空间在加速器扩展的异构系统中实现高效的作业调度
5. Towards a Next-Generation Runtime Infrastructure Engine for Configuration Management Systems. [D] . Alzabarah, Ali. 2014

机译：面向配置管理系统的下一代运行时基础结构引擎。
6. A systematic review on silica- carbon- and magnetic materials-supported copper species as efficient heterogeneous nanocatalysts in click reactions [O] . Pezhman Shiri, Jasem Aboonajmi 2020

机译：对二氧化硅碳 - 和磁性材料的系统回顾支持的铜物种作为点击反应中有效的异质纳米催化剂
7. Hardware-assisted Remote Runtime Attestation for Critical Embedded Systems [O] . Munir Geden, Kasper Rasmussen 2019

机译：用于关键嵌入式系统的硬件辅助远程运行时间证明
8. Using Markov Decision Processes with Heterogeneous Queueing Systems to Examine Military MEDEVAC Dispatching Policies. [R] . Jenkins, P. R. 2017

机译：利用具有异构排队系统的马尔可夫决策过程来检验军事mEDEVaC调度策略。

Efficient Job Offloading in Heterogeneous Systems Through Hardware-Assisted Packet-Based Dispatching and User-Level Runtime Infrastructure

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅