Self-Optimizing and Self-Programming Computing Systems: A Combined Compiler, Complex Networks, and Machine Learning Approach

Xiao Yao; Nazarian Shahin; Bogdan Paul

首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >Self-Optimizing and Self-Programming Computing Systems: A Combined Compiler, Complex Networks, and Machine Learning Approach

【24h】

Self-Optimizing and Self-Programming Computing Systems: A Combined Compiler, Complex Networks, and Machine Learning Approach

机译：自优化和自编程计算系统：组合的编译器，复杂的网络和机器学习方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

There exists an urgent need for determining the right amount and type of specialization while making a heterogeneous system as programmable and flexible as possible. Therefore, in this paper, we pioneer a self-optimizing and selfprogramming computing system (SOSPCS) design framework that achieves both programmability and flexibility and exploits computing heterogeneity [e.g., CPUs, GPUs, and hardware accelerators (HWAs)]. First, at compile time, we form a task pool consisting of hybrid tasks with different processing element (PE) affinities according to target applications. Tasks preferred to be executed on GPUs or accelerators are detected from target applications by neural networks. Tasks suitable to run on CPUs are formed by community detection to minimize data movement overhead. Next, a distributed reinforcement learning-based approach is used at runtime to allow agents to map the tasks onto the network-on-chip-based heterogeneous PEs by learning an optimal policy based on Q values in the environment. We have conducted experiments on a heterogeneous platform consisting of CPUs, GPUs, and HWAs with deep learning algorithms such as matrix multiplication, ReLU, and sigmoid functions. We concluded that SOSPCS provides performance improvement up to 4.12x and energy reduction up to 3.24x compared to the state-of-the-art approaches.

机译：迫切需要确定合适的专业化数量和类型，同时使异构系统尽可能地可编程和灵活。因此，在本文中，我们率先提出了一种自优化和自编程计算系统（SOSPCS）设计框架，该框架可实现可编程性和灵活性，并利用计算的异构性（例如CPU，GPU和硬件加速器（HWA））。首先，在编译时，我们根据目标应用程序形成了一个由具有不同处理元素（PE）亲和力的混合任务组成的任务池。通过神经网络从目标应用程序中检测优先在GPU或加速器上执行的任务。通过社区检测可以形成适合在CPU上运行的任务，以最大程度地减少数据移动开销。接下来，在运行时使用基于分布式强化学习的方法，以允许代理通过学习基于环境中Q值的最佳策略，将任务映射到基于芯片网络的异构PE。我们已经在由CPU，GPU和HWA组成的异构平台上进行了实验，这些平台具有深度学习算法，例如矩阵乘法，ReLU和Sigmoid函数。我们得出的结论是，与最先进的方法相比，SOSPCS的性能提高了4.12倍，能耗降低了3.24倍。

著录项

来源
《IEEE transactions on very large scale integration (VLSI) systems》 |2019年第6期|1416-1427|共12页
作者
Xiao Yao; Nazarian Shahin; Bogdan Paul;
展开▼
作者单位

Univ Southern Calif, Dept Elect & Comp Engn, Viterbi Sch Engn, Los Angeles, CA 90089 USA;

Univ Southern Calif, Dept Elect & Comp Engn, Viterbi Sch Engn, Los Angeles, CA 90089 USA;

Univ Southern Calif, Dept Elect & Comp Engn, Viterbi Sch Engn, Los Angeles, CA 90089 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Distributed Q-learning; domain-specific system-on-chip (DSSoC); heterogeneous systems; network-on-chip (NoC); neural networks (NNs); self-optimizing; self-programming; software-defined hardware (SDH);

机译：分布式Q-Learning;域特定的片上系统（DSSOC）;异构系统;片上网（NOC）;神经网络（NNS）;自我优化;自我编程;自我编程;软件定义的硬件（SDH）;

相似文献

外文文献
中文文献
专利

1. Self-Optimizing and Self-Programming Computing Systems: A Combined Compiler, Complex Networks, and Machine Learning Approach [J] . Xiao Yao, Nazarian Shahin, Bogdan Paul IEEE transactions on very large scale integration (VLSI) systems . 2019,第6期

机译：自我优化和自我编程计算系统：组合编译器，复杂网络和机器学习方法
2. Combining Local Representative Networks to Improve Learning in Complex Nonlinear Learning Systems [J] . Goutam CHAKRABORTY, Masayuki SAWADA, Shoichi NOGUCHI IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 1997,第9期

机译：结合本地代表网络以改善复杂非线性学习系统中的学习
3. A Machine Learning Approach for Task and Resource Allocation in Mobile-Edge Computing-Based Networks [J] . Sihua Wang, Mingzhe Chen, Xuanlin Liu, Internet of Things Journal, IEEE . 2021,第3期

机译：基于移动边缘计算网络中的任务和资源分配机器学习方法
4. S4oC: A Self-Optimizing, Self-Adapting Secure System-on-Chip Design Framework to Tackle Unknown Threats — A Network Theoretic, Learning Approach [C] . Shahin Nazarian, Paul Bogdan IEEE International Symposium on Circuits and Systems . 2020

机译：S4oC：一种自我优化，自适应的安全片上系统设计框架，以应对未知威胁—网络理论，学习方法
5. Uncovering Patterns in Complex Data with Reservoir Computing and Network Analytics: A Dynamical Systems Approach [D] . Krishnagopal, Sanjukta . 2020

机译：使用储层计算和网络分析揭示复杂数据中的模式：动态系统方法
6. Complex Networks Govern Coiled-Coil Oligomerization – Predicting and Profiling by Means of a Machine Learning Approach [O] . Carsten C. Mahrenholz, Ingrid G. Abfalter, Ulrich Bodenhofer, 2011

机译：复杂的网络控制卷材的齐聚反应-通过机器学习方法进行预测和分析
7. A Machine Learning Approach for Task and Resource Allocation in Mobile-Edge Computing-Based Networks [O] . Sihua Wang, Mingzhe Chen, Xuanlin Liu, 2021

机译：基于移动边缘计算网络中的任务和资源分配机器学习方法

Self-Optimizing and Self-Programming Computing Systems: A Combined Compiler, Complex Networks, and Machine Learning Approach

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅