Analysis in performance and new model for multiple kernels executions on many-core architectures

机译：针对多核架构上的多个内核执行的性能和新模型的分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, due to massively parallel characteristics of current many-core architectures, these devices are not only being used in order to exploit data-parallelism and to minimize the execution time in a single problem, but, they are beginning to be used in order both to execute and to increase the performances when executing more than one application simultaneously. In this work, we provide a performance analysis on the use of current many-core architectures for this new purpose; this performance analysis has been carried out over two different many-core architectures. Furthermore, two different programming approaches to tackle this new role have been tested. The results so obtained show that a increase in the computational requirements implies an important fall in performance. The main objective of this paper is to explain the reasons for this behavior, and afterwards, to propose a set of alternatives to deal with these disadvantages previously mentioned.

机译：如今，由于当前多核体系结构的大规模并行特性，这些设备不仅被用于开发数据并行性并最大程度地减少单个问题的执行时间，而且还开始被用于同时使用这两种设备。同时执行多个应用程序时执行和提高性能。在这项工作中，我们提供了针对当前新内核的使用情况进行性能分析的方法。这种性能分析是在两种不同的多核体系结构上进行的。此外，已经测试了两种不同的编程方法来应对这一新角色。如此获得的结果表明，计算需求的增加意味着性能上的重大下降。本文的主要目的是解释这种行为的原因，然后，提出一套解决上述缺点的替代方法。

著录项

来源
《IEEE International Conference on Cognitive Informatics Cognitive Computing》|2013年|189-194|共6页
会议地点
作者
Valero-Lara Pedro; Pelayo Fernando L.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
CUDA; Heterogeneous architectures; Multiple kernels execution;

机译：CUDA;异构架构;多个内核执行;

相似文献

外文文献
中文文献
专利

1. Performance and Scalability Study of FMM Kernels on Novel Multi- and Many-core Architectures [J] . Antón Rey, Francisco D. Igual, Manuel Prieto-Matías, Procedia Computer Science . 2017,第期

机译：FMM内核对新型多核架构的性能和可扩展性研究
2. Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations [J] . Mei?Wen, Da-fei?Huang, Chang-qing?Xun, Frontiers of Information Technology & Electronic Engineering . 2015,第11期

机译：通过基于分析的转换，提高多核/多核CPU上特定于GPU的OpenCL内核的性能可移植性
3. Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations*# [J] . Mei WEN, Da-fei HUANG, Chang-qing XUN, 浙江大学学报（英文版）（C辑：计算机与电子） . 2015,第011期

机译：通过基于分析的转换来提高多核/多核CPU上特定于GPU的OpenCL内核的性能可移植性*＃
4. Analysis in performance and new model for multiple kernels executions on many-core architectures [C] . Valero-Lara Pedro, Pelayo Fernando L. IEEE International Conference on Cognitive Informatics Cognitive Computing . 2013

机译：多核架构中多内核执行的性能和新模型分析
5. Memory optimization in codelet execution model on many-core architectures [D] . Wu, Yao 2014

机译：许多核心架构上的Codelet执行模型中的内存优化
6. Meta-analysis of prediction model performance across multiplestudies: Which scale helps ensure between-study normality for theC-statistic and calibration measures? [O] . Kym IE Snell, Joie Ensor, Thomas PA Debray, -1

机译：跨多个预测模型性能的元分析研究：哪种量表有助于确保研究的正常性C统计和校准措施？
7. Figure 1: Application architecture of MKL-GRNI (A) Combined kernel (B) Decomposed regulation matrices (C) Parallel distribution and model building (D) Model execution (E) Writing results to shared object. [O] . -1

机译：图1：MKL-GRNI（A）的应用架构组合内核（B）分解规则矩阵（C）并行分布和模型构建（D）模型执行（E）将结果写入共享对象。

Analysis in performance and new model for multiple kernels executions on many-core architectures

摘要

著录项

相似文献

相关主题

期刊订阅