An Empirical Evaluation of Design Abstraction and Performance of Thrust Framework

机译：设计抽象与推力框架性能的实证评价

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

High performance computing applications are far more difficult to write, therefore, practitioners expect a well-tuned software to last long and provide optimized performance even when the hardware is upgraded. It may also be necessary to write software using sufficient abstraction over the hardware so that it is capable of running on heterogeneous architecture. Therefore, it is required to have a proper programming abstraction paradigm that strikes a balance between the abstraction and visibility over the hardware so that the programmer can write a program without having to understand the hardware nuances, yet exploit the compute power optimally. In this paper we have analyzed the power of design abstraction and performance of a popular design abstraction framework called Thrust. We have shown quantitatively that while it is easier to write an application using Thrust compared to writing the same in the native CUDA or OpenMP backends, the framework does not provide any abstraction over the memory hierarchy of the underlying backend to the programmer. We have compared the performance of three Thrust applications with their corresponding native versions in CUDA, OpenMP, Xeon-Phi and the CPP backends and demonstrate that the current Thrust version performs poorly in most of the cases when the application is compute intensive. However, the framework provides close to the native performance for a non-compute intensive applications. We analyze the reasons for the performance and highlight the improvements necessary for the framework.

机译：高性能计算应用更难以写入，因此，从业者期望一个良好调整的软件持续时间，即使硬件升级时，也要提供优化的性能。也可能需要使用硬件上使用足够的抽象来编写软件，以便它能够在异构架构上运行。因此，需要具有适当的编程抽象范例，可在硬件上击中抽象和可见性之间的平衡，使得程序员可以在不必了解硬件细微差别的情况下编写程序，但最佳地利用计算功率。在本文中，我们分析了名为推力的流行设计抽象框架的设计抽象和性能的力量。与在本机Cuda或Openmp后端的写入相同的相比，相比，我们已经定量地显示了使用推力的虽然在本机中的相同中编写应用程序，但是该框架不会向程序员的基础后端的内存层次结构提供任何抽象。将三个推力应用程序的性能与CUDA，OpenMP，Xeon-Phi和CPP后端的相应本机版本进行了比较，并证明当前推力版本在应用程序计算密集型时在大多数情况下表现不佳。但是，该框架提供了对非计算密集型应用程序的本机性能。我们分析了绩效的原因，并突出了框架所需的改进。

著录项

来源
《International Workshop on Embedded Multicore Systems》|2017年|320p|共10页
会议地点
作者
Ajai V. George; Sanket Rajan Gupte; Sankar Manoj; Santonu Sarkar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133.2-53;
关键词
Design Abstraction; Thrust; Shared memory; Cyclomatic complexity; CUDA; OpenMP; Xeon-Phi;

机译：设计抽象;推力;共享记忆;循环复杂性;CUDA;OPENMP;Xeon-PHI;

相似文献

外文文献
中文文献
专利

1. Thrust2D: A new design abstraction framework for structuredrngrid class of algorithms [J] . Santonu Sarkar, Ajai V George, Sankar Manoj Concurrency and computation: practice and experience . 2018,第19期

机译：Thrust2D：针对结构化算法类的新设计抽象框架
2. Proposing a Value-Added Indicators framework for the apparel and fashion sector: Design and empirical evaluation [J] . Bertolini M., Romagnoli G., Weinhard A. International journal of RF technologies: research and applications . 2017,第3期

机译：为服装和时装行业提出增值指标框架：设计和实证评估
3. Design and Evaluation of a Cross-Layer Framework for Improving 802.11 Networks:An Empirical Study [J] . Nurul I. Sarkar International journal of business data communications and networking . 2013,第1期

机译：改进802.11网络的跨层框架的设计和评估：一项实证研究
4. An Empirical Evaluation of Design Abstraction and Performance of Thrust Framework [C] . Ajai V. George, Sanket Rajan Gupte, Sankar Manoj, International Workshop on Embedded Multicore Systems . 2017

机译：设计抽象与推力框架性能的实证评价
5. The Design and Empirical Evaluation of the Core-satellite Framework for Urban Passenger Data Collection [D] . Loa, Patrick Maung Soe Win Htun. 2019

机译：城市乘客数据收集核心卫星框架的设计与实证评价
6. Measuring coverage of infant and young child feeding counselling interventions: A framework and empirical considerations for survey question design [O] . Jowel Choufani, Sunny S. Kim, Phuong Hong Nguyen, 2020

机译：婴幼儿饲养咨询干预措施的覆盖：调查问题设计的框架和实证考虑因素
7. Influence of the Fluid Inertia Forces on the Dynamic Characteristics of Externally Pressurized Thrust Bearings : 2nd Report, Evaluation of Various Approximate Solutions for the Influence of Film Inertia Forces on the Dynamic Performance of Externally Pressurized Infinitely Long Thrust Bearings [O] . Haruyama Yoshio, Mori Atsunobu, Kazamaki Tsuneji, 1985

机译：流体惯性力对外压推力轴承动态特性的影响：第2报告，膜惯性力对外加压无限长推力轴承动态性能影响的各种近似解的评价

An Empirical Evaluation of Design Abstraction and Performance of Thrust Framework

摘要

著录项

相似文献

相关主题

期刊订阅