Conference: IEEE/ACS International Conference on Computer Systems and Applications

Auto-tuning TRSM with an asynchronous task assignment model on multicore, multi-GPU and coprocessor systems



Abstract

The growing demand for computing power motivates a continuous search for techniques that reduce the time needed to solve common computational problems. To take advantage of new hybrid parallel architectures composed of multithreaded and multiprocessor hardware, our current efforts involve the design and validation of highly parallel algorithms that efficiently exploit the characteristics of such architectures. In this paper, we propose an automatic tuning methodology for exploiting multicore, multi-GPU and coprocessor systems with little effort. We present an optimization of an algorithm for solving triangular systems (TRSM), based on block decomposition and asynchronous task assignment, and discuss some results.
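To make the phrase "block decomposition and asynchronous task assignment" concrete, here is a minimal CPU-only sketch in plain Python. It is not the authors' implementation: the function names (`solve_diag_block`, `update_block`, `blocked_trsm`), the thread pool standing in for cores, GPUs and coprocessors, and the per-block locks are all assumptions made for illustration. It solves L X = B for a lower-triangular L by solving one diagonal block at a time and handing the trailing updates to workers as independent tasks.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock


def solve_diag_block(L, X, lo, hi):
    """Forward substitution on rows lo..hi-1 of X, in place."""
    for r in range(lo, hi):
        for c in range(len(X[r])):
            s = sum(L[r][j] * X[j][c] for j in range(lo, r))
            X[r][c] = (X[r][c] - s) / L[r][r]


def update_block(L, X, rows, cols, lock):
    """Trailing update: X[rows] -= L[rows, cols] * X[cols]."""
    r_lo, r_hi = rows
    c_lo, c_hi = cols
    m = len(X[0])
    # Compute the product outside the lock; X[cols] is already final.
    delta = [
        [sum(L[r][j] * X[j][c] for j in range(c_lo, c_hi)) for c in range(m)]
        for r in range(r_lo, r_hi)
    ]
    # Several updates may target the same block row, so serialize the write.
    with lock:
        for r in range(r_lo, r_hi):
            for c in range(m):
                X[r][c] -= delta[r - r_lo][c]


def blocked_trsm(L, B, block_size=2, workers=4):
    """Solve L X = B (L lower triangular) block row by block row."""
    n = len(L)
    X = [row[:] for row in B]  # work on a copy of the right-hand side
    bounds = [(lo, min(lo + block_size, n)) for lo in range(0, n, block_size)]
    locks = [Lock() for _ in bounds]
    pending = [[] for _ in bounds]  # outstanding update tasks per block row
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for k, (lo, hi) in enumerate(bounds):
            # Block k can be solved only after all updates to it have landed.
            for fut in pending[k]:
                fut.result()
            solve_diag_block(L, X, lo, hi)
            # Updates to later block rows are independent tasks.
            for i in range(k + 1, len(bounds)):
                fut = pool.submit(
                    update_block, L, X, bounds[i], (lo, hi), locks[i]
                )
                pending[i].append(fut)
    return X
```

In this sketch `block_size` and `workers` are the kind of parameters an auto-tuner might search over, for example by timing a few candidate values on the target machine. How the paper actually selects its parameters is not stated in the abstract.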
