首页> 外文会议>2011 11th International Conference on Intelligent Systems Design and Applications >Biclustering and classification analysis in gene expression using Nonnegative Matrix Factorization on multi-GPU systems
【24h】

Biclustering and classification analysis in gene expression using Nonnegative Matrix Factorization on multi-GPU systems

机译:在多GPU系统上使用非负矩阵分解进行基因表达的分类和分类分析

获取原文

摘要

A great interest has been given to the Nonnegative Matrix Factorization (NMF) technique due to its ability of extracting highly-interpretable parts from data sets. Gene expression analysis is one of the most popular applications of NMF in Bioinformatics. Nonetheless, its usage is hindered by the computational complexity when processing large data sets. In this paper, we present two parallel implementations of NMF. The first version uses CUDA on a Graphics Processing Unit (GPU). Large input matrices are iteratively blockwise transferred and processed. The second implementation distributes data among multiple GPUs synchronized through MPI (Message Passing Interface). When analyzing large data sets with two and four GPUs, it performs respectively, 2.3 and 4.13 times faster than the single-GPU version. This represents about 120 times faster than a conventional CPU. These super linear speedups are achieved when data portions assigned to each GPU are small enough to be transferred only once.
机译:非负矩阵因式分解(NMF)技术引起了人们极大的兴趣,因为它具有从数据集中提取高度可解释的部分的能力。基因表达分析是NMF在生物信息学中最流行的应用之一。但是,在处理大数据集时,其使用受到计算复杂性的阻碍。在本文中,我们介绍了NMF的两种并行实现。第一个版本在图形处理单元(GPU)上使用CUDA。大型输入矩阵以迭代方式逐块传输和处理。第二种实现是在通过MPI(消息传递接口)同步的多个GPU之间分配数据。使用两个和四个GPU分析大型数据集时,其性能分别比单GPU版本快2.3倍和4.13倍。这表示比传统CPU快约120倍。当分配给每个GPU的数据部分足够小以至于只能传输一次时,就可以实现这些超线性加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号