首页> 外文OA文献 >ParBiBit: Parallel tool for binary biclustering on modern distributed-memory systems
【2h】

ParBiBit: Parallel tool for binary biclustering on modern distributed-memory systems

机译:Parbibit:现代分布式存储系统上的二进制BICLUSTING的并行工具

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Biclustering techniques are gaining attention in the analysis of large-scale datasets as they identify two-dimensional submatrices where both rows and columns are correlated. In this work we present ParBiBit, a parallel tool to accelerate the search of interesting biclusters on binary datasets, which are very popular on different fields such as genetics, marketing or text mining. It is based on the state-of-the-art sequential Java tool BiBit, which has been proved accurate by several studies, especially on scenarios that result on many large biclusters. ParBiBit uses the same methodology as BiBit (grouping the binary information into patterns) and provides the same results. Nevertheless, our tool significantly improves performance thanks to an efficient implementation based on C++11 that includes support for threads and MPI processes in order to exploit the compute capabilities of modern distributed-memory systems, which provide several multicore CPU nodes interconnected through a network. Our performance evaluation with 18 representative input datasets on two different eight-node systems shows that our tool is significantly faster than the original BiBit. Source code in C++ and MPI running on Linux systems as well as a reference manual are available at https://sourceforge.net/projects/parbibit/.
机译:在大规模数据集的分析中,双板颗粒技术正在关注,因为它们识别两个行和列都相关的二维子群体。在这项工作中,我们呈现Parbibit,一个并行工具,以加速在二进制数据集上搜索有趣的Biclusters,这在不同领域非常受欢迎,例如遗传,营销或文本挖掘。它基于最先进的顺序Java工具纤维,这已经通过几项研究证明了准确,特别是在导致许多大型Biclusters的情况下。 Parbibit使用与宾精相同的方法(将二进制信息分组成模式)并提供相同的结果。尽管如此,由于基于C ++ 11的有效实现,我们的工具显着提高了性能,包括对线程和MPI进程的支持,以利用现代分布式存储系统的计算能力,它提供通过网络互连的多个多核CPU节点。我们的性能评估与两个不同的八个节点系统上的18个代表性输入数据集显示我们的工具比原始纤维更快。在Linux系统上运行的C ++和MPI中的源代码以及参考手册可在https://sourceforge.net/projects/parbibit/上获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号