Conference: Middle East Conference on Biomedical Engineering

Cloud-based parallel suffix array construction based on MPI


Abstract

Next Generation Sequencing machines now produce massive amounts of genomic data. The suffix array is currently the best choice for indexing genomic data because of its efficiency and its large number of applications. In this paper, we address the problem of constructing the suffix array on a computer cluster in the cloud. We present a solution that automates the setup of a computer cluster in the cloud and then constructs the suffix array in a distributed fashion over the cluster nodes. This encapsulates all the details of setting up the cluster and executing the algorithm. The distributed nature of the algorithm also overcomes the problem that arises when the user, for cost reasons, wishes to use low-memory machines in the cloud. Our experiments show that our implementation scales well as the number of processors increases. The cloud cost is affordable, which makes this a cost-effective solution.
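
To make the idea of a distributed suffix array build concrete, here is a minimal Python sketch. It is not the paper's algorithm and it does not use MPI: the function names `suffix_array` and `partitioned_suffix_array`, and the scheme of bucketing suffixes by their first character, are assumptions chosen purely for illustration. Each simulated worker sorts only the suffixes in its own bucket, which is the property that would let low-memory nodes share the work.

```python
from typing import Dict, List


def suffix_array(text: str) -> List[int]:
    """Naive reference build: suffix start positions in lexicographic order."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def partitioned_suffix_array(text: str, num_workers: int) -> List[int]:
    """Simulated distributed build (illustrative only, not the paper's method).

    Suffixes are bucketed by first character, contiguous character ranges
    are assigned to workers, each worker sorts its bucket independently,
    and the sorted buckets are concatenated in worker order.
    """
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")

    alphabet = sorted(set(text))
    # Map each character to a worker; the mapping is monotone in the
    # character order, so worker w holds only suffixes that sort before
    # every suffix held by worker w + 1.
    owner: Dict[str, int] = {
        c: k * num_workers // len(alphabet) for k, c in enumerate(alphabet)
    }

    buckets: List[List[int]] = [[] for _ in range(num_workers)]
    for i, c in enumerate(text):
        buckets[owner[c]].append(i)

    result: List[int] = []
    for local in buckets:
        # In a real cluster this sort would run on a separate node.
        result.extend(sorted(local, key=lambda i: text[i:]))
    return result
```

In an actual MPI program each bucket would live on a separate process and the final concatenation would be a gather step. A production implementation would also avoid materializing suffix slices as sort keys, since that costs quadratic memory and time on long genomes.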
