Conference: International Symposium on Parallel Distributed Processing
Parallel Biological Sequence Alignments on the Cell Broadband Engine

Abstract

Sequence alignment and its many variants are a fundamental tool in computational biology. There is considerable recent interest in using the Cell Broadband Engine, a heterogeneous multi-core chip that provides high performance, for biological applications. However, work so far has been limited to computing optimal alignment scores using quadratic space under the basic global/local alignment algorithm. In this paper, we present a comprehensive study of developing sequence alignment algorithms on the Cell, exploiting its thread- and data-level parallelism features. First, we develop a Cell implementation that computes optimal alignments and adopts Hirschberg's linear-space technique. The former is essential, as merely computing optimal alignment scores is not useful, while the latter is needed to permit alignments of longer sequences. We then present Cell implementations of two advanced alignment techniques: spliced alignments and syntenic alignments. In a spliced alignment, consecutive non-overlapping portions of one sequence align with ordered, non-overlapping, but non-consecutive portions of another sequence. Spliced alignments are useful in aligning mRNA sequences with corresponding genomic sequences to uncover gene structure. Syntenic alignments are used to discover conserved exons and other sequences between long genomic sequences from different organisms. We present experimental results for these three types of alignments on the Cell BE and report speedups of about 4 on six SPUs on the PlayStation 3, when compared to the respective best serial algorithms on the Cell BE and the Pentium 4 processor.
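
To make the linear-space idea in the abstract concrete, the following is a minimal serial sketch of Hirschberg's divide-and-conquer technique in Python. It is not the paper's Cell implementation: it uses no SPU threading or SIMD, and the scoring values (`match=1`, `mismatch=-1`, a linear `gap=-1`) and all function names are assumptions chosen for illustration. It also assumes `match >= mismatch`.

```python
def nw_last_row(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment scores of all of `a` against every prefix of `b`.

    Only two rows of the dynamic-programming table are kept at a time,
    so the working space is O(len(b)) instead of O(len(a) * len(b)).
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        cur = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            cur[j] = max(diag, prev[j] + gap, cur[j - 1] + gap)
        prev = cur
    return prev


def hirschberg(a, b, match=1, mismatch=-1, gap=-1):
    """Return one optimal global alignment of `a` and `b` as two gapped strings."""
    if len(a) == 0:
        return "-" * len(b), b
    if len(b) == 0:
        return a, "-" * len(a)
    if len(a) == 1:
        # Base case: pair the single character with a matching position in b
        # if one exists, otherwise with b[0]; fall back to gaps only when
        # pairing scores worse than two gaps.
        j = max(b.find(a), 0)
        pair = match if b[j] == a else mismatch
        if pair >= 2 * gap:
            return "-" * j + a + "-" * (len(b) - j - 1), b
        return a + "-" * len(b), "-" + b

    # Split `a` in half, then find where the optimal path crosses that row
    # by combining a forward pass with a pass over the reversed suffixes.
    mid = len(a) // 2
    n = len(b)
    left = nw_last_row(a[:mid], b, match, mismatch, gap)
    right = nw_last_row(a[mid:][::-1], b[::-1], match, mismatch, gap)
    split = max(range(n + 1), key=lambda j: left[j] + right[n - j])

    a1, b1 = hirschberg(a[:mid], b[:split], match, mismatch, gap)
    a2, b2 = hirschberg(a[mid:], b[split:], match, mismatch, gap)
    return a1 + a2, b1 + b2


def alignment_score(x, y, match=1, mismatch=-1, gap=-1):
    """Score two equal-length gapped strings column by column."""
    return sum(
        gap if "-" in (p, q) else (match if p == q else mismatch)
        for p, q in zip(x, y)
    )


if __name__ == "__main__":
    top, bottom = hirschberg("AGTACGCA", "TATGC")
    print(top)
    print(bottom)
    print("score:", alignment_score(top, bottom))
```

The score of the returned alignment can be checked against `nw_last_row(a, b)[-1]`, which is the optimal score computed without recovering the alignment itself. That is the distinction the abstract draws between computing scores and computing alignments.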