首页> 外文会议>2010 IEEE/ACS International Conference on Computer Systems and Applications >RBT-Km: K-Means clustering for Multiple Sequence Alignment
【24h】

RBT-Km: K-Means clustering for Multiple Sequence Alignment

机译:RBT-Km:用于多序列比对的K-Means聚类

获取原文

摘要

This paper presents a novel approach for solving the Multiple Sequence Alignment (MSA) problem. K-Means clustering is combined with the Rubber Band Technique (RBT) to introduce an iterative optimization algorithm, namely RBT-Km, to find the optimal alignment for a set of input protein sequences. In this technique, the MSA problem is modeled as a Rubber Band, while the solution space is modeled as plate with several poles corresponding locations in the input sequences that are most likely to be correlated and/or biologically related. K-Means clustering is then used to discriminate biologically related locations from those that may appear by chance. RBT-Km is tested with one of the well-known benchmarks in this field (BALiBASE 2.0). The results demonstrate the superiority of the proposed technique even in the case of formidable sequences.
机译:本文提出了一种解决多序列比对(MSA)问题的新颖方法。 K-Means聚类与橡皮筋技术(RBT)结合使用,引入了一种迭代优化算法,即RBT-Km,以找到一组输入蛋白质序列的最佳比对。在此技术中,MSA问题建模为橡皮筋,而求解空间建模为平板,在输入序列中具有几个极点对应位置的板极可能是相关的和/或生物学相关的。然后,使用K均值聚类将生物学相关的位置与可能偶然出现的位置区分开。 RBT-Km已通过该领域的著名基准测试之一(BALiBASE 2.0)进行了测试。结果证明了所提出技术的优越性,即使在强大序列的情况下也是如此。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号