【24h】

Simplifying Amino Acid Alphabets Using a Genetic Algorithm and Sequence Alignment

机译:使用遗传算法和序列比对简化氨基酸字母

获取原文
获取原文并翻译 | 示例

摘要

In some areas of bioinformatics (like protein folding or sequence alignment) the full alphabet of amino acid symbols is not necessary. Often, better results are received with simplified alphabets. In general, simplified alphabets are as universal as possible. In this paper we show that this concept may not be optimal. We present a genetic algorithm for alphabet simplifying and we use it in a method based on global sequence alignment. We demonstrate that our algorithm is much faster and produces better results than the previously presented genetic algorithm. We also compare alphabets constructed on the base of universal substitution matrices like BLOSUM with our alphabets built through sequence alignment and propose a new coefficient describing the value of alphabets in the sequence alignment context. Finally we show that our simplified alphabets give better results in a sequence classification (using k-NN classifier), than most previously presented simplified alphabets and better than full 20-letter alphabet.
机译:在某些生物信息学领域(例如蛋白质折叠或序列比对),不需要完整的氨基酸符号字母。通常,使用简化的字母会收到更好的结果。通常,简化字母尽可能通用。在本文中,我们表明此概念可能不是最佳的。我们提出了一种用于字母简化的遗传算法,并将其用于基于全局序列比对的方法中。我们证明了我们的算法比以前提出的遗传算法要快得多,并且产生了更好的结果。我们还将在通用替换矩阵(如BLOSUM)的基础上构建的字母与通过序列比对构建的字母进行比较,并提出一个新系数来描述序列比对上下文中的字母值。最后,我们表明,与大多数以前提供的简化字母相比,我们的简化字母在序列分类(使用k-NN分类器)中提供了更好的结果,并且比完整的20个字母更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号