首页> 外文期刊>Swarm and Evolutionary Computation >A multi-population memetic algorithm for the 3-D protein structure prediction problem
【24h】

A multi-population memetic algorithm for the 3-D protein structure prediction problem

机译:一种多群麦克料算法3d蛋白质结构预测问题

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a knowledge-based memetic algorithm to tackle the three-dimensional protein structure prediction problem without the explicit use of experimentally determined protein structures' templates. Our algorithm proposal was divided into two main prediction steps: (i) solutions sampling and initialization; and (ii) structural models' optimization coming from the previous stage. The first step generates and classifies several structural models for a given target protein, through the Angle Probability List strategy, to identify distinct structural patterns and consider reasonable solutions in the memetic algorithm initialization. The Angle Probability List takes advantage of structural knowledge stored in the Protein Data Bank to reduce the size and, consequently, the conformational search space complexity. The second step of the method consists in the optimization of the structures generated in the first stage by the proposed memetic algorithm. It uses a tree-based population where each node can be seen as an independent subpopulation that interacts with each other over global search operations, aiming at knowledge sharing, population diversity, and better exploration of the multimodal search space. The method also encompasses ad hoc global search operators, whose objective is to increase the method exploration ability focusing on specific characteristics of the protein structure prediction problem, combined with the artificial bee colony algorithm used as an exploitation technique applied to each node of the tree. The proposed algorithm was tested on a set of 24 amino acid sequences, as well as compared to the reference method in the protein structure prediction area, the method of Rosetta. The obtained results show the ability of our method to predict three-dimensional protein structures with similar folding to the experimentally determined ones, regarding the structural metrics Root-Mean-Square Deviation and Global Distance Total Score Test. We also show that our method was able to reach comparable results to Rosetta, and in some cases, it outperformed Rosetta, corroborating the effectiveness of our proposal.
机译:在本文中,我们介绍了一种基于知识的遗料算法来解决三维蛋白质结构预测问题,而无明确地使用实验确定的蛋白质结构的模板。我们的算法提案分为两个主要预测步骤:(i)解决方案采样和初始化; (ii)结构模型的优化来自前一级。第一步通过角度概率列表策略来生成和对给定的目标蛋白质的若干结构模型进行分类,以识别不同的结构模式,并考虑在迭代算法初始化中的合理解决方案。角度概率列表利用存储在蛋白质数据库中的结构知识来减小尺寸,从而减小构象搜索空间复杂性。该方法的第二步骤包括通过所提出的麦克算法在第一阶段中产生的结构的优化。它使用基于树的人口,其中每个节点可以被视为一个独立的子群,在全球搜索操作中互相交互,旨在瞄准知识共享,人口分集,更好地探索多模式搜索空间。该方法还包括Ad Hoc全球搜索运营商,其目的是提高专注于蛋白质结构预测问题的特定特征的方法探测能力,与应用于树的每个节点的开发技术相结合。在一组24个氨基酸序列中测试了所提出的算法,以及与蛋白质结构预测区域中的参考方法相比,Rosetta的方法。所获得的结果表明,我们的方法预测三维蛋白质结构,其在实验确定的方法中预测具有与实验确定的折叠的三维蛋白质结构,关于结构度量根均方偏差和全局距离总分测试。我们还表明,我们的方法能够与Rosetta达到可比结果,并且在某些情况下,它表现出罗萨塔优势,证实了我们提案的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号