首页> 外文会议>Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics >On the Convergence of Protein Structure andDynamics. Statistical Learning Studies of Pseudo Folding Pathways
【24h】

On the Convergence of Protein Structure andDynamics. Statistical Learning Studies of Pseudo Folding Pathways

机译:关于蛋白质结构与动力学的趋同。伪折叠途径的统计学习研究

获取原文
获取原文并翻译 | 示例

摘要

Many algorithms that attempt to predict proteins' native structure from sequence need to generate a large set of hypotheses in order to ensure that nearly correct structures are included, leading to the problem of assessing the quality of alternative 3D conformations. This problem has been mostly approached by focusing on the final 3D conformation, with machine learning techniques playing a leading role. We argue in this paper that additional information for recognising native-like structures can be obtained by regarding the final conformation as the result of a generative process reminiscent of the folding process that generates structures in nature. We introduce a coarse representation of protein pseudo-folding based on binary trees and introduce a kernel function for assessing their similarity. Kernel-based analysis techniques empirically demonstrate a significant correlation between information contained into pseudo-folding trees and features of native folds in a large and non-redundant set of proteins.
机译:为了确保包括几乎正确的结构,许多试图从序列中预测蛋白质天然结构的算法都需要产生大量的假设,从而导致评估备选3D构象质量的问题。这个问题主要是通过关注最终的3D构型来解决的,其中机器学习技术起着主导作用。我们在本文中认为,通过将最终构象视为生成过程的结果,可以想到最终的构象,从而获得识别自然样结构的其他信息,而这种过程让人联想到自然界中产生结构的折叠过程。我们介绍了基于二叉树的蛋白质伪折叠的粗略表示,并介绍了用于评估其相似性的核函数。基于内核的分析技术从经验上证明伪折叠树中包含的信息与大型且非冗余蛋白质集中的自然折叠特征之间存在显着相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号