Journal: Proteins: Structure, Function, and Genetics

Effects of amino acid composition, finite size of proteins, and sparse statistics on distance-dependent statistical pair potentials.



Abstract

Distance-dependent statistical pair potentials are frequently used in folding, threading, and modeling studies of proteins. The applicability of these potentials is tightly connected to the reliability of the underlying statistical observations. We explored the possible origin and extent of false positive signals in statistical potentials by analyzing their distance dependence in a variety of randomized protein-like models. Although potentials derived from such models are expected to average zero at any distance, we demonstrate that systematic and significant distortions exist. These distortions originate from the limited statistical counts in local environments of proteins and, at large distances, from the limited size of protein structures. We suggest that these systematic errors in statistical potentials are connected to the dependence of amino acid composition on protein size and to variation in protein sizes. Additionally, atom-based potentials are dominated by a false positive signal that is due to correlation among distances measured from atoms of one residue to atoms of another residue. The significance of residue-based pairwise potentials at various spatial pair separations was assessed in this study: as few as approximately 50% of potential values were statistically significant at distances below 4 Å, and at most approximately 80% were significant at larger pair separations. A new definition of the reference state, free of the observed systematic errors, is suggested. It is shown to generate statistical potentials that compare favorably with other publicly available ones.
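The sparse-statistics effect described in the abstract can be illustrated with a toy calculation. The sketch below is not the authors' method. It assumes the common inverse-Boltzmann form u = -ln(N_obs / N_exp) in kT units, models the observed count in a distance bin as Poisson noise around the expected count, and skips empty bins because the logarithm is undefined there. The names `pair_potential`, `sample_poisson`, and `mean_null_potential` are hypothetical and chosen only for this illustration.

```python
import math
import random


def pair_potential(n_obs, n_exp):
    """Inverse-Boltzmann energy in kT units: u = -ln(N_obs / N_exp)."""
    return -math.log(n_obs / n_exp)


def sample_poisson(lam, rng):
    """Draw one Poisson(lam) count with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def mean_null_potential(n_exp, trials=5000, seed=0):
    """Average potential when the true signal is zero.

    Assumption: observed counts are Poisson noise around n_exp, and
    empty bins (count 0) are skipped because the log is undefined.
    """
    rng = random.Random(seed)
    energies = []
    for _ in range(trials):
        n_obs = sample_poisson(n_exp, rng)
        if n_obs > 0:
            energies.append(pair_potential(n_obs, n_exp))
    return sum(energies) / len(energies)


# Compare sparsely and densely populated bins under the null model.
for n_exp in (1.0, 5.0, 50.0, 200.0):
    print(f"expected count {n_exp:6.1f}: mean null potential "
          f"{mean_null_potential(n_exp):+.4f} kT")
```

Under these assumptions the averaged potential departs noticeably from zero when the expected count is near one and shrinks toward zero as the bin fills, even though no real interaction is present. The size and sign of the offset depend on how empty bins are handled, so the numbers are illustrative only.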
