首页> 美国卫生研究院文献>other >HotPatch: A Statistical Approach to Finding Biologically Relevant Features on Protein Surfaces
【2h】

HotPatch: A Statistical Approach to Finding Biologically Relevant Features on Protein Surfaces

机译:HotPatch:一种在蛋白质表面发现生物学相关特征的统计方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We describe a fully automated algorithm for finding functional sites on protein structures. Our method finds surface patches of unusual physicochemical properties on protein structures, and estimates the patches’ probability of overlapping functional sites. Other methods for predicting the locations of specific types of functional sites exist, but in previous analyses, it has been difficult to compare methods when they are applied to different types of sites. Thus, we introduce a new statistical framework that enables rigorous comparisons of the usefulness of different physicochemical properties for predicting virtually any kind of functional site. The program’s statistical models were trained for 11 individual properties (electrostatics, concavity, hydrophobicity, etc.) and for 15 neural network combination properties, all optimized and tested on 15 diverse protein functions. To simulate what to expect if the program were run on proteins of unknown function, as might arise from structural genomics, we tested it on 618 proteins of diverse mixed functions. In the higher-scoring top half of all predictions, a functional residue could typically be found within the first 1.7 residues chosen at random. The program may or may not use partial information about the protein’s function type as an input, depending on which statistical model the user chooses to employ. If function type is used as an additional constraint, prediction accuracy usually increases, and is particularly good for enzymes, DNA-interacting sites, and oligomeric interfaces. The program can be accessed online at .
机译:我们描述了一种用于查找蛋白质结构上功能位点的全自动算法。我们的方法在蛋白质结构上发现了具有异常物理化学特性的表面斑块,并估计了斑块重叠功能位点的可能性。还存在用于预测特定类型功能站点位置的其他方法,但是在以前的分析中,当将方法应用于不同类型的站点时,很难对它们进行比较。因此,我们引入了一个新的统计框架,该框架能够严格比较不同物理化学性质对预测几乎任何类型的功能部位的有用性。该程序的统计模型针对11种独立属性(静电,凹度,疏水性等)和15种神经网络组合属性进行了训练,均对15种不同的蛋白质功能进行了优化和测试。为了模拟程序在功能未知的蛋白质上运行(由结构基因组学产生的结果)时所期望的结果,我们在618种具有多种混合功能的蛋白质上进行了测试。在所有预测的得分较高的上半部分中,通常可以在随机选择的前1.7个残基中找到功能残基。该程序可能会也可能不会使用有关蛋白质功能类型的部分信息作为输入,具体取决于用户选择采用哪种统计模型。如果将功能类型用作附加约束,则预测精度通常会提高,并且特别适用于酶,DNA相互作用位点和寡聚界面。该程序可以从以下网站在线访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号