首页> 美国卫生研究院文献>Protein Science : A Publication of the Protein Society >Fuzzy cluster analysis of simple physicochemical properties of amino acids for recognizing secondary structure in proteins.
【2h】

Fuzzy cluster analysis of simple physicochemical properties of amino acids for recognizing secondary structure in proteins.

机译:氨基酸的简单理化性质的模糊聚类分析用于识别蛋白质的二级结构。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Fuzzy cluster analysis has been applied to the 20 amino acids by using 65 physicochemical properties as a basis for classification. The clustering products, the fuzzy sets (i.e., classical sets with associated membership functions), have provided a new measure of amino acid similarities for use in protein folding studies. This work demonstrates that fuzzy sets of simple molecular attributes, when assigned to amino acid residues in a protein's sequence, can predict the secondary structure of the sequence with reasonable accuracy. An approach is presented for discriminating standard folding states, using near-optimum information splitting in half-overlapping segments of the sequence of assigned membership functions. The method is applied to a nonredundant set of 252 proteins and yields approximately 73% matching for correctly predicted and correctly rejected residues with approximately 60% overall success rate for the correctly recognized ones in three folding states: alpha-helix, beta-strand, and coil. The most useful attributes for discriminating these states appear to be related to size, polarity, and thermodynamic factors. Van der Waals volume, apparent average thickness of surrounding molecular free volume, and a measure of dimensionless surface electron density can explain approximately 95% of prediction results. hydrogen bonding and hydrophobicity induces do not yet enable clear clustering and prediction.
机译:通过使用65种理化特性作为分类基础,对20个氨基酸进行了模糊聚类分析。模糊集(即具有相关隶属函数的经典集)的聚类产物提供了用于蛋白质折叠研究的氨基酸相似性的新度量。这项工作表明,将简单分子属性的模糊集分配给蛋白质序列中的氨基酸残基后,可以以合理的准确性预测序列的二级结构。提出了一种方法,该方法使用在分配的隶属函数的序列的半重叠段中进行的近乎最佳的信息拆分来区分标准折叠状态。该方法应用于252种蛋白质的非冗余集,对于正确预测和正确剔除的残基,产生约73%的匹配,对于三种折叠状态下正确识别的残基,总成功率约为60%:α-螺旋,β-链和线圈。区分这些状态最有用的属性似乎与大小,极性和热力学因素有关。范德华体积,周围分子自由体积的表观平均厚度以及无因次表面电子密度的量度可以解释约95%的预测结果。氢键和疏水性诱导剂尚不能实现清晰的聚类和预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号