首页> 外文会议>International Workshop on Database and Expert Systems Applications >Novel Application of Query-Based Qualitative Predictors for Characterization of Solvent Accessible Residues in Conjunction with Protein Sequence Homology
【24h】

Novel Application of Query-Based Qualitative Predictors for Characterization of Solvent Accessible Residues in Conjunction with Protein Sequence Homology

机译:基于查询的定性预测因子与蛋白质序列同源性结合溶剂可偏转残留物的新型应用

获取原文

摘要

Prediction of relative solvent accessibility (RSA) is a standard first-approach in predicting three-dimensional protein structures. Here we have applied linear regression methods that include various sequence homology values for each residue as well as query residue qualitative predictors, corresponding to each of the twenty canonical amino acids. We fit the 268-protein learning set with a variety of sequence homology terms, including 20 and 6-term sequence entropy, and residue qualitative predictors. Then estimated RSA values are subsequently generated for the 215-protein Manesh test set. The qualitative predictors describe the actual query residue type (e.g. Gly) as opposed to the measures of sequence homology for the aligned subject sequences. This is consistent with our framework of modeling a limited set of discrete and/or physically intuitive predictors. Initial calculations involving normalized RSA values were considered as a likely first attempt, incorporating the notion of fitting an explicit binary characterization of individual residues, either as buried or accessible. Interestingly, the utilization of qualitative predictors showed significant prediction accuracy. Subsequent calculations using the original RSA values gave estimated values that, upon binary classification, indicated accuracies comparable to other first stage methods. Development of a second stage methodology is of current interest.
机译:相对溶剂可访问性的预测(RSA)是预测三维蛋白质结构的标准第一方法。在这里,我们已经应用了用于每个残基的各种序列同源物的线性回归方法以及对应于二十个规范氨基酸中的每一个的查询残留物定性预测因子。我们符合各种序列同源性术语的268蛋白学习,包括20和6术语序列熵和残留物定性预测因子。然后随后为215蛋白索取测试集生成估计的RSA值。定性预测器描述了实际的查询残留型(例如,GLY),而不是对准对象序列的序列同源性的措施。这与我们建模有限的离散和/或身体直观预测器的框架一致。涉及归一化RSA值的初始计算被认为是一种可能的首次尝试,其中包含拟合单个残留物的明确二进制表征的概念,如掩埋或可访问。有趣的是,定性预测因子的利用显示出显着的预测准确性。使用原始RSA值的后续计算使估计值在二进制分类时,指示与其他第一阶段方法相当的准确性。第二阶段方法的发展是目前的兴趣。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号