首页> 外文期刊>Current Bioinformatics >Sequence-Based Prediction of Enzyme Thermostability Through Bioinformatics Algorithms
【24h】

Sequence-Based Prediction of Enzyme Thermostability Through Bioinformatics Algorithms

机译:生物信息学算法基于序列的酶热稳定性预测

获取原文
获取原文并翻译 | 示例
           

摘要

Predicting the thermostability of a biomolecule, given its sequence, is one of the big challenges of protein engineering and developing tools to screen thermostable mutants is of great interest. Here we used various screening, clustering, decision tree and generalized rule induction models to search for patterns of thermostability. Arg was solely found as N-terminal amino acid in proteins at temperatures higher than 70 C. Fifty-four protein features were important in feature selection, and the number of peer groups (anomaly index 2.12) declined from 7 to 2 with selected features; no changes were found in K-Means and TwoStep clusters with/without feature selection filtering. Tree depths of decision tree models varied from 14 (in C5.0 with 10-fold cross-validation and with feature selection) to 4 (in CHAID) branches and C5.0 was the best and the Quest model was the worst. No significant difference in the performance of various decision tree models was found with/without feature selection, but the number of peer groups in clustering models was reduced significantly (p<0.05). The frequency of Gln was the most important feature in decision tree rules and for all association rules in antecedent to support the rules. The importance of Gln in protein thermostability is discussed in this paper.
机译:预测生物分子的热稳定性(给定序列)是蛋白质工程的重大挑战之一,开发筛选热稳定突变体的工具引起了人们的极大兴趣。在这里,我们使用了各种筛选,聚类,决策树和广义规则归纳模型来搜索热稳定性模式。在温度高于70 C的蛋白质中,Arg仅作为蛋白质的N末端氨基酸被发现。54个蛋白质特征在特征选择中很重要,同伴组(异常指数2.12)的数目从7减少到2。在使用/不使用功能选择过滤的K-Means和TwoStep群集中,未发现任何变化。决策树模型的树深度从14(在C5.0中具有10倍交叉验证和特征选择)到4(在CHAID中)分支,C5.0最好,而Quest模型最差。在有/没有特征选择的情况下,各种决策树模型的性能没有显着差异,但是聚类模型中的对等组数量显着减少(p <0.05)。 Gln的频率是决策树规则中最重要的功能,也是所有支持规则的关联规则中最重要的功能。本文讨论了Gln在蛋白质热稳定性中的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号