...
首页> 外文期刊>Amino Acids >Predicting DNA-binding proteins: approached from Chou’s pseudo amino acid composition and other specific sequence features
【24h】

Predicting DNA-binding proteins: approached from Chou’s pseudo amino acid composition and other specific sequence features

机译:预测DNA结合蛋白:从Chou的假氨基酸组成和其他特定序列特征出发

获取原文
获取原文并翻译 | 示例
           

摘要

DNA-binding proteins play a pivotal role in gene regulation. It is vitally important to develop an automated and efficient method for timely identification of novel DNA-binding proteins. In this study, we proposed a method based on alone the primary sequences of proteins to predict the DNA-binding proteins. DNA-binding proteins were encoded by autocross-covariance transform, pseudo-amino acid composition, dipeptide composition, respectively and also the different combinations of the three encoded methods; further, these feature matrices were applied to support vector machine classifiers to predict the DNA-binding proteins. All modules were trained and validated by the jackknife cross-validation test. Through comparing the performance of these substituted modules, the best result was obtained from pseudo-amino acid composition with the overall accuracy of 96.6% and the sensitivity of 90.7%. The results suggest that it can efficiently predict the novel DNA-binding proteins only using the primary sequences.
机译:DNA结合蛋白在基因调控中起关键作用。开发一种及时有效地鉴定新型DNA结合蛋白的自动化有效方法至关重要。在这项研究中,我们提出了一种仅基于蛋白质的主要序列来预测DNA结合蛋白的方法。 DNA结合蛋白分别通过自交协变,假氨基酸组成,二肽组成以及三种编码方法的不同组合进行编码。此外,将这些特征矩阵应用于支持向量机分类器,以预测DNA结合蛋白。所有模块均通过折刀交叉验证测试进行了培训和验证。通过比较这些取代模块的性能,从假氨基酸组合物中获得了最佳结果,总准确度为96.6%,灵敏度为90.7%。结果表明,仅使用一级序列,它就可以有效地预测新型DNA结合蛋白。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号