...
首页> 外文期刊>BioSystems >Design of accurate predictors for DNA-binding sites in proteins using hybrid SVM–PSSM method
【24h】

Design of accurate predictors for DNA-binding sites in proteins using hybrid SVM–PSSM method

机译:使用混合SVM-PSSM方法设计蛋白质中DNA结合位点的精确预测子

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we investigate the design of accurate predictors for DNA-binding sites in proteins from amino acid sequences. As a result, we propose a hybrid method using support vector machine (SVM) in conjunction with evolutionary information of amino acid sequences in terms of their position-specific scoring matrices (PSSMs) for prediction of DNA-binding sites. Considering the numbers of binding and non-binding residues in proteins are significantly unequal, two additional weights as well as SVM parameters are analyzed and adopted to maximize net prediction (NP, an average of sensitivity and specificity) accuracy. To evaluate the generalization ability of the proposed method SVM–PSSM, a DNA-binding dataset PDC-59 consisting of 59 protein chains with low sequence identity on each other is additionally established. The SVM-based method using the same six-fold cross-validation procedure and PSSM features has NP = 80.15% for the training dataset PDNA-62 and NP = 69.54% for the test dataset PDC-59, which are much better than the existing neural network-based method by increasing the NP values for training and test accuracies up to 13.45% and 16.53%, respectively. Simulation results reveal that SVM–PSSM performs well in predicting DNA-binding sites of novel proteins from amino acid sequences.
机译:在本文中,我们研究了氨基酸序列中蛋白质中DNA结合位点的准确预测子的设计。结果,我们提出了一种混合使用支持向量机(SVM)的方法,结合氨基酸序列的进化信息来预测DNA结合位点的位置特异性得分矩阵(PSSM)。考虑到蛋白质中结合残基和非结合残基的数量明显不相等,因此分析并采用了两个附加权重以及SVM参数,以最大程度地提高净预测(NP,即灵敏度和特异性的平均值)的准确性。为了评估提出的方法SVM–PSSM的泛化能力,还建立了一个DNA结合数据集PDC-59,该数据集由59个彼此之间具有较低序列同一性的蛋白质链组成。使用相同六重交叉验证程序和PSSM功能的基于SVM的方法,训练数据集PDNA-62的NP = 80.15%,测试数据集PDC-59的NP = 69.54%,这比现有数据集要好得多基于神经网络的方法,将训练和测试准确度的NP值分别提高到13.45%和16.53%。仿真结果表明,SVM–PSSM在从氨基酸序列预测新蛋白质的DNA结合位点方面表现良好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号