首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Using hidden Markov models to predict DNA-binding proteins with sequence and structure information
【24h】

Using hidden Markov models to predict DNA-binding proteins with sequence and structure information

机译:使用隐马尔可夫模型预测具有序列和结构信息的DNA结合蛋白

获取原文
获取原文并翻译 | 示例
           

摘要

In the post-genome period, the protein domain structures are published rapidly, but they have not been studied comprehensively. To figure out the cell function, the protein-DNA interactions decrypt the protein domain structures in recent research. Several machine-learning based methods are applied to the issue; however, they are not efficient to translate the tertiary structure characteristics of proteins into appropriate features for predicting the DNAbinding proteins. In this work, a novel machine-learning approach based on hidden Markov models identifies the characteristics of DNA-binding proteins with their amino acid sequences and tertiary structures.After we distill the features from DNA-binding proteins, a support vector machine based classifier predicts general DNA-binding proteins with the accuracy of 88.45 % through fivefolds cross-validation. Furthermore, we construct a response element specific classifier for predicting response element specific DNA-binding proteins, and the performance achieves the precision of 96.57% with recall rate as 88.83%in average. To verify the prediction ofDNA-binding proteins,we used theDNA-binding proteins from MCF-7 that are likely to bind with estrogen response elements (ERE), and the results show that our methods can apply to practice.
机译:在后基因组时期,蛋白质结构域结构被迅速公开,但是尚未对其进行全面研究。为了弄清细胞功能,蛋白质-DNA相互作用在最近的研究中解密了蛋白质结构域结构。有几种基于机器学习的方法适用于该问题。然而,它们不能有效地将蛋白质的三级结构特征转化为用于预测DNA结合蛋白的适当特征。在这项工作中,一种基于隐马尔可夫模型的新颖机器学习方法可识别具有氨基酸序列和三级结构的DNA结合蛋白的特征。从DNA结合蛋白中提取特征后,基于支持向量机的分类器可预测一般的DNA结合蛋白通过五倍交叉验证的准确性为88.45%。此外,我们构建了一个预测反应元件特异性DNA结合蛋白的反应元件特异性分类器,该性能达到了96.57%的精度,召回率平均为88.83%。为了验证对DNA结合蛋白的预测,我们使用了可能与雌激素反应元件(ERE)结合的MCF-7的DNA结合蛋白,结果表明我们的方法可用于实践。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号