...
首页> 外文期刊>Scientific reports. >PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context
【24h】

PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context

机译:pdnasite:通过在空间和序列背景下通过蛋白质序列鉴定DNA结合位点

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community.
机译:蛋白质-DNA相互作用涉及许多基本生物方法对细胞功能必不可少的。大多数现有的计算方法仅使用目标残留物的序列上下文进行其预测。在本研究中,对于每个目标渣数,我们应用了空间上下文和序列上下文来构造特征空间。随后,应用潜在语义分析(LSA)来删除特征空间中的冗余。最后,通过集成支持向量机(SVM)分类器和集合学习来开发预测器(PDNASITE)。 PDNA-62的结果和PDNA-224数据集表明,从空间上下文提取的特征提供比序列上下文的更多信息,并且它们的组合提供了更多的性能增益。靶位位点的空间背景下的结合位点数分析表明彼此相邻的结合位点之间的相互作用对于蛋白质-DNA识别和它们的结合能力是重要的。我们提出的PDNASITE方法与现有方法之间的比较表明PDNASITE优于大多数现有方法,并且是DNA结合位点鉴定的有用工具。我们的预测器的Web服务器(http://hlt.hitsz.edu.cn:8080/pdnasite/)可用于获得生物研究界的免费公众。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号