
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information



Abstract

Motivation: Though vitally important to cell function, the mechanism of protein–DNA binding is not yet completely understood. We therefore analysed the relationship between DNA binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and protein–DNA complexes, we developed neural network models that use the information present in this relationship to predict DNA-binding proteins and their binding residues.

Results: Sequence composition was found to provide sufficient information to predict the probability that a protein binds to DNA, with nearly 69% sensitivity at 64% accuracy for the proteins considered; sequence neighbourhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although binding behaviour was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins.
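To make the terms in the abstract concrete, the following is a minimal Python sketch of how a sequence-composition feature vector and the reported kind of scores could be computed. It is not the authors' implementation: the function names (`composition`, `sensitivity`, `accuracy`) are hypothetical, and the metric definitions are the conventional ones, which may differ from the exact definitions used in the paper.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so that every
# sequence maps to a feature vector with the same layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence: str) -> list[float]:
    """Return the fraction of each standard amino acid in `sequence`.

    Non-standard characters (e.g. 'X' or gaps) are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def sensitivity(tp: int, fn: int) -> float:
    """Conventional sensitivity: fraction of true binders that are found."""
    return tp / (tp + fn)


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Conventional accuracy: fraction of all predictions that are correct.

    Assumption: the paper may define accuracy differently.
    """
    return (tp + tn) / (tp + tn + fp + fn)
```

A 20-element vector from `composition` is the sort of input a composition-based classifier could consume; the neural network itself is not sketched here because the abstract does not describe its architecture.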

Bibliographic details

  • Source
    Bioinformatics, 2004, issue 4, pp. 477-486 (10 pages)
  • Author affiliations

    Department of Biochemical Science and Engineering, Kyushu Institute of Technology, Fukuoka, Iizuka 820 8502, Japan;

    Department of Biosciences, Jamia Millia Islamia, New Delhi 110025, India;

    Department of Biochemical Science and Engineering, Kyushu Institute of Technology, Fukuoka, Iizuka 820 8502, Japan;

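  • Indexed in: Science Citation Index (SCI, USA); Chemical Abstracts (CA, USA)
  • Original format: PDF
  • Text language: English
  • Chinese Library Classification: Biological sciences; Bioengineering (biotechnology)
  • Keywords: (none listed)

  • Date added to database: 2022-08-17 23:50:16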
