首页> 外文期刊>Advances in Bioinformatics >Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information
【24h】

Development of a Machine Learning Method to Predict Membrane Protein-Ligand Binding Residues Using Basic Sequence Information

机译:使用基本序列信息预测膜蛋白-配体结合残基的机器学习方法的开发

获取原文
           

摘要

Locating ligand binding sites and finding the functionally important residues from protein sequences as well as structures became one of the challenges in understanding their function. Hence a Naïve Bayes classifier has been trained to predict whether a given amino acid residue in membrane protein sequence is a ligand binding residue or not using only sequence based information. The input to the classifier consists of the features of the target residue and two sequence neighbors on each side of the target residue. The classifier is trained and evaluated on a nonredundant set of 42 sequences (chains with at least one transmembrane domain) from 31 alpha-helical membrane proteins. The classifier achieves an overall accuracy of 70.7% with 72.5% specificity and 61.1% sensitivity in identifying ligand binding residues from sequence. The classifier performs better when the sequence is encoded by psi-blast generated PSSM profiles. Assessment of the predictions in the context of three-dimensional structures of proteins reveals the effectiveness of this method in identifying ligand binding sites from sequence information. In 83.3% (35 out of 42) of the proteins, the classifier identifies the ligand binding sites by correctly recognizing more than half of the binding residues. This will be useful to protein engineers in exploiting potential residues for functional assessment.
机译:定位配体结合位点并从蛋白质序列和结构中找到功能上重要的残基成为理解其功能的挑战之一。因此,已经训练了朴素贝叶斯分类器以仅使用基于序列的信息来预测膜蛋白序列中的给定氨基酸残基是否为配体结合残基。分类器的输入包括目标残基的特征和目标残基每一侧的两个序列相邻物。在来自31个α螺旋膜蛋白的42个非冗余序列集(具有至少一个跨膜结构域的链)上训练和评估分类器。该分类器从序列中鉴定配体结合残基时,总体准确度达到70.7%,特异性为72.5%,灵敏度为61.1%。当序列由psi-blast生成的PSSM配置文件编码时,分类器的效果会更好。在蛋白质的三维结构的背景下对预测的评估揭示了该方法从序列信息中识别配体结合位点的有效性。在83.3%(42种蛋白中的35种)蛋白中,分类器通过正确识别一半以上的结合残基来识别配体结合位点。这将对蛋白质工程师利用潜在残基进行功能评估很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号