首页> 外文会议>Annual International Conference on Research in Computational Molecular Biology >Predicting Protein-Peptide Binding Affinity by Learning Peptide-Peptide Distance Functions
【24h】

Predicting Protein-Peptide Binding Affinity by Learning Peptide-Peptide Distance Functions

机译:通过学习肽 - 肽距离功能预测蛋白质肽结合亲和力

获取原文

摘要

Many important cellular response mechanisms are activated when a peptide binds to an appropriate receptor. In the immune system, the recognition of pathogen peptides begins when they bind to cell membrane Major Histocompatibility Complexes (MHCs). MHCproteins then carry these peptides to the cell surface in order to allow the activation of cytotoxic T-cells. The MHC binding cleft is highly polymorphic and therefore protein-peptide binding is highly specific. Developing computational methods for predicting protein-peptide binding is important for vaccine design and treatment of diseases like cancer. Previous learning approaches address the binding prediction problem using traditional margin based binary classifiers. In this paper we propose a novelapproach for predicting binding affinity. Our approach is based on learning a peptide-peptide distance function. Moreover, we learn a single peptide-peptide distance function over an entire family of proteins (e.g MHC class I). This distance function canbe used to compute the affinity of a novel peptide to any of the proteins in the given family. In order to learn these peptide-peptide distance functions, we formalize the problem as a semi-supervised learning problem with partial information in the form of equivalence constraints. Specifically we propose to use DistBoost [1,2], which is a semi-supervised distance learning algorithm. We compare our method to various state-of-the-art binding prediction algorithms on MHC class I and MHC class II datasets. In almost all cases, our method outperforms all of its competitors. One of the major advantages of our novel approach is that it can also learn an affinity function over proteins for which only small amounts of labeled peptides exist. In these cases, DistBoost's performance gain, when compared to other computational methods, is even more pronounced.
机译:当肽与合适的受体结合时,激活许多重要的细胞响应机制。在免疫系统中,当它们与细胞膜主要组织相容络合物(MHCs)结合时,识别病原体肽的识别开始。然后将这些肽携带到细胞表面以允许活化细胞毒性T细胞。 MHC结合裂隙是高多态性的,因此蛋白质肽结合是高度特异性的。开发用于预测蛋白肽结合的计算方法对于癌症的疫苗设计和治疗疾病是重要的。先前的学习方法使用基于传统的基于边界的二进制分类器来解决绑定预测问题。在本文中,我们提出了一种用于预测结合亲和力的NoveLaproach。我们的方法是基于学习肽肽距离功能。此外,我们在整个蛋白质家族(例如MHC I类)上学习单个肽肽距离功能。该距离功能可以用于将新肽的亲和力计算在给定家庭中的任何蛋白质中。为了学习这些肽肽距离函数,我们将问题正式化为具有等同约束形式的部分信息的半监督学习问题。具体地,我们建议使用DISTBOOST [1,2],这是一个半监督距离学习算法。我们将我们的方法与MHC I类和MHC II类数据集的各种最先进的绑定预测算法进行比较。在几乎所有情况下,我们的方法都优于其所有竞争对手。我们的新方法的主要优点之一是它还可以学习蛋白质的亲和功能,其仅存在少量标记的肽。在这些情况下,与其他计算方法相比,DistBoost的性能增益甚至更明显。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号