首页> 外文期刊>PLoS Computational Biology >Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification
【24h】

Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification

机译:使用抗体 - 抗原结合界面培训基于图像的深神经网络进行抗体表位分类

获取原文
           

摘要

High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a “fingerprint” to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71–96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification.
机译:高通量B细胞测序开辟了新的途径,用于研究我们适应性免疫反应的复杂机制。这些技术进步驱动数据生成和需要挖掘和分析这些大型数据集中包含的信息,特别是治疗抗体(ABS)或与疾病暴露和保护相关的信息。在这里,我们描述了使用人工智能(AI)的努力,基于序列信息基于ABS的预期分类。我们假设ABS识别抗原的相同部分在结合界面中共享有限的特征,并且这些ABS份额的结合位点区域享有常见的结构和物理化学性质图案,其可以用作“指纹”以识别不表达的“指纹” ABS。我们组合大规模的序列基蛋白质结构预测以产生3-D AB模型的集合,将AB绑定界面减少到2-D图像(指纹),使用预先训练的卷积神经网络来提取特征,并培训深度神经网络(DNN)分类ABS。我们使用源自人艾滋病毒和埃博拉病毒感染的AB序列评估了这种方法,以区分两个ABS,属于特异性B细胞家族谱系的ABS,以及具有不同表位偏好的ABS。此外,我们探讨了一种不同类型的DNN方法来检测一类从较大的ABS池中的ABS。在模型培训期间一直保留的AB集测试,我们取决于71-96%的平均预测精度,具体取决于分类任务的复杂性。这些分类测试期间达到的高水平的精度表明DNN模型能够学习由属于同一类的ABS共享的一系列结构模式。开发的方法提供了应用基于AI的图像识别技术的方法,以分析用于AB分类的高吞吐量B小区测序数据集(reptoIres)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号