...
首页> 外文期刊>BMC Bioinformatics >Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy
【24h】

Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy

机译:个性化癌症免疫疗法中躯体变异呼叫和肽鉴定的半监督学习

获取原文
           

摘要

Personalized cancer vaccines are emerging as one of the most promising approaches to immunotherapy of advanced cancers. However, only a small proportion of the neoepitopes generated by somatic DNA mutations in cancer cells lead to tumor rejection. Since it is impractical to experimentally assess all candidate neoepitopes prior to vaccination, developing accurate methods for predicting tumor-rejection mediating neoepitopes (TRMNs) is critical for enabling routine clinical use of cancer vaccines. In this paper we introduce Positive-unlabeled Learning using AuTOml (PLATO), a general semi-supervised approach to improving accuracy of model-based classifiers. PLATO generates a set of high confidence positive calls by applying a stringent filter to model-based predictions, then rescores remaining candidates by using positive-unlabeled learning. To achieve robust performance on clinical samples with large patient-to-patient variation, PLATO further integrates AutoML hyper-parameter tuning, classification threshold selection based on spies, and support for bootstrapping. Experimental results on real datasets demonstrate that PLATO has improved performance compared to model-based approaches for two key steps in TRMN prediction, namely somatic variant calling from exome sequencing data and peptide identification from MS/MS data.
机译:个性化的癌症疫苗是最有希望的晚期癌症的最有希望的方法之一。然而,只有一小部分癌细胞中的体细胞DNA突变产生的新阑尾导致肿瘤排斥。由于在疫苗接种前实验评估所有候选新患者是不切实际的,因此制定用于预测肿瘤排斥的准确方法,介导介导的肿瘤 - 抑制新脑(TRMN)对于实现癌症疫苗的常规临床使用至关重要。在本文中,我们使用Automl(柏拉图)介绍正面未标记的学习,这是一种提高基于模型的分类器的准确性的一般半监督方法。柏拉图通过将严格的过滤器应用于基于模型的预测来产生一组高置信度呼叫,然后通过使用正面未标记的学习来重新分配剩余的候选。为了在具有大型患者到患者变化的临床样本上实现鲁棒性能,柏拉图进一步集成了自动化超参数调谐,基于间谍的分类阈值选择,并支持自动启动。实验结果对实际数据集表明,与TRMN预测中的两个关键步骤中的模型的方法相比,柏拉图改善了性能,即来自Exome测序数据和来自MS / MS数据的肽识别的体制变体调用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号