首页> 外文期刊>Nature Communications >A multi-task convolutional deep neural network for variant calling in single molecule sequencing
【24h】

A multi-task convolutional deep neural network for variant calling in single molecule sequencing

机译:用于单分子测序中变异调用的多任务卷积深度神经网络

获取原文
           

摘要

The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5-15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2?h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source ( https://github.com/aquaskyline/Clairvoyante ), with modules to train, utilize and visualize the model.
机译:DNA序列变异体的准确鉴定是基因组学中一项重要但具有挑战性的任务。单分子测序的难度非常大,因为每个核苷酸的错误率约为5-15%。为满足此需求,我们开发了Clairvoyante,这是一个多任务五层卷积神经网络模型,用于从对齐的读取中预测变异类型(SNP或indel),接合性,替代等位基因和indel长度。对于特征明确的NA12878人类样品,Clairvoyante在1KP常见变体上的F1-得分达到99.67、95.78、90.53%,使用Illumina,PacBio和Oxford Nanopore数据进行全基因组分析的F1得分达到98.65、92.57、87.26%。 , 分别。对第二个人类样本的训练表明,Clairvoyante具有样本不可知性,并且可以在不到2?h的时间内在标准服务器上找到变体。此外,我们提供了3,135个变体,这些变体使用Illumina遗漏了,但受到PacBio和Oxford Nanopore读物的独立支持。 Clairvoyante是开放源代码(https://github.com/aquaskyline/Clairvoyante),带有用于训练,利用和可视化模型的模块。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号