...
首页> 外文期刊>BMC Bioinformatics >DeepSV: accurate calling of genomic deletions from high-throughput sequencing data using deep convolutional neural network
【24h】

DeepSV: accurate calling of genomic deletions from high-throughput sequencing data using deep convolutional neural network

机译:Deepsv:使用深卷积神经网络准确地呼叫基因组缺失的基因组缺失

获取原文

摘要

BACKGROUND:Calling genetic variations from sequence reads is an important problem in genomics. There are many existing methods for calling various types of variations. Recently, Google developed a method for calling single nucleotide polymorphisms (SNPs) based on deep learning. Their method visualizes sequence reads in the forms of images. These images are then used to train a deep neural network model, which is used to call SNPs. This raises a research question: can deep learning be used to call more complex genetic variations such as structural variations (SVs) from sequence data?RESULTS:In this paper, we extend this high-level approach to the problem of calling structural variations. We present DeepSV, an approach based on deep learning for calling long deletions from sequence reads. DeepSV is based on a novel method of visualizing sequence reads. The visualization is designed to capture multiple sources of information in the sequence data that are relevant to long deletions. DeepSV also implements techniques for working with noisy training data. DeepSV trains a model from the visualized sequence reads and calls deletions based on this model. We demonstrate that DeepSV outperforms existing methods in terms of accuracy and efficiency of deletion calling on the data from the 1000 Genomes Project.CONCLUSIONS:Our work shows that deep learning can potentially lead to effective calling of different types of genetic variations that are complex than SNPs.
机译:背景:调用序列读取的遗传变化是基因组学中的重要问题。有许多用于调用各种类型的变体的方法。最近,谷歌开发了一种基于深度学习呼叫单一核苷酸多态性(SNP)的方法。它们的方法可视化序列以图像的形式读取。然后使用这些图像来训练一个深神经网络模型,用于调用SNP。这提出了一个研究问题:可以使用深度学习来调用更复杂的遗传变化,例如结构变化(SVS)序列数据?结果:在本文中,我们将这种高级方法扩展到呼叫结构变化的问题。我们呈现Deepsv,一种基于深度学习的方法,用于呼叫序列读取的长删除。 Deepsv基于一种可视化序列读取的新方法。可视化旨在捕获与长删除相关的序列数据中的多个信息源。 Deepsv还实现了使用嘈杂培训数据的技术。 Deepsv从可视化序列读取型号读取和呼叫删除基于此模型。我们证明Deepsv在呼吁从1000个基因组项目的数据的准确性和效率方面表明了现有的方法。结论:我们的工作表明,深度学习可能导致有效地呼吁不同类型的遗传变异,这些变化是复杂的比SNP复杂的不同类型的遗传变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号