...
首页> 外文期刊>BMC Bioinformatics >Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management
【24h】

Analysis of amplicon-based NGS data from neurological disease gene panels: a new method for allele drop-out management

机译:来自神经疾病基因组的基于扩增子的NGS数据分析:等位基因脱落管理的新方法

获取原文

摘要

Background Amplicon-based targeted resequencing is a commonly adopted solution for next-generation sequencing applications focused on specific genomic regions. The reliability of such approaches rests on the high specificity and deep coverage, although sequencing artifacts attributable to PCR-like amplification can be encountered. Between these artifacts, allele drop-out, which is the preferential amplification of one allele, causes an artificial increase in homozygosity when heterozygous mutations fall on a primer pairing region. Here, a procedure to manage such artifacts, based on a pipeline composed of two steps of alignment and variant calling, is proposed. This methodology has been compared to the Illumina Custom Amplicon workflow, available on Illumina MiSeq, on the analysis of data obtained with four newly designed TruSeq Custom Amplicon gene panels. Results Four gene panels, specific for Parkinson disease, for Intracerebral Hemorrhage Diseases (COL4A1 and COL4A2 genes) and for Familial Hemiplegic Migraine (CACNA1A and ATP1A2 genes) were designed. A total of 119 samples were re-sequenced with Illumina MiSeq sequencer and panel characterization in terms of coverage, number of variants found and allele drop-out potential impact has been carried out. Results show that 14?% of identified variants is potentially affected by allele drop-out artifacts and that both the Custom Amplicon workflow and the procedure proposed here could correctly identify them. Furthermore, a more complex configuration in presence of two mutations was simulated in silico . In this configuration, our proposed methodology outperforms Custom Amplicon workflow, being able to correctly identify two mutations in all the studied configurations. Conclusions Allele drop-out plays a crucial role in amplicon-based targeted re-sequencing and specific procedures in data analysis of amplicon data should be adopted. Although a consensus has been established in the elimination of primer sequences from aligned data (e.g., via primer sequence trimming or soft clipping), more complex configurations need to be managed in order to increase the retrieved information from available data. Our method shows how to manage one of these complex configurations, when two mutations occur.
机译:背景技术基于扩增子的靶向重测序是针对特定基因组区域的下一代测序应用的常用解决方案。尽管可能会遇到归因于PCR样扩增的测序假象,但这种方法的可靠性取决于高特异性和深度覆盖。在这些伪像之间,等位基因缺失是一种等位基因的优先扩增,当杂合突变落在引物配对区域时,会导致人为纯合性的人工增加。在此,提出了一种基于由对齐和变体调用两个步骤组成的流水线来管理此类工件的过程。该方法已与Illumina MiSeq上提供的Illumina Custom Amplicon工作流程进行了比较,该工作流程是对使用四个新设计的TruSeq Custom Amplicon基因面板获得的数据进行分析的。结果设计了四个基因组,分别针对帕金森病,脑出血性疾病(COL4A1和COL4A2基因)和家族性偏瘫偏头痛(CACNA1A和ATP1A2基因)。使用Illumina MiSeq测序仪对总共119个样品进行了重新测序,并在覆盖率,发现的变体数量和等位基因剔除潜在影响方面进行了面板鉴定。结果表明,识别出的变异中有14%受等位基因缺失人工影响,并且Custom Amplicon工作流程和此处提出的程序都可以正确识别它们。此外,在计算机上模拟了存在两个突变的更复杂的构型。在这种配置中,我们提出的方法优于自定义扩增子工作流程,能够正确识别所有研究配置中的两个突变。结论等位基因缺失在基于扩增子的靶向重测序中起着至关重要的作用,在扩增子数据的数据分析中应采用特定的程序。尽管在从比对数据中消除引物序列方面已达成共识(例如,通过引物序列修整或软剪裁),但仍需要管理更复杂的配置以增加从可用数据中检索到的信息。我们的方法显示了当发生两个突变时如何管理这些复杂的配置之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号