首页> 外文期刊>BMC Genomics >Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data
【24h】

Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data

机译:评估用于识别超低覆盖全基因组测序数据的大拷贝数变异的工具

获取原文
获取外文期刊封面目录资料

摘要

Detection of copy number variations (CNVs) from high-throughput next-generation whole-genome sequencing (WGS) data has become a widely used research method during the recent years. However, only a little is known about the applicability of the developed algorithms to ultra-low-coverage (0.0005–0.8×) data that is used in various research and clinical applications, such as digital karyotyping and single-cell CNV detection. Here, the performance of six popular read-depth based CNV detection algorithms (BIC-seq2, Canvas, CNVnator, FREEC, HMMcopy, and QDNAseq) was studied using ultra-low-coverage WGS data. Real-world array- and karyotyping kit-based validation were used as a benchmark in the evaluation. Additionally, ultra-low-coverage WGS data was simulated to investigate the ability of the algorithms to identify CNVs in the sex chromosomes and the theoretical minimum coverage at which these tools can accurately function. Our results suggest that while all the methods were able to detect large CNVs, many methods were susceptible to producing false positives when smaller CNVs ( 3?h) compared with FREEC (~?3?min), which we considered the second-best method. Our comparative analysis demonstrates that CNV detection from ultra-low-coverage WGS data can be a highly accurate method for the detection of large copy number variations when their length is in millions of base pairs. These findings facilitate applications that utilize ultra-low-coverage CNV detection.
机译:从高通量下一代全基因组测序(WGS)数据中检测拷贝数变异(CNV)已成为近年来广泛使用的研究方法。然而,只有稍微知道发达的算法的适用性(0.0005-0.8×)数据,用于各种研究和临床应用的数据,例如数字核型和单细胞CNV检测。这里,使用超低覆盖WGS数据研究了六种流行读取深度基于CNV检测算法(BIC-SEQ2,CANVAS,CNVNATOR,FREC,HMMCOPY和QDNASEQ)的性能。基于现实世界的阵列和核型套件基于基于基准的验证被用作评估中的基准。另外,模拟超低覆盖WGS数据以研究算法识别性染色体中CNV的能力以及这些工具可以准确地发挥作用的理论最小覆盖。我们的研究结果表明,虽然所有方法都能够检测到大型CNV,但是当与Freec(〜3?min)相比,当较小的CNV(3Ωh)时,许多方法易于产生假阳性,我们认为是第二种方法。我们的比较分析表明,来自超低覆盖WGS数据的CNV检测可以是在数百万基对中的大量副拷贝数变化的高度准确的方法中。这些发现有助于利用超低覆盖CNV检测的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号