首页> 外文期刊>Genome Biology >Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing
【24h】

Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing

机译:全基因组测序的结构变形检测算法综合评价

获取原文
获取外文期刊封面目录资料

摘要

Abstract BackgroundStructural variations (SVs) or copy number variations (CNVs) greatly impact the functions of the genes encoded in the genome and are responsible for diverse human diseases. Although a number of existing SV detection algorithms can detect many types of SVs using whole genome sequencing (WGS) data, no single algorithm can call every type of SVs with high precision and high recall.ResultsWe comprehensively evaluate the performance of 69 existing SV detection algorithms using multiple simulated and real WGS datasets. The results highlight a subset of algorithms that accurately call SVs depending on specific types and size ranges of the SVs and that accurately determine breakpoints, sizes, and genotypes of the SVs. We enumerate potential good algorithms for each SV category, among which GRIDSS, Lumpy, SVseq2, SoftSV, Manta, and Wham are better algorithms in deletion or duplication categories. To improve the accuracy of SV calling, we systematically evaluate the accuracy of overlapping calls between possible combinations of algorithms for every type and size range of SVs. The results demonstrate that both the precision and recall for overlapping calls vary depending on the combinations of specific algorithms rather than the combinations of methods used in the algorithms.ConclusionThese results suggest that careful selection of the algorithms for each type and size range of SVs is required for accurate calling of SVs. The selection of specific pairs of algorithms for overlapping calls promises to effectively improve the SV detection accuracy.
机译:摘要背景结构变异(SV)或拷贝数变异(CNV)大大影响了基因组中编码的基因的功能,并负责不同的人类疾病。虽然许多现有的SV检测算法可以使用全基因组测序(WGS)数据检测许多类型的SVS,但没有单次算法可以用高精度和高召回的每种类型的SV呼叫。详细评估69个现有的SV检测算法的性能。使用多个模拟和真正的WGS数据集。结果突出显示了一种算法的子集,可根据SVS的特定类型和大小范围准确地调用SV,并准确地确定SV的断点,大小和基因型。我们为每个SV类别列举潜在的良好算法,其中网格,Lumpy,SVSeq2,SoftSv,Manta和Wham是删除或重复类别中的更好的算法。为了提高SV呼叫的准确性,我们系统地评估了SV的各种类型和大小范围的可能组合之间的重叠呼叫的准确性。结果表明,重叠呼叫的精度和回忆都根据特定算法的组合而变化,而不是算法中使用的方法的组合。结论这些结果表明需要仔细选择每种类型和大小范围的SVS的算法准确呼叫SVS。针对重叠呼叫的特定算法对的选择有效地提高了SV检测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号