首页> 外文会议>International conference on intelligent computing >TNSim: A Tumor Sequencing Data Simulator for Incorporating Clonality Information
【24h】

TNSim: A Tumor Sequencing Data Simulator for Incorporating Clonality Information

机译:TNSim:用于整合克隆信息的肿瘤测序数据模拟器

获取原文

摘要

In recent years, the next generation sequencing enables us to obtain high resolution landscapes of the genetic changes at single-nucleotide level. More and more novel methods are proposed for efficient and effective analyses on cancer sequencing data. To facilitate such development, data simulator is a crucial tool, which not only tests and evaluates proposed approaches, but provides the feedbacks for further improvements as well. Several simulators are released to generate the next generation sequencing data. However, based on our best knowledge, none of them considers clonality information. It is suggested that clonal heterogeneity does widely exist in tumor samples. The patterns of somatic mutational events usually expose a wide spectrum of variant allelic frequencies, while some of them are only detectable in one or multiple clonal lineages. In this article, we introduce a Tumor-Normal sequencing Simulator, TNSim, to generate the next generation sequencing data by involving clonality information. The simulator is able to mimic a tumor sample and the paired normal sample, where the germline variants and somatic mutations can be settled respectively. Tumor purity is adjustable. Clonal architecture is preas-signed as one or more clonal lineages, where each lineage consists of a set of somatic mutations whose variant allelic frequencies are similar. A group of experiments are conducted to evaluate its performance. The statistical features of the artificial sequencing reads are comparable to the real tumor sequencing data whose sample consists of multiple sub-clones.
机译:近年来,下一代测序使我们能够在单核苷酸水平获得高分辨率的遗传变化图。提出了越来越多的新颖方法来对癌症测序数据进行有效和有效的分析。为了促进此类开发,数据模拟器是至关重要的工具,它不仅可以测试和评估所提出的方法,而且还可以提供反馈以进一步改进。发布了一些模拟器以生成下一代测序数据。但是,根据我们的最佳知识,他们都没有考虑克隆性信息。提示克隆异质性确实广泛存在于肿瘤样品中。体细胞突变事件的模式通常会暴露出广泛的变体等位基因频率,而其中一些仅可在一个或多个克隆谱系中检测到。在本文中,我们介绍了一种肿瘤正常测序模拟器TNSim,它通过涉及克隆性信息来生成下一代测序数据。该模拟器能够模拟肿瘤样本和配对的正常样本,在此处可以分别确定种系变体和体细胞突变。肿瘤纯度是可调的。克隆结构被预先指定为一个或多个克隆谱系,其中每个谱系由一组体细胞突变组成,其突变等位基因频率相似。进行了一组实验以评估其性能。人工测序读数的统计特征与真实肿瘤测序数据相当,后者的样品由多个亚克隆组成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号