首页> 外文会议>IEEE International Conference on Healthcare Informatics >Kimimila: A New Model to Classify NGS Short Reads by Their Allele Origin
【24h】

Kimimila: A New Model to Classify NGS Short Reads by Their Allele Origin

机译:Kimimila:一种根据其等位基因来源对NGS短读进行分类的新模型

获取原文

摘要

Next generation sequencing (NGS) technologies, often referred to as massively parallel sequencing, are having a huge impact on genomics and clinical applications. These technologies generate billions of short sequences (reads) that are consequently mapped to their corresponding reference genome to find out known and/or novel genomic variants potentially correlated to patients phenotype. DNA fragment library is usually derived from a diploid genome: we refer to genotyping on NGS data as the analytical process to assign the zygosity of identified variants. Current algorithms typically rely on data of the single genomic locus where variants have been called and are based on the condition of independence between variant locus and reads. These strong assumptions might bring to possible inaccuracies throughout the genotyping process. We have therefore developed an efficient assumption-free algorithm based on a kinetic model approach and distance geometry (Kimimila) that delivers the belonging allele for each read using the inference provided by the measure of differences (i.e. Variants) among overlapping reads.
机译:下一代测序(NGS)技术(通常称为大规模并行测序)对基因组学和临床应用产生巨大影响。这些技术产生数十亿个短序列(读段),这些短序列随后被映射到其相应的参考基因组,以找出可能与患者表型相关的已知和/或新型基因组变异。 DNA片段文库通常来自二倍体基因组:我们将NGS数据的基因分型称为分析过程,以分配已鉴定变体的接合性。当前的算法通常依赖于已调用变体的单个基因组基因座的数据,并且基于变体基因座和读取之间的独立性条件。这些强有力的假设可能会在整个基因分型过程中带来可能的误差。因此,我们开发了一种基于动力学模型方法和距离几何(Kimimila)的高效无假设算法,该算法使用重叠读数之间差异(即变体)的量度提供的推论,为每个读数提供所属等位基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号