首页> 外文期刊>Nucleic Acids Research >The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection
【24h】

The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection

机译:缺失的插入缺失:估计人类基因组中插入缺失变异并分析阻碍检测的因素

获取原文
获取原文并翻译 | 示例
           

摘要

With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them.
机译:随着高通量测序(HTS)的发展,现已对成千上万的人类基因组进行了测序。每当不同的研究分析相同的基因组时,它们通常在单核苷酸多态性的数量上是一致的,但是在插入和缺失变体(indels)的数量上却有很大差异。此外,有证据表明,插入缺失经常被严重低估。在这份手稿中,我们结合了来自不同测序技术的数据,同时评估了插入缺失检测的准确性,从而得出了人类基因组中插入缺失变异体的总数。我们估计Yoruban基因组中约有1百万个插入缺失,这比最近几项HTS研究报告的结果要高得多。我们确定indel检测困难的两个关键原因:覆盖范围不足,读取长度或对齐质量;重复的存在,包括短的散布元素和均聚物/二聚体。我们量化这些因素对插入缺失检测的影响。测序数据的质量在改善HTS方法对indel的检测中起着重要作用。但是,许多indel存在于长均聚物和重复序列中,严重阻碍了其检测。插入缺失事件的真实数量可能甚至比我们目前的估计还要高,并且需要新的技术来检测它们。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号