首页> 外文期刊>BMC Genomics >Identifying micro-inversions using high-throughput sequencing reads
【24h】

Identifying micro-inversions using high-throughput sequencing reads

机译:使用高通量测序读数鉴定微转化

获取原文
           

摘要

The identification of inversions of DNA segments shorter than read length (e.g., 100?bp), defined as micro-inversions (MIs), remains challenging for next-generation sequencing reads. It is acknowledged that MIs are important genomic variation and may play roles in causing genetic disease. However, current alignment methods are generally insensitive to detect MIs. Here we develop a novel tool, MID (Micro-Inversion Detector), to identify MIs in human genomes using next-generation sequencing reads. The algorithm of MID is designed based on a dynamic programming path-finding approach. What makes MID different from other variant detection tools is that MID can handle small MIs and multiple breakpoints within an unmapped read. Moreover, MID improves reliability in low coverage data by integrating multiple samples. Our evaluation demonstrated that MID outperforms Gustaf, which can currently detect inversions from 30?bp to 500?bp. To our knowledge, MID is the first method that can efficiently and reliably identify MIs from unmapped short next-generation sequencing reads. MID is reliable on low coverage data, which is suitable for large-scale projects such as the 1000 Genomes Project (1KGP). MID identified previously unknown MIs from the 1KGP that overlap with genes and regulatory elements in the human genome. We also identified MIs in cancer cell lines from Cancer Cell Line Encyclopedia (CCLE). Therefore our tool is expected to be useful to improve the study of MIs as a type of genetic variant in the human genome. The source code can be downloaded from: http://cqb.pku.edu.cn/ZhuLab/MID .
机译:短于读取长度(例如100?bp)的DNA片段的倒位识别(定义为微倒位(MIs))对于下一代测序读取仍然具有挑战性。公认的是,MIs是重要的基因组变异,并可能在引起遗传疾病中起作用。然而,当前的对准方法通常对检测MI不敏感。在这里,我们开发了一种新型工具MID(微倒置检测器),可以使用下一代测序读取来识别人类基因组中的MI。 MID算法是基于动态编程路径查找方法设计的。 MID与其他变种检测工具的不同之处在于MID可以处理未映射的读取中的小型MI和多个断点。此外,MID通过集成多个样本提高了低覆盖率数据的可靠性。我们的评估表明,MID的性能优于Gustaf,后者目前可以检测到从30?bp到500?bp的反转。据我们所知,MID是能够从未映射的下一代短测序读段中高效,可靠地识别MI的第一种方法。 MID在低覆盖率数据上是可靠的,适用于大型项目,例如1000基因组计划(1KGP)。 MID从1KGP中识别出先前未知的MI,这些MI与人类基因组中的基因和调控元件重叠。我们还从癌细胞系百科全书(CCLE)中鉴定了癌细胞系中的MI。因此,我们的工具有望用于改善MIs的研究,MIs是人类基因组中的一种遗传变异。可以从以下网址下载源代码:http://cqb.pku.edu.cn/ZhuLab/MID。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号