首页> 外文期刊>Bioinformatics >Fast characterization of segmental duplications in genome assemblies
【24h】

Fast characterization of segmental duplications in genome assemblies

机译:基因组组件中节段重复的快速表征

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: Segmental duplications (SDs) or low-copy repeats, are segments of DNA & 1 Kbp with high sequence identity that are copied to other regions of the genome. SDs are among the most important sources of evolution, a common cause of genomic structural variation and several are associated with diseases of genomic origin including schizophrenia and autism. Despite their functional importance, SDs present one of the major hurdles for de novo genome assembly due to the ambiguity they cause in building and traversing both state-of-the-art overlap-layout-consensus and de Bruijn graphs. This causes SD regions to be misassembled, collapsed into a unique representation, or completely missing from assembled reference genomes for various organisms. In turn, this missing or incorrect information limits our ability to fully understand the evolution and the architecture of the genomes. Despite the essential need to accurately characterize SDs in assemblies, there has been only one tool that was developed for this purpose, called Whole-Genome Assembly Comparison (WGAC); its primary goal is SD detection. WGAC is comprised of several steps that employ different tools and custom scripts, which makes this strategy difficult and time consuming to use. Thus there is still a need for algorithms to characterize within-assembly SDs quickly, accurately, and in a user friendly manner.
机译:动机:节段性重复(SDS)或低拷贝重复,是DNA&amp的段; GT; 1 kBp具有高序列同一性,被复制到基因组的其他区域。 SDS是最重要的进化来源之一,基因组结构变异的常见原因和几种与基因组起源的疾病有关,包括精神分裂症和自闭症。尽管他们的功能性重要性,SDS由于他们在建造和遍历了最先进的重叠布局 - 共识和DE BRUIJN图表中,SDS为De Novo Genome组装的主要障碍之一。这使得SD区域被误解,折叠成独特的表示,或者完全缺少各种生物的组装参考基因组。反过来,这种缺失或不正确的信息限制了我们充分了解基因组的演变和建筑的能力。尽管必须准确地表征组件中的SDS,但只有一个为此目的开发的工具,称为全基因组装配比较(WGAC);其主要目标是SD检测。 WGAC由采用不同工具和自定义脚本的几个步骤组成,这使得该策略难以使用且耗时。因此,仍然需要算法以快速,准确,并以用户友好的方式在组装内SDS内部表征。

著录项

  • 来源
    《Bioinformatics》 |2018年第17期|共9页
  • 作者单位

    MIT Comp Sci &

    Artificial Intelligence Lab 77 Massachusetts Ave Cambridge MA 02139 USA;

    Bilkent Univ Dept Comp Engn TR-06800 Ankara Turkey;

    MIT Comp Sci &

    Artificial Intelligence Lab 77 Massachusetts Ave Cambridge MA 02139 USA;

    MIT Comp Sci &

    Artificial Intelligence Lab 77 Massachusetts Ave Cambridge MA 02139 USA;

    Bilkent Univ Dept Comp Engn TR-06800 Ankara Turkey;

    Vancouver Prostate Ctr Vancouver BC V6H 3Z9 Canada;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物工程学(生物技术);
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号