首页> 美国卫生研究院文献>Bioinformatics >Fast characterization of segmental duplications in genome assemblies
【2h】

Fast characterization of segmental duplications in genome assemblies

机译:快速表征基因组装配中的节段重复

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

MotivationSegmental duplications (SDs) or low-copy repeats, are segments of DNA > 1 Kbp with high sequence identity that are copied to other regions of the genome. SDs are among the most important sources of evolution, a common cause of genomic structural variation and several are associated with diseases of genomic origin including schizophrenia and autism. Despite their functional importance, SDs present one of the major hurdles for de novo genome assembly due to the ambiguity they cause in building and traversing both state-of-the-art overlap-layout-consensus and de Bruijn graphs. This causes SD regions to be misassembled, collapsed into a unique representation, or completely missing from assembled reference genomes for various organisms. In turn, this missing or incorrect information limits our ability to fully understand the evolution and the architecture of the genomes. Despite the essential need to accurately characterize SDs in assemblies, there has been only one tool that was developed for this purpose, called Whole-Genome Assembly Comparison (WGAC); its primary goal is SD detection. WGAC is comprised of several steps that employ different tools and custom scripts, which makes this strategy difficult and time consuming to use. Thus there is still a need for algorithms to characterize within-assembly SDs quickly, accurately, and in a user friendly manner.
机译:动机分段重复(SD)或低拷贝重复序列是具有高序列同一性的DNA 1 Kbp片段,被复制到基因组的其他区域。 SD是最重要的进化来源之一,是基因组结构变异的常见原因,并且几种与包括精神分裂症和自闭症在内的基因组疾病有关。尽管SD具有功能上的重要性,但由于它们在构建和遍历最新的重叠布局共识图和de Bruijn图时造成歧义,因此它们成为从头进行基因组组装的主要障碍之一。这导致SD区域被错误组装,折叠成独特的表示形式,或者从各种生物体的组装参考基因组中完全缺失。反过来,这些丢失或不正确的信息限制了我们充分了解基因组进化和结构的能力。尽管有必要对装配中的SD进行准确表征,但只有一个为此目的而开发的工具称为全基因组装配比较(WGAC);它的主要目标是SD检测。 WGAC由使用不同工具和自定义脚本的几个步骤组成,这使得该策略难以使用且耗时。因此,仍然需要用于以快速,准确且以用户友好的方式表征组装内SD的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号