首页> 外文会议>Algorithms in Bioinformatics >Detecting Repeat Families in Incompletely Sequenced Genomes
【24h】

Detecting Repeat Families in Incompletely Sequenced Genomes

机译:在不完全测序的基因组中检测重复家族

获取原文
获取原文并翻译 | 示例

摘要

Repeats form a major class of sequence in genomes with implications for functional genomics and practical problems. Their detection and analysis pose a number of challenges in genomic sequence analysis, especially if the genome is not completely sequenced. The most abundant and evolutionary active forms of repeats are found in the form of families of long similar sequences. We present a novel method for repeat family detection and characterization in cases where the target genome sequence is not completely known. Therefore we first establish the sequence graph, a compacted version of sparse de Bruijn graphs. Using appropriate analysis of the structure of this graph and its connected components after local modifications, we are able to devise two algorithms for repeat family detection. The applicability of the methods is shown for both simulated and real genomic data sets.
机译:重复序列是基因组中一类主要的序列,对功能基因组学和实际问题具有影响。它们的检测和分析在基因组序列分析中提出了许多挑战,尤其是在基因组未完全测序的情况下。重复序列中最丰富,进化最活跃的形式是长相似序列家族。在目标基因组序列不完全清楚的情况下,我们提出了一种新的重复家族检测和表征方法。因此,我们首先建立序列图,这是稀疏de Bruijn图的压缩形式。在局部修改后,通过对此图的结构及其连接的组件进行适当的分析,我们能够设计出两种用于重复族检测的算法。示出了该方法对于模拟和实际基因组数据集的适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号