首页> 外文会议>2nd international conference on bioinformatics and computational biology 2010 >Space-efficient genome comparisons with compressed full-text indexes
【24h】

Space-efficient genome comparisons with compressed full-text indexes

机译:利用压缩的全文本索引进行节省空间的基因组比较

获取原文
获取原文并翻译 | 示例

摘要

Comparative genomics is the study of the relationship of genome structure and function across different biological species or strains. The starting point for any comparison of mammalian genomes is the computation of exact matches between their DNA sequences. It is well-known how to do this time-efficiently with full-text index structures like the suffix tree or the suffix array. The space consumption of these indexes is often the limiting factor in whole-genome comparative projects and other large-scale applications. Fortunately, in the last years research on compressed full-text index structures has flourished, and algorithms for the computation of common k-mers and maximal unique matches on compressed indexes have been proposed. However, for the important class of maximal exact matches such an algorithm has not been provided. In this paper, we present the first algorithm for the computation of maximal exact matches on a compressed full-text index.
机译:比较基因组学是研究不同生物物种或菌株之间基因组结构和功能之间关系的研究。进行任何哺乳动物基因组比较的起点是计算它们的DNA序列之间的精确匹配。众所周知,如何使用全文索引结构(例如后缀树或后缀数组)高效地执行此操作。这些指标的空间消耗通常是全基因组比较项目和其他大规模应用中的限制因素。幸运的是,近年来,对压缩全文本索引结构的研究蓬勃发展,并且提出了用于计算公共k-mers和最大唯一匹配项的算法。但是,对于重要的最大精确匹配类别,尚未提供这种算法。在本文中,我们提出了一种用于在压缩的全文本索引上计算最大精确匹配的第一种算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号