首页> 外文会议>International conference on bioinformatics and computational biology >Space-efficient genome comparisons with compressed full-text indexes
【24h】

Space-efficient genome comparisons with compressed full-text indexes

机译:空间高效的基因组比较,具有压缩的全文索引

获取原文

摘要

Comparative genomics is the study of the relationship of genome structure and function across different biological species or strains. The starting point for any comparison of mammalian genomes is the computation of exact matches between their DNA sequences. It is well-known how to do this time-efficiently with full-text index structures like the suffix tree or the suffix array. The space consumption of these indexes is often the limiting factor in whole-genome comparative projects and other large-scale applications. Fortunately, in the last years research on compressed full-text index structures has flourished, and algorithms for the computation of common k-mers and maximal unique matches on compressed indexes have been proposed. However, for the important class of maximal exact matches such an algorithm has not been provided. In this paper, we present the first algorithm for the computation of maximal exact matches on a compressed full-text index.
机译:对比基因组学是研究基因组结构与不同生物物种或菌株的功能的关系。哺乳动物基因组的任何比较的起点是计算其DNA序列之间的精确匹配。众所周知,如何与后缀树或后缀数组中的全文索引结构有效地进行此项时间。这些指标的空间消耗通常是全基因组比较项目和其他大规模应用的限制因素。幸运的是,在过去几年中,对压缩的全文索引结构进行了繁荣,已经提出了计算普通K-MERS和压缩索引上的最大独特匹配的算法。但是,对于重要的最大精确匹配,尚未提供这种算法。在本文中,我们介绍了第一种计算压缩全文索引上的最大精确匹配的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号