Space-efficient genome comparisons with compressed full-text indexes

机译：利用压缩的全文本索引进行节省空间的基因组比较

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Comparative genomics is the study of the relationship of genome structure and function across different biological species or strains. The starting point for any comparison of mammalian genomes is the computation of exact matches between their DNA sequences. It is well-known how to do this time-efficiently with full-text index structures like the suffix tree or the suffix array. The space consumption of these indexes is often the limiting factor in whole-genome comparative projects and other large-scale applications. Fortunately, in the last years research on compressed full-text index structures has flourished, and algorithms for the computation of common k-mers and maximal unique matches on compressed indexes have been proposed. However, for the important class of maximal exact matches such an algorithm has not been provided. In this paper, we present the first algorithm for the computation of maximal exact matches on a compressed full-text index.

机译：比较基因组学是研究不同生物物种或菌株之间基因组结构和功能之间关系的研究。进行任何哺乳动物基因组比较的起点是计算它们的DNA序列之间的精确匹配。众所周知，如何使用全文索引结构（例如后缀树或后缀数组）高效地执行此操作。这些指标的空间消耗通常是全基因组比较项目和其他大规模应用中的限制因素。幸运的是，近年来，对压缩全文本索引结构的研究蓬勃发展，并且提出了用于计算公共k-mers和最大唯一匹配项的算法。但是，对于重要的最大精确匹配类别，尚未提供这种算法。在本文中，我们提出了一种用于在压缩的全文本索引上计算最大精确匹配的第一种算法。

著录项

来源
《2nd international conference on bioinformatics and computational biology 2010》|2010年|P.19-24|共6页
会议地点 Honolulu HI(US);Honolulu HI(US)
作者
Enno Ohlebusch; Simon Gog;
展开▼
作者单位

Theoretical Computer Science University of Ulm D-89069 Ulm, Germany;

rnTheoretical Computer Science University of Ulm D-89069 Ulm, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词

相似文献

外文文献
中文文献
专利

1. Space-efficient construction of Lempel-Ziv compressed text indexes Diego Arroyuelo [J] . Gonzalo Navarro Information and computation . 2011,第7期

机译：Lempel-Ziv压缩文本索引的空间高效构造Diego Arroyuelo
2. Distribution-Aware Compressed Full-Text Indexes [J] . Paolo Ferragina, Jouni Siren, Rossano Venturini Algorithmica . 2013,第4期

机译：分发感知的压缩全文本索引
3. Compressed Full-Text Indexes [J] . GONZALO NAVARRO, VELI MAEKINEN ACM Computing Surveys . 2007,第1期

机译：压缩全文索引
4. Space-efficient genome comparisons with compressed full-text indexes [C] . International conference on bioinformatics and computational biology . 2010

机译：空间高效的基因组比较，具有压缩的全文索引
5. Comparison of multiple antibiotic resistant Staphylococcus aureus genomes and the genome structure of Elizabethkingia meningoseptica. [D] . Matyi, Stephanie Ann. 2014

机译：多种抗生素抗性金黄色葡萄球菌基因组和伊利沙伯菌脑膜败血病基因组结构的比较。
6. The effects of sampling on the efficiency and accuracy of k−mer indexes: Theoretical and empirical comparisons using the human genome [O] . Meznah Almutairy, Eric Torng 2011

机译：采样对k-mer索引的效率和准确性的影响：使用人类基因组的理论和经验比较
7. Space-Efficient Construction of Compressed Indexes in Deterministic Linear Time [O] . J. Ian Munro, Gonzalo Navarro, Yakov Nekrich 2017

机译：确定性线性时间中的空间高效构造压缩指标

Space-efficient genome comparisons with compressed full-text indexes

摘要

著录项

相似文献

相关主题

期刊订阅