Efficient techniques for k-mer counting

机译：高效的k-mer计数技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A large number of bioinformatics applications require counting of k-length substrings in genetically important long strings. K-mer counting generates the frequencies of each k-length substring in genome sequences. Genome assembly, repeat detection, multiple sequence alignment, error detection, and many other related applications use k-mer counting as a building block. Many approaches are already available to address the problem. Some of them are time efficient, and some of them are memory efficient. Most of the current solutions use multi-threading to utilize available cores of a machine. A few efficient disk-based algorithms have been devised to reduce required memory. We analyze all available algorithms, and time and memory requirements of those implementations. We improve time consumption by devising a novel algorithm to this problem. Our results show that this new algorithm outperforms previous best-known algorithms.

机译：大量的生物信息学应用需要对具有重要遗传意义的长字符串中的k长度子字符串进行计数。 K聚体计数产生基因组序列中每个k长度子串的频率。基因组组装，重复检测，多序列比对，错误检测以及许多其他相关应用程序都使用k-mer计数作为构建模块。已经有许多方法可以解决该问题。其中一些是省时的，而某些则是内存的。当前大多数解决方案都使用多线程来利用计算机的可用内核。已经设计了一些基于磁盘的有效算法来减少所需的内存。我们分析了所有可用算法以及这些实现的时间和内存要求。通过针对此问题设计新颖的算法，我们提高了时间消耗。我们的结果表明，该新算法优于以前的最著名算法。

著录项

来源
《IEEE International Conference on Computational Advances in Bio and Medical Sciences》|2015年|1-1|共1页
会议地点
作者
Mamun Abdullah-Al; Pal Soumitra; Rajasekaran Sanguthevar;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. MQF and buffered MQF: quotient filters for efficient storage of k-mers with their counts and metadata [J] . Moustafa Shokrof, C. Titus Brown, Tamer A. Mansour BMC Bioinformatics . 2021,第1期

机译：MQF和Buffered MQF：商用过滤器，用于高效存储K-MERS及其计数和元数据
2. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers [J] . Carl Kingsford Bioinformatics . 2011,第6期

机译：快速，无锁的方法，可有效地并行计算k-mers的出现
3. Hierarchical Clustering of DNA k-mer Counts in RNAseq Fastq Files Identifies Sample Heterogeneities [J] . Wolfgang Kaisers?, Holger Schwender, Heiner Schaal? International Journal of Molecular Sciences . 2018,第11期

机译：RNAseq Fastq文件中DNA k-mer计数的分层聚类可确定样品异质性
4. Efficient techniques for k-mer counting [C] . Mamun Abdullah-Al, Pal Soumitra, Rajasekaran Sanguthevar IEEE International Conference on Computational Advances in Bio and Medical Sciences . 2015

机译：K-MER计数的高效技术
5. Sparse and Low-Rank Techniques for the Efficient Restoration of Images =Sparse and Low-Rank Techniques for the Efficient Restoration of Images [D] . Zhang, Mingli. 2017

机译：高效的图像稀疏和低秩技术=高效的图像稀疏和低秩技术
6. These Are Not the K-mers You Are Looking For: Efficient Online K-mer Counting Using a Probabilistic Data Structure [O] . Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, -1

机译：这些不是您要找的K-mer：使用概率数据结构的高效在线K-mer计数
7. These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure. [O] . Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, 2014

机译：这些不是您正在寻找的k-mers：使用概率数据结构进行有效的在线k-mer计数。

Efficient techniques for k-mer counting

摘要

著录项

相似文献

相关主题

期刊订阅