首页> 外文会议>International Conference on Bioinformatics Models, Methods and Algorithms >De Novo Short Read Assembly Algorithm with Low Memory Usage
【24h】

De Novo Short Read Assembly Algorithm with Low Memory Usage

机译:低内存用法的De Novo短读装配算法

获取原文

摘要

Determining whole genome sequences of various species has many applications not only in biological system, but also in medicine, pharmacy and agriculture. In recent years, the emergence of high-throughput next generation sequencing technologies has dramatically reduced time and costs for whole genome sequencing. These new technologies provide ultrahigh throughput with lower unit data cost. However, the data are very short length fragments of DNA. Thus, developing algorithms for merging these fragments is very important. Merging these fragments without reference data is called de novo assembly. Many algorithms for de novo assembly have been proposed in recent years. Velvet, one of the algorithms, is famous because it has good performance in terms of memory and time consumption. But memory consumption increases dramatically when the size of input fragments is huge. Therefore, it is necessary to develop algorithm with low memory usage. In this paper, we propose an algorithm for de novo assembly with lower memory. In our experiments using E.coli K-12 strain MG 1655, memory consumption of the proposed algorithm was one-third of that of Velvet.
机译:确定各种种类的全基因组序列不仅在生物系统中的应用,而且在医学,药房和农业中具有许多应用。近年来,高通量下一代测序技术的出现显着降低了全基因组测序的时间和成本。这些新技术提供了具有较低单元数据成本的超高吞吐量。但是,数据是DNA的非常短的长度碎片。因此,用于合并这些碎片的开发算法非常重要。合并无参考数据的这些片段被称为de novo集装件。近年来提出了De Novo集会的许多算法。天鹅绒,其中一个算法,是着名的,因为它在内存和时间消耗方面具有良好的性能。但是当输入片段的大小是巨大的,内存消耗急剧增加。因此,有必要开发具有低内存使用率的算法。在本文中,我们提出了一种具有较低存储器的De Novo组件的算法。在我们使用E.coli K-12菌株MG 1655的实验中,所提出的算法的存储器消耗量为天鹅绒的三分之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号