首页> 外文期刊>Information Sciences: An International Journal >Bulk Aligner: A novel sequence alignment algorithm based on graph theory and Trinity
【24h】

Bulk Aligner: A novel sequence alignment algorithm based on graph theory and Trinity

机译:本体比对:一种基于图论和Trinity的新型序列比对算法

获取原文
获取原文并翻译 | 示例
           

摘要

Sequence alignment is a widely-used tool in genomics. With the development of next generation sequencing (NGS) technology, the production of sequence read data has recently increased. A number of read alignment algorithms for handling NGS data have been developed. However, these algorithms suffer from a trade-off between the throughput and alignment quality, due to the large computational costs for processing repeat reads. Conversely, alignment algorithms with distributed systems such as Hadoop and Trinity can obtain a better throughput than existing algorithms on single machine without compromising the alignment quality. In this paper, we suggest BulkAligner, a novel sequence alignment algorithm on the graph-based in-memory distributed system Trinity. We covert the original reference sequence into graph form and perform sequence alignment by finding the longest paths on the graph. Our experimental results show that BulkAligner has at least an 1.8x and up to 57x better throughput with the same, or higher quality than existing algorithms with Hadoop. We analyze the scalability and show that we can obtain a better throughput by simply adding machines. (C) 2015 Elsevier Inc. All rights reserved.
机译:序列比对是基因组学中广泛使用的工具。随着下一代测序(NGS)技术的发展,序列读取数据的产生最近有所增加。已经开发了许多用于处理NGS数据的读取比对算法。然而,由于用于处理重复读取的大量计算成本,这些算法在吞吐量和对准质量之间进行折衷。相反,与分布式系统(例如Hadoop和Trinity)相比,对齐算法可以获得比单机上现有算法更好的吞吐量,而不会影响对齐质量。在本文中,我们建议BulkAligner,这是一种基于图的内存分布式系统Trinity的新型序列比对算法。我们将原始参考序列转换为图形形式,并通过在图形上找到最长的路径来执行序列比对。我们的实验结果表明,与采用Hadoop的现有算法相比,BulkAligner在相同或更高质量的情况下,吞吐量至少提高了1.8倍,最高提高了57倍。我们分析了可伸缩性,并表明仅添加计算机即可获得更好的吞吐量。 (C)2015 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号