首页> 外文会议>Algorithms in Bioinformatics >Fast Algorithms for Finding Maximum-Density Segments of a Sequence with Applications to Bioinformatics
【24h】

Fast Algorithms for Finding Maximum-Density Segments of a Sequence with Applications to Bioinformatics

机译:查找序列最大密度片段的快速算法及其在生物信息学中的应用

获取原文

摘要

We study an abstract optimization problem arising from biomolecular sequence analysis. For a sequence A = of real numbers, a segment 5 is a consecutive subsequence . The width of S is j ― i + 1, while the density is (Σ_(i≤k≤j)a_k)/(j ― i+1). The maximum-density segment problem takes A and two integers L and U as input and asks for a segment of A with the largest possible density among those of width at least L and at most U. If U = n (or equiva-lently, U = 2L ― 1), we can solve the problem in O(n) time, improving upon the O(n log L)-time algorithm by Lin, Jiang and Chao for a general sequence A. Furthermore, if U and L are arbitrary, we solve the problem in O(n + n log(U ― L + 1)) time. There has been no nontrivial result for this case previously. Both results also hold for a weighted variant of the maximum-density segment problem.
机译:我们研究了由生物分子序列分析引起的抽象优化问题。对于实数序列A = ,段5是连续的子序列。 S的宽度为j ― i + 1,而密度为(Σ_(i≤k≤j)a_k)/(j ― i + 1)。最大密度线段问题将A和两个整数L和U作为输入,并要求A的线段中宽度至少为L且最大为U的线段中密度最大。如果U = n(或等价地, U = 2L -1),我们可以解决O(n)时间上的问题,对Lin,Jiang和Chao的O(n log L)时间算法进行了改进,得到了一般序列A.此外,如果U和L为任意,我们解决了O(n + n log(U ― L + 1))时间的问题。以前没有发生过这种情况的重要结果。这两个结果也适用于最大密度段问题的加权变体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号