Efficient Algorithms for Locating the Length-Constrained Heaviest Segments, with Applications to Biomolecular Sequence Analysis

机译：用于定位长度约束最重的段的高效算法，具有生物分子序列分析的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study two fundamental problems concerning the search for interesting regions in sequences: (i) given a sequence of real numbers of length n and an upper bound U, find a consecutive subsequence of length at most U with the maximum sum and (ii) given a sequence of real numbers of length n and a lower bound L, find a consecutive subsequence of length at least L with the maximum average. We present an O(n)-time algorithm for the first problem and an O(n log L)-time algorithm for the second. The algorithms have potential applications in several areas of biomolecular sequence analysis including locating GC-rich regions in a genomic DNA sequence, post-processing sequence alignments, annotating multiple sequence alignments, and computing length-constrained ungapped local alignment. Our preliminary tests on both simulated and real data demonstrate that the algorithms are very efficient and able to locate useful (such as GC-rich) regions.

机译：我们研究了关于搜索序列有趣区域的两个基本问题：（i）给定一系列实数长度n和上限U，找到大多数U的连续子率，最大和（ii）给出长度n和下限L的一系列实数为n和下限L，在最大平均值中找到长度的长度的连续子值。我们为第一问题提供了一个O（n）-time算法和第二个问题的O（n log l）-time算法。该算法具有在若干生物分子序列分析区域中的潜在应用，包括在基因组DNA序列中定位富含GC的区域，后处理序列对准，注释多个序列比对，以及计算长度约束的未被局部对准。我们对模拟和实际数据的初步测试表明算法非常有效，能够定位有用（如GC-Rich）区域。

著录项

来源
《International symposium on mathematical foundtions of computer science》|2002年||共12页
会议地点
作者
Yaw-Ling Lin; Tao Jiang; Kun-Mao Chao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
algorithm; efficiency; maximum consecutive subsequence; length constraint; biomolecular sequence analysis; ungapped local alignment;

机译：算法;效率;最大连续的子宫;长度约束;生物分子序列分析;未拍摄的局部对齐;

相似文献

外文文献
中文文献
专利

1. Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis [J] . Yaw-Ling Lin, Tao Jiang, Kun-Mao Chao Journal of computer and system sciences . 2002,第3期

机译：定位受长度限制的最重链段的高效算法及其在生物分子序列分析中的应用
2. BpMatch: An Efficient Algorithm for a Segmental Analysis of Genomic Sequences [J] . Felicioli Claudio Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2012,第4期

机译：BpMatch：用于基因组序列分段分析的高效算法
3. Efficient learning algorithm for sparse subsequence pattern-based classification and applications to comparative animal trajectory data analysis [J] . Sakuma Takuto, Nishi Kazuya, Kishimoto Kaoru, Advanced Robotics: The International Journal of the Robotics Society of Japan . 2019,第3a4期

机译：基于稀疏后续模式的分类和应用的高效学习算法对比较动物轨迹数据分析
4. Efficient Algorithms for Locating the Length-Constrained Heaviest Segments, with Applications to Biomolecular Sequence Analysis [C] . Yaw-Ling Lin, Tao Jiang, Kun-Mao Chao International symposium on mathematical foundtions of computer science . 2002

机译：用于定位长度约束最重的段的高效算法，具有生物分子序列分析的应用
5. Efficient algorithms for large data sets of genomic sequences in microbial community analysis. [D] . Knox, David A. 2010

机译：微生物群落分析中基因组序列大数据集的高效算法。
6. Applications of parallel processing algorithms for DNA sequence analysis. [O] . J F Collins, A F Coulson 1984

机译：并行处理算法在DNA序列分析中的应用。
7. Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis [O] . Yaw-ling Lin, Tao Jiang, Kun-mao Chao 2002

机译：定位长度受限制的最重链段的高效算法及其在生物分子序列分析中的应用
8. Efficient algorithms and data structures in support of DNA mapping and sequence analysis. Progress report, February 1991--February 1992 [R] . Gusfield, D, Lawler, EL, Balasubramanian, K, 1992

机译：高效的算法和数据结构，支持DNa映射和序列分析。进展报告，1991年2月至1992年2月

Efficient Algorithms for Locating the Length-Constrained Heaviest Segments, with Applications to Biomolecular Sequence Analysis

摘要

著录项

相似文献

相关主题

期刊订阅