首页> 外文期刊>Journal of neurosurgical sciences >Pattern Matching and Consensus Problems on Weighted Sequences and Profiles
【24h】

Pattern Matching and Consensus Problems on Weighted Sequences and Profiles

机译:加权序列和简档的模式匹配和共识问题

获取原文
获取原文并翻译 | 示例
       

摘要

We study pattern matching problems on two major representations of uncertain sequences used in molecular biology: weighted sequences (also known as position weight matrices, PWM) and profiles (scoring matrices). In the simple version, in which only the pattern or only the text is uncertain, we obtain efficient algorithms with theoretically-provable running times using a variation of the lookahead scoring technique. We also consider a general variant of the pattern matching problems in which both the pattern and the text are uncertain. Central to our solution is a special case where the sequences have equal length, called the consensus problem. We propose algorithms for the consensus problem parameterised by the number of strings that match one of the sequences. As our basic approach, a careful adaptation of the classic meet-in-the-middle algorithm for the knapsack problem is used. On the lower bound side, we prove that our dependence on the parameter is optimal up to lower-order terms conditioned on the optimality of the original algorithm for the knapsack problem. Therefore, we make an effort to keep the lower order terms of the complexities of our algorithms as small as possible.
机译:我们研究了分子生物学中使用不确定序列的两个主要表示的模式匹配问题:加权序列(也称为位置权重矩阵,PWM)和轮廓(评分矩阵)。在简单的版本中,其中仅模式或仅文本是不确定的,我们使用寻线评分技术的变化获得理论上可提供的运行时间的高效算法。我们还考虑了模式匹配问题的一般变体,其中图案和文本都不确定。我们解决方案的核心是一个特殊情况,序列具有相同的长度,称为共识问题。我们提出了通过匹配其中一个序列的字符串数参数化的共识问题的算法。作为我们的基本方法,使用了仔细适应Charace-In-In-中间算法的Knapsack问题。在下限方面,我们证明我们对参数的依赖性是最佳的,在对背包问题的原始算法的最优算法的最优性上的较低阶项。因此,我们努力将我们算法的复杂性的较低顺序保持尽可能小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号