首页> 外文会议>Frontiers in algorithmics >Efficient Algorithms for the Closest String and Distinguishing String Selection Problems
【24h】

Efficient Algorithms for the Closest String and Distinguishing String Selection Problems

机译:最接近的字符串和区分字符串选择问题的高效算法

获取原文
获取原文并翻译 | 示例

摘要

In the paper, we study three related problems, the closest string problem, the farthest string problem and the distinguishing string selection problem. These problems have applications in motif detection, binding sites locating, genetic drug target identification, genetic probes design, universal PCR primer design, etc. They have been extensively studied in recent years.rnThe problems are defined as follows:rnThe closest string problem: given a group of strings B = {s_1, s_2, ..., s_n}, each of length L, and an integer d, the problem is to compute a center string s of length L such that the Hamming distance d(s, s_i) ≤ d for all s_y ∈ B.rnThe farthest string problem: given a group of strings g = {g_1, g_2, ..., g_(n2)}; with all strings of the same length L, and an integer d_b, the farthest string problem is to compute a center string s of length L such that the Hamming distance d(s,g_j) ≥ L - d_b for all g_j ∈g.rnThe distinguishing string selection problem: given two groups of strings B (bad genes) and g (good genes), B = {s_1, s_2,..., s_(n1)} and g = {g_(n1+1), g_(n1+2), ...,g_(n2)}, with all strings of the same length L, and two integers d_b and d_g with d_g ≥ L - d_b, the Distinguishing String Selection problem is to compute a center string s of length L such that the Hamming distance d(s, s_i) ≤ d_b, ∨s_i ∈ B and the Hamming distance d(s, g_j) ≥ d_g for all g_j ∈g.rnOur results: We design an O(Ln + nd(|∑ - 1|)~d2~(3.25d)) time fixed parameter algorithm for the closest string problem which improves upon the best known O(Ln + nd2~(4d) × (|∑| - 1)~d) algorithm in [14], where |∑| is the size of the alphabet. We also design fixed parameter algorithms for both the farthest string problem and the distinguishing string selection problem. Both algorithms run in time O(Ln + nd2~(3.25d)) when the input strings are binary strings over E = {0,1}.
机译:在本文中,我们研究了三个相关的问题,最接近的字符串问题,最远的字符串问题和区分字符串的选择问题。这些问题在基序检测,结合位点定位,遗传药物靶标识别,遗传探针设计,通用PCR引物设计等方面都有应用。近年来已对其进行了广泛的研究。问题定义如下:最接近的字符串问题:一组字符串B = {s_1,s_2,...,s_n},每个字符串的长度为L,并且为整数d,问题是要计算长度为L的中心字符串s,以使汉明距离d(s,s_i )对于所有s_y∈B,≤d.rn最远的字符串问题:给定一组字符串g = {g_1,g_2,...,g_(n2)};对于所有具有相同长度L且具有整数d_b的字符串,最远的字符串问题是计算长度L的中心字符串s,以使所有g_j∈g的汉明距离d(s,g_j)≥L-d_b区分字符串选择问题:给定两组字符串B(坏基因)和g(好基因),B = {s_1,s_2,...,s_(n1)},而g = {g_(n1 + 1),g_ (n1 + 2),...,g_(n2)},并且所有字符串的长度都相同,并且两个整数d_b和d_g且d_g≥L-d_b,区分字符串选择问题是计算中心字符串s长度L使得汉明距离d(s,s_i)≤d_b,∨s_i∈B和汉明距离d(s,g_j)≥d_g对于所有g_j∈g.rn我们的结果:我们设计了一个O(Ln + nd (| ∑-1 |)〜d2〜(3.25d))时间固定参数算法,用于最接近的字符串问题,改进了最著名的O(Ln + nd2〜(4d)×(| ∑ |-1)〜d) [14]中的算法,其中| ∑ |是字母的大小。我们还针对最远的字符串问题和区分字符串的选择问题设计了固定参数算法。当输入字符串是E = {0,1}上的二进制字符串时,两种算法都在时间O(Ln + nd2〜(3.25d))上运行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号