...
【24h】

On Indeterminate Strings Matching

机译:关于不确定的字符串匹配

获取原文

摘要

Given two indeterminate equal-length strings p and t with a set of characters per position in both strings, we obtain a determinate string p_w from p and a determinate string t_w from t by choosing one character per position. Then, we say that p and t match when p_w and t_w match for some choice of the characters. While the most standard notion of a match for determinate strings is that they are simply identical, in certain applications it is more appropriate to use other definitions, with the prime examples being parameterized matching, order-preserving matching, and the recently introduced Cartesian tree matching. We provide a systematic study of the complexity of string matching for indeterminate equal-length strings, for different notions of matching. We use n to denote the length of both strings, and r to be an upper-bound on the number of uncertain characters per position. First, we provide the first polynomial time algorithm for the Cartesian tree version that runs in deterministic e?'a(nlog?2 n) and expected e?'a(nlog nlog log n) time using e?'a(nlog n) space, for constant r. Second, we establish NP-hardness of the order-preserving version for r=2, thus solving a question explicitly stated by Henriques et al. [CPM 2018], who showed hardness for r=3. Third, we establish NP-hardness of the parameterized version for r=2. As both parameterized and order-preserving indeterminate matching reduce to the standard determinate matching for r=1, this provides a complete classification for these three variants.
机译:在两个字符串中给定两个不确定的等长字符串P和T,通过两个字符串中的每个位置的一组字符,通过选择每个位置的一个字符,从p和从t确定一个字符串t_w获得确定的字符串p_w。然后,我们说P_W和T_W匹配的P和T匹配某种选择的字符。虽然确定字符串的匹配的最标准概念是它们是简单的,但在某些应用程序中,使用其他定义更适合,其中序列示例是参数化匹配,订单保留匹配,最近推出的笛卡尔树匹配。我们提供了对不确定的等长字符串的串匹配的复杂性的系统研究,用于不同的匹配概念。我们使用n表示两个字符串的长度,并且r为每个位置不确定字符数的上限。首先,我们为笛卡尔树版本提供了第一个多项式时间算法,该笛卡尔树版本在确定性e?'a(nlog?2 n)和期望的e?'a(nlog nlog log n)时间使用e?'a(nlog n)空间,常量r。其次,我们建立了r = 2的秩序保存版本的NP硬度,从而解决了Henriques等人明确说明的问题。 [2018年CPM],为r = 3表示硬度。第三,我们为r = 2建立了参数化版本的NP硬度。作为参数化和订单保留的不确定匹配减少到R = 1的标准确定匹配,这为这三种变体提供了完整的分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号