首页> 外文期刊>Algorithmica >Homogeneous String Segmentation using Trees and Weighted Independent Sets
【24h】

Homogeneous String Segmentation using Trees and Weighted Independent Sets

机译:使用树和加权独立集的同构字符串分割

获取原文
获取原文并翻译 | 示例
       

摘要

We divide a string into k segments, each with only one sort of symbols, so as to minimize the total number of exceptions. Motivations come from machine learning and data mining. For binary strings we develop a linear-time algorithm for any k. Key to efficiency is a special-purpose data structure, called W-tree, which reflects relations between repetition lengths of symbols. For non-binary strings we give a nontrivial dynamic programming algorithm. Our problem is equivalent to finding weighted independent sets with certain size constraints, either in paths (binary case) or special interval graphs (general case). We also show that this problem is FPT in bounded-degree graphs.
机译:我们将一个字符串分为k个段,每个段仅包含一种符号,以最大程度地减少异常总数。动机来自机器学习和数据挖掘。对于二进制字符串,我们为任何k开发了线性时间算法。效率的关键是一种特殊的数据结构,称为W树,它反映符号重复长度之间的关系。对于非二进制字符串,我们给出了一个非平凡的动态规划算法。我们的问题等同于找到具有一定大小限制的加权独立集,无论是在路径(二进制情况)还是特殊间隔图(一般情况)中。我们还表明,在有界图中此问题是FPT。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号