首页> 外文期刊>Parallel Computing >Efficient CGM-based parallel algorithms for the longest common subsequence problem with multiple substring-exclusion constraints
【24h】

Efficient CGM-based parallel algorithms for the longest common subsequence problem with multiple substring-exclusion constraints

机译:具有多个子串排除约束的最长公共子序列问题的基于CGM的高效并行算法

获取原文
获取原文并翻译 | 示例
       

摘要

A variant of the Longest Common Subsequence (LCS) problem is the LCS problem with multiple substring-exclusion constraints (M-STR-EC-LCS), which has great importance in many fields especially in bioinformatics. This problem consists to compute the LCS of two strings X and Y of length n and m respectively that excluded a set of d constraints P = {P-1, P-2, ..., P-d) of total length r. Recently, Wang et al. proposed a sequential solution based on the dynamic programming technique that requires O(nmr) execution time and space. To the best of our knowledge, there is no parallel solutions for this problem. This paper describes new efficient parallel algorithms on Coarse Grained Multicomputer model (CGM) to solve this problem. Firstly, we propose a multi-level Direct Acyclic Graph (DAG) that determines the correct evaluation order of sub-problems in order to avoid redundancy due to overlap. Secondly, we propose two CGM parallel algorithms based on our DAG. The first algorithm is based on a regular partitioning of the DAG and requires O(nmr/p) execution time with O(p) communication rounds where p is the number of processors used. Its main drawback is high idleness time of processors because due to the dependencies between the nodes in the DAG, over time it has many idle processors. The second algorithm uses an irregular partitioning of the DAG that minimizes this idleness time by allowing the processors to stay active as long as possible. It requires O(nmr/p) execution time with O(kp) communication rounds. k is a constant integer allowing to setup the irregular partitioning. The both algorithms require O(r vertical bar Sigma vertical bar/p) preprocessing time where vertical bar Sigma vertical bar is the length of the alphabet. The experimental results performed show a good agreement with theoretical predictions. (C) 2019 Elsevier B.V. All rights reserved.
机译:最长公共子序列(LCS)问题的一个变体是具有多个子串排除约束的MCS问题(M-STR-EC-LCS),这在许多领域,特别是在生物信息学中具有重要意义。这个问题包括计算长度分别为n和m的两个字符串X和Y的LCS,它们排除了总长度r的一组d个约束P = {P-1,P-2,...,P-d)。最近,Wang等。提出了一种基于动态编程技术的顺序解决方案,需要O(nmr)执行时间和空间。据我们所知,没有并行解决此问题的方法。本文介绍了基于粗粒多计算机模型(CGM)的新型高效并行算法,以解决此问题。首先,我们提出一种多级直接非循环图(DAG),它确定子问题的正确评估顺序,以避免由于重叠而造成的冗余。其次,我们提出了两种基于DAG的CGM并行算法。第一种算法基于DAG的常规划分,并且需要O(nmr / p)执行时间以及O(p)个通信回合,其中p是所用处理器的数量。它的主要缺点是处理器的闲置时间长,因为由于DAG中节点之间的依赖性,随着时间的推移,它具有许多空闲处理器。第二种算法使用DAG的不规则分区,通过允许处理器尽可能长时间保持活动状态,从而最大程度地减少了空闲时间。它需要O(nmr / p)执行时间和O(kp)个通信回合。 k是允许设置不规则分区的常数整数。两种算法都需要O(r垂直条Sigma垂直条/ p)预处理时间,其中垂直条Sigma垂直条是字母的长度。进行的实验结果表明与理论预测吻合良好。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号