首页> 外文会议>International Colloquium on Automata, Languages and Programming >Permutation Editing and Matching via Embeddings
【24h】

Permutation Editing and Matching via Embeddings

机译:通过嵌入式置换和匹配匹配

获取原文

摘要

If the genetic maps of two species are modelled as permutations of (homologous) genes, the number of chromosomal rearrangements in the form of deletions, block moves, inversions etc. to transform one such permutation to another can be used as a measure of their evolutionary distance. Motivated by such scenarios, we study problems of computing distances between permutations as well as matching permutations in sequences, and finding most similar permutation from a collection (“nearest neighbor”). We adopt a general approach: embed permutation distances of relevance into well-known vector spaces in an approximately distance-preserving manner, and solve the resulting problems on the well-known spaces. Our results are as follows: ·We present the first known approximately distance preserving embeddings of these permutation distances into well-known spaces. ·Using these embeddings, we obtain several results, including the first known efficient solution for approximately solving nearest neighbor problems with permutations and the first known algorithms for finding permutation distances in the "data stream" model. ·We consider a novel class of problems called permutation matching problems which are similar to string matching problems, except that the pattern is a permutation (rather than a string) and present linear or near-linear time algorithms for approximately solving permutation matching problems; in contrast, the corresponding string problems take significantly longer.
机译:如果两个物种的遗传贴图被建模为(同源)基因的置换,则缺失形式的染色体重排的数量,块移动,逆转等的数量可以用作其进化的量度距离。通过这种情况激励,我们研究了在排列之间计算距离以及序列中的匹配偏移的问题,并找到来自集合(“最近邻居”)的最相似的置换。我们采用一般方法:以近似远程保持方式嵌入与众所周知的向量空间中的相关性的置换距离,并解决众所周知的空间上产生的问题。我们的结果如下:·我们介绍了这些置换距离的第一个已知的大约距离将这些置换距离的嵌入距离为众所周知的空间。 ·使用这些嵌入式,我们获得了几种结果,包括用于近似求解最近的邻居问题的第一已知高效解决方案,以及用于在“数据流”模型中查找置换距离的第一已知算法。 ·我们考虑一种名为置换匹配问题的新型问题,这些问题类似于字符串匹配问题,除了模式是置换(而不是字符串),并且呈现用于近似求解匹配问题的线性或近线性时间算法;相比之下,相应的字符串问题需要更长时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号