首页> 美国卫生研究院文献>Bioinformatics >Locality-sensitive hashing for the edit distance
【2h】

Locality-sensitive hashing for the edit distance

机译:编辑距离的局部敏感哈希

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

MotivationSequence alignment is a central operation in bioinformatics pipeline and, despite many improvements, remains a computationally challenging problem. Locality-sensitive hashing (LSH) is one method used to estimate the likelihood of two sequences to have a proper alignment. Using an LSH, it is possible to separate, with high probability and relatively low computation, the pairs of sequences that do not have high-quality alignment from those that may. Therefore, an LSH reduces the overall computational requirement while not introducing many false negatives (i.e. omitting to report a valid alignment). However, current LSH methods treat sequences as a bag of k-mers and do not take into account the relative ordering of k-mers in sequences. In addition, due to the lack of a practical LSH method for edit distance, in practice, LSH methods for Jaccard similarity or Hamming similarity are used as a proxy.
机译:MotivationSequence序列比对是生物信息学流水线中的核心操作,尽管有许多改进,但仍然是计算难题。局部敏感哈希(LSH)是一种用于估计两个序列具有适当比对的可能性的方法。使用LSH,可以以较高的概率和较低的计算将不具有高质量比对的序列对与可能的序列对分开。因此,LSH减少了总体计算需求,同时不引入许多假阴性(即省略报告有效的比对)。然而,当前的LSH方法将序列视为k聚体的袋,并且没有考虑序列中k聚体的相对顺序。另外,由于缺少用于编辑距离的实用LSH方法,因此在实践中,将Jaccard相似性或Hamming相似性的LSH方法用作代理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号