首页> 美国卫生研究院文献>BMC Bioinformatics >microTaboo: a general and practical solution to the k-disjoint problem
【2h】

microTaboo: a general and practical solution to the k-disjoint problem

机译:microTaboo:k不相交问题的通用且实用的解决方案

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundA common challenge in bioinformatics is to identify short sub-sequences that are unique in a set of genomes or reference sequences, which can efficiently be achieved by k-mer (k consecutive nucleotides) counting. However, there are several areas that would benefit from a more stringent definition of “unique”, requiring that these sub-sequences of length W differ by more than k mismatches (i.e. a Hamming distance greater than k) from any other sub-sequence, which we term the k-disjoint problem. Examples include finding sequences unique to a pathogen for probe-based infection diagnostics; reducing off-target hits for re-sequencing or genome editing; detecting sequence (e.g. phage or viral) insertions; and multiple substitution mutations. Since both sensitivity and specificity are critical, an exhaustive, yet efficient solution is desirable.
机译:背景技术生物信息学中的一个共同挑战是要鉴定一组基因组或参考序列中独特的短亚序列,这可以通过k-mer(k个连续核苷酸)计数有效地实现。但是,有几个领域可以从更严格的“唯一”定义中受益,要求这些长度为W的子序列与任何其他子序列相差超过k个不匹配(即汉明距离大于k),我们称其为k不相交的问题。实例包括发现病原体特有的序列以进行基于探针的感染诊断;减少脱靶命中,以进行重新测序或基因组编辑;检测序列(例如噬菌体或病毒)插入;和多个替换突变。由于敏感性和特异性都至关重要,因此需要一种详尽而有效的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号