首页> 美国卫生研究院文献>Journal of Computational Biology >A Coverage Criterion for Spaced Seeds and Its Applications to Support Vector Machine String Kernels and k-Mer Distances
【2h】

A Coverage Criterion for Spaced Seeds and Its Applications to Support Vector Machine String Kernels and k-Mer Distances

机译:间隔种子的覆盖标准及其在支持向量机字符串核和k-Mer距离中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Spaced seeds have been recently shown to not only detect more alignments, but also to give a more accurate measure of phylogenetic distances, and to provide a lower misclassification rate when used with Support Vector Machines (SVMs). We confirm by independent experiments these two results, and propose in this article to use a coverage criterion to measure the seed efficiency in both cases in order to design better seed patterns. We show first how this coverage criterion can be directly measured by a full automaton-based approach. We then illustrate how this criterion performs when compared with two other criteria frequently used, namely the single-hit and multiple-hit criteria, through correlation coefficients with the correct classification/the true distance. At the end, for alignment-free distances, we propose an extension by adopting the coverage criterion, show how it performs, and indicate how it can be efficiently computed.
机译:>最近显示,间隔种子不仅可以检测到更多的比对,而且可以提供更精确的系统发育距离度量,并且与支持向量机(SVM)结合使用时,可以提供较低的误分类率。我们通过独立的实验确认了这两个结果,并在本文中建议使用覆盖率标准来测量两种情况下的种子效率,以设计更好的种子模式。我们首先展示如何通过基于全自动机的方法直接测量该覆盖标准。然后,我们通过具有正确分类/真实距离的相关系数,说明了该标准与经常使用的其他两个标准(即单击和多击标准)相比时的性能。最后,对于无对准距离,我们建议采用覆盖率准则进行扩展,展示其性能,并指出如何有效地计算它。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号