
Protein sequence-similarity search acceleration using a heuristic algorithm with a sensitive matrix

Abstract

Protein database search for public databases is a fundamental step in the target selection of proteins in structural and functional genomics, and also in inferring protein structure, function, and evolution. Most database search methods employ amino acid substitution matrices to score amino acid pairs, and the choice of substitution matrix strongly affects homology detection performance. We earlier proposed a substitution matrix named MIQS that was optimized for distant protein homology search. Herein we further evaluate MIQS in combination with LAST, a fast heuristic database search tool with a tunable sensitivity parameter m, where larger m denotes higher sensitivity. Results show that MIQS substantially improves the homology detection and alignment quality performance of LAST across diverse values of m. Against a protein database consisting of approximately 15 million sequences, LAST with m = 10^5 achieves better homology detection performance than BLASTP and completes the search 20 times faster. Compared to the most sensitive methods in use today, CS-BLAST and SSEARCH, LAST with MIQS and m = 10^6 shows comparable homology detection performance at 2.0 and 3.9 times greater speed, respectively. Results demonstrate that MIQS-powered LAST is a time-efficient method for sensitive and accurate homology search.

Electronic supplementary material: The online version of this article (doi:10.1007/s10969-016-9210-4) contains supplementary material, which is available to authorized users.
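To illustrate what it means for a substitution matrix to "score amino acid pairs", the following is a minimal Python sketch that sums per-position scores over an ungapped alignment. The names `TOY_SCORES`, `pair_score`, and `ungapped_score` are hypothetical, and the score values are made up for illustration; they are not taken from MIQS or any published matrix. The sketch also does not reflect how LAST, BLASTP, CS-BLAST, or SSEARCH are implemented.

```python
# Toy substitution scores for a handful of residue pairs.
# ASSUMPTION: these values are invented for illustration only;
# they are NOT MIQS scores and not from any published matrix.
TOY_SCORES = {
    ("A", "A"): 4,
    ("L", "L"): 4,
    ("K", "K"): 5,
    ("L", "I"): 2,   # chemically similar residues: mild positive score
    ("K", "R"): 2,
    ("A", "K"): -1,  # dissimilar residues: negative score
}


def pair_score(a, b, default=-2):
    """Return the symmetric score for residues a and b.

    Pairs missing from the toy table fall back to `default`.
    """
    if (a, b) in TOY_SCORES:
        return TOY_SCORES[(a, b)]
    return TOY_SCORES.get((b, a), default)


def ungapped_score(seq1, seq2):
    """Sum pair scores position by position for two equal-length sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("ungapped scoring needs sequences of equal length")
    return sum(pair_score(a, b) for a, b in zip(seq1, seq2))


if __name__ == "__main__":
    # Identical sequences versus a pair with two conservative substitutions.
    print(ungapped_score("ALK", "ALK"))
    print(ungapped_score("ALK", "AIR"))
```

A matrix tuned for distant homologs would, in this picture, assign scores that still reward conservative substitutions such as the L/I and K/R pairs above, so that diverged but related sequences keep a positive total.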