首页> 美国卫生研究院文献>Nucleic Acids Research >Query-seeded iterative sequence similarity searching improves selectivity 5–20-fold
【2h】

Query-seeded iterative sequence similarity searching improves selectivity 5–20-fold

机译:播种查询的迭代序列相似性搜索将选择性提高了5-20倍

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Iterative similarity search programs, like psiblast, jackhmmer, and psisearch, are much more sensitive than pairwise similarity search methods like blast and ssearch because they build a position specific scoring model (a PSSM or HMM) that captures the pattern of sequence conservation characteristic to a protein family. But models are subject to contamination; once an unrelated sequence has been added to the model, homologs of the unrelated sequence will also produce high scores, and the model can diverge from the original protein family. Examination of alignment errors during psiblast PSSM contamination suggested a simple strategy for dramatically reducing PSSM contamination. psiblast PSSMs are built from the query-based multiple sequence alignment (MSA) implied by the pairwise alignments between the query model (PSSM, HMM) and the subject sequences in the library. When the original query sequence residues are inserted into gapped positions in the aligned subject sequence, the resulting PSSM rarely produces alignment over-extensions or alignments to unrelated sequences. This simple step, which tends to anchor the PSSM to the original query sequence and slightly increase target percent identity, can reduce the frequency of false-positive alignments more than 20-fold compared with psiblast and jackhmmer, with little loss in search sensitivity.
机译: psiblast jackhmmer psisearch 之类的迭代相似性搜索程序比像 blast 这样的成对相似性搜索方法要敏感得多。 kbd>和 ssearch 是因为它们建立了位置特定的评分模型(PSSM或HMM),该模型捕获了蛋白质家族的序列保守特征模式。但是模型容易受到污染。一旦将不相关的序列添加到模型中,不相关序列的同源物也将产生高分,并且该模型可能与原始蛋白质家族有所不同。检查 psiblast PSSM污染期间的对准错误提示了一种显着减少PSSM污染的简单策略。 psiblast PSSM是由基于查询的多序列比对(MSA)构建的,该序列由查询模型(PSSM,HMM)和库中主题序列之间的成对比对暗示。当将原始查询序列残基插入比对的受试者序列的缺口位置时,所得的PSSM很少产生比对过度延伸或与无关序列的比对。与 psiblast 相比,此简单步骤倾向于将PSSM锚定到原始查询序列并稍微提高目标同一性百分比,可以将假阳性比对的频率降低20倍以上。千斤顶,搜索灵敏度几乎没有损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号