...
首页> 外文期刊>Bioinformatics >Simple is beautiful: a straightforward approach to improve the delineation of true and false positives in PSI-BLAST searches
【24h】

Simple is beautiful: a straightforward approach to improve the delineation of true and false positives in PSI-BLAST searches

机译:简单就是美丽:在PSI-BLAST搜索中改善真假阳性描述的简单方法

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: The deluge of biological information from different genomic initiatives and the rapid advancement in biotechnologies have made bioinformatics tools an integral part of modern biology. Among the widely used sequence alignment tools, BLAST and PSI-BLAST are arguably the most popular. PSI-BLAST, which uses an iterative profile position specific score matrix (PSSM)-based search strategy, is more sensitive than BLAST in detecting weak homologies, thus making it suitable for remote homolog detection. Many refinements have been made to improve PSI-BLAST, and its computational efficiency and high specificity have been much touted. Nevertheless, corruption of its profile via the incorporation of false positive sequences remains a major challenge. Results: We have developed a simple and elegant approach to resolve the problem of model corruption in PSI-BLAST searches. We hypothesized that combining results from the first (least-corrupted) profile with results from later (most sensitive) iterations of PSI-BLAST provides a better discriminator for true and false hits. Accordingly, we have derived a formula that utilizes the E-values from these two PSI-BLAST iterations to obtain a figure of merit for rank-ordering the hits. Our verification results based on a gold-standard test set indicate that this figure of merit does indeed delineate true positives from false positives better than PSI-BLAST E-values. Perhaps what is most notable about this strategy is that it is simple and straightforward to implement.
机译:动机:来自不同基因组计划的大量生物信息以及生物技术的飞速发展,已使生物信息学工具成为现代生物学不可或缺的一部分。在广泛使用的序列比对工具中,BLAST和PSI-BLAST可以说是最受欢迎的工具。 PSI-BLAST使用基于迭代轮廓位置特定评分矩阵(PSSM)的搜索策略,在检测弱同源性方面比BLAST更为敏感,因此适合远程同源检测。为了改进PSI-BLAST,已经进行了许多改进,并且其计算效率和高特异性被吹捧。然而,通过掺入假阳性序列来破坏其概况仍然是一个重大挑战。结果:我们开发了一种简单而优雅的方法来解决PSI-BLAST搜索中的模型损坏问题。我们假设,将PSI-BLAST的第一个(最不损坏)配置文件的结果与后来的(最敏感的)迭代结果相结合,可以更好地区分真假匹配。因此,我们得出了一个公式,该公式利用了这两个PSI-BLAST迭代的E值来获得用于对命中进行排序的品质因数。我们基于金标准测试集的验证结果表明,与PSI-BLAST E值相比,此品质因数确实确实将真实阳性与假阳性区分开来。也许这种策略最值得注意的是它的实施简单明了。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号