False Discovery Rate for Homology Searches

机译：同源搜索的错误发现率

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While many different aspects of retrieval algorithms (e.g., BLAST) have been studied in depth, the method for determining the retrieval threshold has not enjoyed the same attention. Furthermore, with genetic databases growing rapidly, the challenges of multiple testing are escalating. In order to improve search sensitivity, we propose the use of the false discovery rate (FDR) as the method to control the number of irrelevant ("false positive") sequences. In this paper, we introduce BLAST_(FDR), an extended version of BLAST that uses a FDR method for the threshold criterion. We evaluated five different multiple testing methods on a large training database and chose the best performing one, Benjamini-Hochberg, as the default for BLAST_(FDR). BLAST_(FDR) achieves 14.1% better retrieval performance than BLAST on a large (5,161 queries) test database and 26.8% better retrieval score for queries belonging to small superfamilies. Furthermore, BLAST_(FDR) retrieved only 0.27 irrelevant sequences per query compared to 7.44 for BLAST.

机译：虽然已经对检索算法（例如，BLAST）的许多不同方面进行了深入研究，但是用于确定检索阈值的方法并未受到同样的关注。此外，随着基因数据库的迅速发展，多重测试的挑战正在升级。为了提高搜索灵敏度，我们建议使用错误发现率（FDR）作为控制无关（“错误肯定”）序列数量的方法。在本文中，我们介绍了BLAST_（FDR），它是BLAST的扩展版本，它使用FDR方法作为阈值标准。我们在大型培训数据库上评估了五种不同的多种测试方法，并选择了性能最佳的Benjamini-Hochberg作为BLAST_（FDR）的默认方法。在大型（5,161个查询）测试数据库上，BLAST_（FDR）的检索性能比BLAST高14.1％，对于小型超家族的查询，其检索得分高26.8％。此外，相比于BLAST的7.44，BLAST_（FDR）每次查询仅检索到0.27个无关序列。

著录项

来源
《Brazilian symposium on bioinformatics》|2013年|194-201|共8页
会议地点
作者
Hyrum D. Carroll; Alex C. Williams; Anthony G. Davis; John L. Spouge;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Improving Retrieval Efficacy of Homology Searches Using the False Discovery Rate [J] . Carroll Hyrum D., Williams Alex C., Davis Anthony G., Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2015,第3期

机译：使用错误发现率提高同质检索的检索效率
2. Partially sequenced organisms, decoy searches and false discovery rates [J] . Victor B., Gabri?l S., Kanobana K., Journal of proteome research . 2012,第3期

机译：部分测序的生物，诱饵搜索和错误发现率
3. Nonlinear fitting method for determining local false discovery rates from decoy database searches [J] . Tang WH, Shilov IV, Seymour SL Journal of proteome research . 2008,第9期

机译：从诱饵数据库搜索确定局部错误发现率的非线性拟合方法
4. False Discovery Rate for Homology Searches [C] . Hyrum D. Carroll, Alex C. Williams, Anthony G. Davis, Brazilian Symposium on Bioinformatics . 2013

机译：虚假发现率为同源性搜索
5. False discovery rates when the statistics are discrete. [D] . Dialsingh, Isaac. 2012

机译：统计数据离散时的错误发现率。
6. Improving Retrieval Efficacy of Homology Searches using the False Discovery Rate [O] . Hyrum D. Carroll, Alex C. Williams, Anthony G. Davis, -1

机译：使用错误发现率提高同质检索的检索效率
7. Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches [O] . Matsuda, Fumio, Shinbo, Yoko, Oikawa, Akira, 2009

机译：代谢组注释质量评估：一种评估元素组成搜索错误发现率的方法

False Discovery Rate for Homology Searches

摘要

著录项

相似文献

相关主题

期刊订阅