...
首页> 外文期刊>BMC Bioinformatics >Comparison of lists of genes based on functional profiles
【24h】

Comparison of lists of genes based on functional profiles

机译:基于功能概况的基因列表比较

获取原文

摘要

Background How to compare studies on the basis of their biological significance is a problem of central importance in high-throughput genomics. Many methods for performing such comparisons are based on the information in databases of functional annotation, such as those that form the Gene Ontology (GO). Typically, they consist of analyzing gene annotation frequencies in some pre-specified GO classes, in a class-by-class way, followed by p-value adjustment for multiple testing. Enrichment analysis, where a list of genes is compared against a wider universe of genes, is the most common example. Results A new global testing procedure and a method incorporating it are presented. Instead of testing separately for each GO class, a single global test for all classes under consideration is performed. The test is based on the distance between the functional profiles, defined as the joint frequencies of annotation in a given set of GO classes. These classes may be chosen at one or more GO levels. The new global test is more powerful and accurate with respect to type I errors than the usual class-by-class approach. When applied to some real datasets, the results suggest that the method may also provide useful information that complements the tests performed using a class-by-class approach if gene counts are sparse in some classes. An R library, goProfiles, implements these methods and is available from Bioconductor, http://bioconductor.org/packages/release/bioc/html/goProfiles.html. webcite Conclusions The method provides an inferential basis for deciding whether two lists are functionally different. For global comparisons it is preferable to the global chi-square test of homogeneity. Furthermore, it may provide additional information if used in conjunction with class-by-class methods.
机译:背景技术如何基于生物学意义比较研究是高通量基因组学中至关重要的问题。进行此类比较的许多方法都是基于功能注释数据库中的信息,例如构成基因本体论(GO)的那些信息。通常,它们包括以逐类的方式分析某些预先指定的GO类中的基因注释频率,然后进行p值调整以进行多次测试。最常见的例子是富集分析,其中将一个基因列表与更广泛的基因范围进行比较。结果提出了一种新的全局测试程序及其方法。代替对每个GO类进行单独测试,将对所考虑的所有类执行单个全局测试。该测试基于功能配置文件之间的距离,该距离定义为一组给定的GO类中注释的联合频率。可以在一个或多个GO级别上选择这些类别。新的全局测试在I类错误方面比通常的逐类方法更强大,更准确。当将其应用于某些真实数据集时,如果某些类别中的基因计数稀疏,则该方法还可以提供有用的信息,以补充使用逐类方法进行的测试。 R库goProfiles实现了这些方法,可从Bioconductor(http://bioconductor.org/packages/release/bioc/html/goProfiles.html)获得。结论该方法为确定两个列表在功能上是否不同提供了推论依据。为了进行全局比较,最好进行均匀性的全局卡方检验。此外,如果与逐类方法结合使用,它可能会提供其他信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号