首页> 外文期刊>Bioinformatics >Gene set analysis for self-contained tests: complex null and specific alternative hypotheses
【24h】

Gene set analysis for self-contained tests: complex null and specific alternative hypotheses

机译:独立测试的基因集分析:复杂的无效假设和特定的替代假设

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: The analysis of differentially expressed gene sets became a routine in the analyses of gene expression data. There is a multitude of tests available, ranging from aggregation tests that summarize gene-level statistics for a gene set to true multivariate tests, accounting for intergene correlations. Most of them detect complex departures from the null hypothesis but when the null hypothesis is rejected, the specific alternative leading to the rejection is not easily identifiable. Results: In this article we compare the power and Type I error rates of minimum-spanning tree (MST)-based non-parametric multivariate tests with several multivariate and aggregation tests, which are frequently used for pathway analyses. In our simulation study, we demonstrate that MST-based tests have power that is for many settings comparable with the power of conventional approaches, but outperform them in specific regions of the parameter space corresponding to biologically relevant configurations. Further, we find for simulated and for gene expression data that MST-based tests discriminate well against shift and scale alternatives. As a general result, we suggest a two-step practical analysis strategy that may increase the interpretability of experimental data: first, apply the most powerful multivariate test to find the subset of pathways for which the null hypothesis is rejected and second, apply MST-based tests to these pathways to select those that support specific alternative hypotheses.
机译:动机:差异表达基因集的分析已成为分析基因表达数据的常规方法。可以使用多种测试,从汇总基因集基因水平统计的聚合测试到真正的多变量测试(考虑基因间相关性)。他们中的大多数人都从零假设中发现了复杂的背离,但是当零假设被拒绝时,导致拒绝的特定选择就不容易确定了。结果:在本文中,我们将基于最小生成树(MST)的非参数多元变量测试与几种经常用于路径分析的多元变量和聚集测试的功效和I类错误率进行了比较。在我们的仿真研究中,我们证明了基于MST的测试在许多设置下的功能都可以与传统方法的功能相媲美,但是在与生物学相关配置相对应的参数空间的特定区域中却胜过它们。此外,我们发现对于基于模拟和基因表达的数据,基于MST的测试可以很好地区分移位和规模替代方案。总的来说,我们建议采取两步实用的分析策略,以提高实验数据的可解释性:首先,应用功能最强大的多元检验来找到无效假设被拒绝的途径子集;其次,应用MST-基于对这些途径的检验,以选择那些支持特定替代假设的检验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号