首页> 外文期刊>Frontiers in Genetics >A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
【24h】

A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis

机译:一种新颖的联合基因集分析框架可改善交叉疾病转录组学分析中丰富途径的鉴定

获取原文
           

摘要

Motivation: Gene set enrichment analysis is a widely accepted expression analysis tool which aims at detecting coordinated expression change within a pre-defined gene sets rather than individual genes. The benefit of gene set analysis over individual differentially expressed (DE) gene analysis includes more reproducible and interpretable results and detecting small but consistent change among gene set which could not be detected by DE gene analysis. There have been many successful gene set analysis applications in human diseases. However, when the sample size of a disease study is small and no other public data sets of the same disease are available, it will lead to lack of power to detect pathways of importance to the disease. Results: We have developed a novel joint gene set analysis statistical framework which aims at improving the power of identifying enriched gene sets through integrating multiple similar disease data sets. Through comprehensive simulation studies, we demonstrated that our proposed frameworks obtained much better AUC scores than single data set analysis and another meta-analysis method in identification of enriched pathways. When applied to two real data sets, the proposed framework could retain the enriched gene sets identified by single data set analysis and exclusively obtained up to 200% more disease-related gene sets demonstrating the improved identification power through information shared between similar diseases. We expect that the proposed framework would enable researchers to better explore public data sets when the sample size of their study is limited.
机译:动机:基因集富集分析是一种广泛接受的表达分析工具,旨在检测预定义基因集中而不是单个基因中的协同表达变化。基因组分析相对于个别差异表达(DE)基因分析的好处包括可重现和可解释的结果,以及检测基因组之间微小但一致的变化,这些变化是DE基因分析无法检测到的。在人类疾病中已经有许多成功的基因组分析应用。但是,如果疾病研究的样本量很小,并且没有相同疾病的其他公共数据集,则会导致缺乏检测对该疾病重要途径的能力。结果:我们开发了一种新颖的联合基因集分析统计框架,旨在通过整合多个相似疾病数据集来提高鉴定丰富基因集的能力。通过全面的模拟研究,我们证明了我们提出的框架在识别富集途径方面比单一数据集分析和另一种荟萃分析方法获得了更好的AUC分数。当应用于两个真实数据集时,提出的框架可以保留通过单个数据集分析确定的富集基因集,并独家获得多达200%的与疾病相关的基因集,从而通过相似疾病之间共享的信息证明了增强的鉴定能力。我们希望所提出的框架能够使研究人员在研究样本量有限的情况下更好地探索公共数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号