首页> 外文会议>International Conference on Practical Applications of Computational Biology Bioinformatics >Reproducibility of Finding Enriched Gene Sets in Biological Data Analysis
【24h】

Reproducibility of Finding Enriched Gene Sets in Biological Data Analysis

机译:在生物数据分析中寻找富集基因集的再现性

获取原文

摘要

Introducing the high-throughput measurement methods into molecular' biology was a trigger to develop the algorithms for searching disorders in complex signalling systems, like pathways or gene ontologies. In recent year's, there appeared many new solutions, but the results obtained with these techniques are ambiguous. In this work, five different algorithms for pathway enrichment analysis were compared using six microarray datasets covering cases with the same disease. The number of enriched pathways at different significance level and false positive rate of finding enrichment pathways was estimated, and reproducibility of obtained results between datasets was checked. The best performance was obtained for PLAGE method. However, taking into consideration the biological knowledge about analyzed disease condition, many findings may be false positives. Out of the other methods GSVA algorithm gave the most reproducible results across tested datasets, which was also validated in biological repositories. Similarly, good outcomes were given by GSEA method. ORA and PADOG gave poor sensitivity and reproducibility, which stand in contrary to previous research.
机译:将高通量测量方法引入分子生物学中是一种触发器,用于开发用于在复杂信号系统中搜索疾病的算法,如途径或基因本体。近年来,出现了许多新的解决方案,但是通过这些技术获得的结果是模棱两可的。在这项工作中,使用覆盖具有同样疾病的案例的六个微阵列数据集进行比较五种不同的途径富集分析算法。估计了不同意义水平和发现富集途径的假阳性率的富集途径的数量,并检查了数据集之间获得的结果的再现性。利用PLAGE方法获得了最佳性能。然而,考虑到关于分析疾病状况的生物学知识,许多发现可能是误报。除了另外的方法中,GSVA算法在测试数据集中提供了最多可重复的结果,该结果也在生物存储库中验证。同样,GSEA法给出了良好的结果。奥拉和帕多克的敏感性差和再现性差,符合以前的研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号