首页> 外文会议>IEEE International Conference on Bioinformatics and Bioengineering >An Improved Ratio-Based (IRB) Batch Effects Removal Algorithm for Cancer Data in a Co-Analysis Framework
【24h】

An Improved Ratio-Based (IRB) Batch Effects Removal Algorithm for Cancer Data in a Co-Analysis Framework

机译:基于比率的(IRB)批量效应在共同分析框架中的癌症数据去除算法

获取原文

摘要

Ratio-based algorithms are proven to be effective methods for removing batch effects that exist among micro array expression data from different data sources. They are outperforming than other methods in the enhancement of cross-batch prediction, especially for cancer data sets. However, their overall power is limited by: (1) Not every batch has control samples. The original method uses all negative samples to calculate the subtrahend. (2) Micro array experimental data may not have clear labels, especially in the prediction application, the labels of test data set are unknown. In this paper, we propose an Improved Ratio-Based (IRB) method to relieve these two constraints for cross-batch prediction applications. For each batch in a single study, we select one reference sample based on the idea of aligning probability density functions (pdfs) of each gene in different batches. Moreover, for data sets without label information, we transfer the problem of finding reference sample to the dense sub graph problem in graph theory. Our newly-proposed IRB method is straightforward and efficient, and can be extended for integrating large volume micro array data sets. The experiments show that our method is stable and has high performance in tumor/non-tumor prediction.
机译:被证明的基于比率的算法是用于去除来自不同数据源的微阵列表达数据中存在的批量效应的有效方法。它们比其他方法表现优于增强交叉批量预测,特别是对于癌症数据集。但是,它们的整体功率受到限制:(1)并非每批都有控制样本。原始方法使用所有否定样本来计算子系统。 (2)微阵列实验数据可能没有明确的标签,特别是在预测应用中,测试数据集的标签未知。在本文中,我们提出了一种改进的基于比率(IRB)方法来缓解这两个约束对交叉批量预测应用。对于单一研究中的每批批次,我们根据不同批次中每个基因的概率密度函数(PDF)的想法选择一个参考样本。此外,对于没有标签信息的数据集,我们将参考样本的问题传送到图表理论中的密集子图问题。我们的新建IRB方法是简单和有效的,可以扩展为集成大卷微阵列数据集。实验表明,我们的方法是稳定的并且在肿瘤/非肿瘤预测方面具有高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号