首页> 外文会议>IEEE International Conference on Bioinformatics and Bioengineering >An Improved Ratio-Based (IRB) Batch Effects Removal Algorithm for Cancer Data in a Co-Analysis Framework
【24h】

An Improved Ratio-Based (IRB) Batch Effects Removal Algorithm for Cancer Data in a Co-Analysis Framework

机译:协同分析框架中基于改进的基于比率(IRB)的批次效应去除算法

获取原文

摘要

Ratio-based algorithms are proven to be effective methods for removing batch effects that exist among micro array expression data from different data sources. They are outperforming than other methods in the enhancement of cross-batch prediction, especially for cancer data sets. However, their overall power is limited by: (1) Not every batch has control samples. The original method uses all negative samples to calculate the subtrahend. (2) Micro array experimental data may not have clear labels, especially in the prediction application, the labels of test data set are unknown. In this paper, we propose an Improved Ratio-Based (IRB) method to relieve these two constraints for cross-batch prediction applications. For each batch in a single study, we select one reference sample based on the idea of aligning probability density functions (pdfs) of each gene in different batches. Moreover, for data sets without label information, we transfer the problem of finding reference sample to the dense sub graph problem in graph theory. Our newly-proposed IRB method is straightforward and efficient, and can be extended for integrating large volume micro array data sets. The experiments show that our method is stable and has high performance in tumoron-tumor prediction.
机译:事实证明,基于比率的算法是消除来自不同数据源的微阵列表达数据中存在的批量效应的有效方法。在增强跨批次预测方面,它们的表现优于其他方法,尤其是对于癌症数据集而言。但是,它们的整体能力受到以下因素的限制:(1)并非每个批次都有对照样品。原始方法使用所有负样本来计算次差。 (2)微阵列实验数据可能没有清晰的标签,尤其是在预测应用中,测试数据集的标签是未知的。在本文中,我们提出了一种基于比率的改进(IRB)方法,以缓解跨批次预测应用中的这两个约束。对于单个研究中的每个批次,我们根据对齐不同批次中每个基因的概率密度函数(pdfs)的想法选择一个参考样本。此外,对于没有标签信息的数据集,我们将在图论中将寻找参考样本的问题转移到稠密子图问题上。我们新提出的IRB方法简单有效,可以扩展为集成大量微阵列数据集。实验表明,我们的方法是稳定的,在肿瘤/非肿瘤预测中具有较高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号