首页> 外文会议>Pacific Symposium on Biocomputing >AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets
【24h】

AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets

机译:AICM:用于纠正大型药物代理数据集之间不一致的真正框架

获取原文

摘要

The inconsistency of open pharmacogenomics datasets produced by different studies limits the usage of such datasets in many tasks, such as biomarker discovery. Investigation of multiple pharmacogenomics datasets confirmed that the pairwise sensitivity data correlation between drugs, or rows, across different studies (drug-wise) is relatively low, while the pairwise sensitivity data correlation between cell-lines, or columns, across different studies (cell-wise) is considerably strong. This common interesting observation across multiple pharmacogenomics datasets suggests the existence of subtle consistency among the different studies (i.e., strong cell-wise correlation). However, significant noises are also shown (i.e., weak drug-wise correlation) and have prevented researchers from comfortably using the data directly. Motivated by this observation, we propose a novel framework for addressing the inconsistency between large-scale pharmacogenomics data sets. Our method can significantly boost the drug-wise correlation and can be easily applied to re-summarized and normalized datasets proposed by others. We also investigate our algorithm based on many different criteria to demonstrate that the corrected datasets are not only consistent, but also biologically meaningful. Eventually, we propose to extend our main algorithm into a framework, so that in the future when more datasets become publicly available, our framework can hopefully offer a "ground-truth" guidance for references.
机译:不同研究产生的开放药替昔代族学数据集的不一致限制了这种数据集的许多任务,例如生物标志物发现。对多种药物代理数据集的研究证实,药物或行之间的成对敏感性数据相关性在不同的研究(药物 - 明智)相对较低,而在不同研究中,细胞系或列之间的成对灵敏度数据相关性(细胞 - 明智的)很强大。在多种药物代理数据集中的这种常见的有趣观察表明,不同研究中的微妙一致性(即,强烈的细胞明智相关)存在微妙的一致性。然而,还显示出显着的噪声(即,弱药物相关的相关性),并阻止研究人员直接使用数据。通过这种观察,我们提出了一种新的框架,用于解决大规模药替氏菌族数据集之间的不一致性。我们的方法可以显着提高药物明智的相关性,并且可以容易地应用于其他人提出的重新汇总和标准化的数据集。我们还基于许多不同标准的算法调查了我们的算法,以证明待更正的数据集不仅是一致的,而且在生物学上有意义。最终,我们建议将我们的主要算法扩展到框架中,以便在未来在公开可用的数据集时,我们的框架可以为参考提供“地面真理”指导。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号