首页> 美国卫生研究院文献>other >AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets
【2h】

AICM: A Genuine Framework for Correcting Inconsistency Between Large Pharmacogenomics Datasets

机译:AICM:纠正大型药物基因组学数据集之间不一致的真实框架

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The inconsistency of open pharmacogenomics datasets produced by different studies limits the usage of such datasets in many tasks, such as biomarker discovery. Investigation of multiple pharmacogenomics datasets confirmed that the pairwise sensitivity data correlation between drugs, or rows, across different studies (drug-wise) is relatively low, while the pairwise sensitivity data correlation between cell-lines, or columns, across different studies (cell-wise) is considerably strong. This common interesting observation across multiple pharmacogenomics datasets suggests the existence of subtle consistency among the different studies (i.e., strong cell-wise correlation). However, significant noises are also shown (i.e., weak drug-wise correlation) and have prevented researchers from comfortably using the data directly. Motivated by this observation, we propose a novel framework for addressing the inconsistency between large-scale pharmacogenomics data sets. Our method can significantly boost the drug-wise correlation and can be easily applied to re-summarized and normalized datasets proposed by others. We also investigate our algorithm based on many different criteria to demonstrate that the corrected datasets are not only consistent, but also biologically meaningful. Eventually, we propose to extend our main algorithm into a framework, so that in the future when more datasets become publicly available, our framework can hopefully offer a “ground-truth” guidance for references.
机译:不同研究产生的开放药物基因组学数据集的不一致性限制了此类数据集在许多任务(例如生物标记物发现)中的使用。对多个药物基因组学数据集的研究证实,不同研究(药物研究)之间的药物或行之间的成对敏感性数据相关性相对较低,而不同研究(细胞研究)之间的细胞系或列之间的成对敏感性数据相关性相对较低。明智地)。跨多个药物基因组学数据集的这一共同有趣观察表明,不同研究之间存在微妙的一致性(即强的细胞相关性)。但是,还会显示出很大的噪音(即药物之间的相关性较弱),从而使研究人员无法舒适地直接使用这些数据。基于这种观察,我们提出了一个新颖的框架来解决大规模药物基因组学数据集之间的矛盾。我们的方法可以显着增强药物方面的相关性,并且可以轻松地应用于其他人提出的重新汇总和归一化的数据集。我们还基于许多不同的标准研究了我们的算法,以证明校正后的数据集不仅一致,而且具有生物学意义。最终,我们建议将主要算法扩展到框架中,以便将来有更多数据集公开可用时,我们的框架有望为参考提供“真实的”指导。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号