首页> 外文会议>Asia Pacific Bioinformatics Conference >Prediction of drug-disease associations based on ensemble meta paths and singular value decomposition
【24h】

Prediction of drug-disease associations based on ensemble meta paths and singular value decomposition

机译:基于集合元路径和奇异值分解的毒性疾病关联预测

获取原文

摘要

Background: In the field of drug repositioning, it is assumed that similar drugs may treat similar diseases, therefore many existing computational methods need to compute the similarities of drugs and diseases. However, the calculation of similarity depends on the adopted measure and the available features, which may lead that the similarity scores vary dramatically from one to another, and it will not work when facing the incomplete data. Besides, supervised learning based methods usually need bothpositive and negative samples to train the prediction models, whereas in drug-disease pairs data there are only some verified interactions (positive samples) and a lot of unlabeled pairs. To train the models, many methods simply treat the unlabeled samples as negative ones, which may introduce artificial noises. Herein, we propose a method to predict drug-disease associations without the need of similarity information, and select more likely negative samples.Results: In the proposed EMP-SVD (Ensemble Meta Paths and Singular Value Decomposition), we introduce five meta paths corresponding to different kinds of interaction data, and for each meta path we generate a commuting matrix. Every matrix is factorizedinto two low rank matrices by SVD which are used for the latent features of drugs and diseases respectively. The features are combined to represent drug-disease pairs. We build a base classifier via Random Forest for each meta path and five base classifiers are combined as the final ensemble classifier. In order to train out a more reliable prediction model, we select more likely negative ones from unlabeled samples under the assumption that non-associated drug and disease pair have no common interactedproteins. The experiments have shown that the proposed EMP-SVD method outperforms several state-of-the-art approaches. Case studies by literature investigation have found that the proposed EMP-SVD can mine out many drug-disease associations, which implies the practicality of EMP-SVD.Conclusions: The proposed EMP-SVD can integrate the interaction data among drugs, proteins and diseases, and predict the drug-disease associations without the need of similarity information. At the same time, the strategy of selecting more reliable negative samples will benefit the prediction.
机译:背景:在药物重新定位的领域中,假设类似的药物可能治疗类似的疾病,因此许多现有的计算方法需要计算药物和疾病的相似之处。然而,相似性的计算取决于所采用的措施和可用功能,这可能导致相似度得分从一个到另一个的相似性差异,并且在面对不完整的数据时它将无法工作。此外,受监管的基于学习的方法通常需要兼容性和负样本来训练预测模型,而在药物疾病对数据中只有一些验证的相互作用(正样品)和大量未标记的对。为了培训模型,许多方法只是将未标记的样本视为消极的样本,这可能引入人造噪声。在此,我们提出了一种方法来预测药物疾病关联,而不需要相似性信息,选择更可能的负样本。结果:在提议的EMP-SVD(集合元路径和奇异值分解)中,我们介绍了五个相应的元路径对于不同种类的交互数据,以及我们生成通勤矩阵的每个元路径。每个矩阵都是将SVD分解两种低等级矩阵,其分别用于药物和疾病的潜在特征。该特征组合以代表药物疾病对。我们通过随机林为每个元路径的随机林构建基本分类器,并将五个基本分类器组合为最终的集合分类器。为了培训更可靠的预测模型,我们在假设非相关药物和疾病对不具有常见的互动蛋白的假设下选择更有可能的负面样品。实验表明,所提出的EMP-SVD方法优于几种最先进的方法。文学调查的案例研究发现,拟议的EMP-SVD可以挖掘许多毒品疾病关联,这意味着EMP-SVD的实用性。结论:拟议的EMP-SVD可以整合药物,蛋白质和疾病之间的相互作用数据,并预测药物疾病关联而不需要相似信息。同时,选择更可靠的负样本的策略将受益于预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号