首页> 外文会议>IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology >Drug target interaction predictions using PU- Leaming under different experimental setting for four formulations namely known drug target pair prediction, drug prediction, target prediction and unknown drug target pair prediction
【24h】

Drug target interaction predictions using PU- Leaming under different experimental setting for four formulations namely known drug target pair prediction, drug prediction, target prediction and unknown drug target pair prediction

机译:使用PU-Leaming的药物靶靶相互作用预测四种制剂的不同实验设置,即已知的药物目标对预测,药物预测,靶预测和未知药物目标对预测

获取原文
获取外文期刊封面目录资料

摘要

Predicting new drug target interactions experimentally through wet lab experiments is time as well as resource intensive. In general, drug-target interaction prediction problem leads to drug discovery, drug repositioning and uncovers interesting patterns in chemogenomics research. Drug and target represent heterogeneous nodes within a network of interactions. Presence of an edge between the nodes indicates a positive interaction whereas an absence suggests an unknown interaction. Classification based machine learning algorithms are heavily applied in this area of research. Classification algorithms need positive as well as negative data to yield optimized results. The major problem in this field is lack of negative data because the data that are found in the public databases are positive interaction samples. Considering unknown drug target pairs as negative data may cause severe consequences for the prediction performance. Thereby, we propose a positive un-labelled (PU) learning- based approach that uses one class support vector machine technique as the learning algorithm. The algorithm learns the positive distribution from the unified feature vector space of drugs and targets and regards unknown pairs as unlabeled instead of labelling them as negative pairs. Additionally, we use 4860 Klekota Roth fingerprint + 881 PubChem fingerprint as a high dimensional and highly discriminative feature vector representation for drugs. To represent protein features, we create a protein-motif matrix based on the sliding window score that records the probability of a motif pattern occurring within a given protein sequence. Also, we separately evaluate the prediction performance using 5-fold nested cross- validation under different experimental setting for each of the four formulations: 1) Known drug-target pair,2) Drug prediction, 3) Target prediction and 4) Unknown drug target pair. We show that our approach yields the best AUC score over previous benchmark techniques and outperforms most of the recent works based on one class classifiers and PU-based learning.
机译:通过湿实验室实验实验预测新的药物目标相互作用是时间和资源密集。通常,药物 - 目标相互作用预测问题导致药物发现,药物排雷和揭示化学素研究中的有趣模式。药物和目标代表相互作用网络内的异质节点。节点之间的边缘存在表示阳性相互作用,而缺失表明未知的交互。基于分类的机器学习算法在该研究领域大量应用。分类算法需要积极的以及负数据来产生优化的结果。此字段中的主要问题缺少否定数据,因为公共数据库中发现的数据是正互动样本。考虑未知的药物目标对作为负数据可能导致预测性能的严重后果。因此,我们提出了一种积极的未标记(PU)基于学习的方法,它使用一个类支持向量机技术作为学习算法。该算法从药物和目标的统一特征向量空间中得出积极分布,并将未知对视为未标记,而不是将它们标记为负对对。此外,我们使用4860 Klekota Roth指纹+ 881 Pubchem指纹作为药物的高维和高度辨别特征载体表示。为了代表蛋白质特征,我们基于滑动窗口评分来创建蛋白质基序矩阵,其记录在给定蛋白质序列内发生的基序图案的概率。此外,我们在四种配方中的每一个的不同实验环境下单独评估预测性能:1)已知药物 - 目标对,2)药物预测,3)靶预测和4)未知药物靶标一对。我们表明,我们的方法产生了以前基准技术的最佳AUC分数,并且优于基于一个类分类器和基于PU的学习的最新作品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号