首页> 外文会议>International Conference on Intelligent Systems and Computer Vision >Deep semi-supervised learning for DTI prediction using large datasets and H2O-spark platform
【24h】

Deep semi-supervised learning for DTI prediction using large datasets and H2O-spark platform

机译:使用大型数据集和H2O-Spark平台进行DTI预测的深度半监督学习

获取原文

摘要

Drug repositioning is the process of recycling existing drugs for new indications by identifying the potential drug-target interactions (DTIs). However, in silico predicting new associations between drugs and target proteins is a challenging issue, due to the scarcity of known DTIs and no experimentally true negative drug-target interaction sample. Furthermore, the volume of genomic sequences and chemical structures data is growing in an exponential manner, which consumes relatively too much time and effort. For these reasons, we propose a new computational method based on deep semi-supervised learning called DSSL-DTIs to accurately predict new DTI in post-genome era using large datasets and Spark-H2O platform. Firstly, we use the stacked autoencoders to convert high-dimensional features to low-dimensional representations. Then, we apply another unsupervised stacked autoencoders model for initializing the weights of a supervised deep neural network model. Comparing to other state-of-the-art methods applied all on the same reference dataset of Drug-Bank, it is found that our approach outperforms these techniques with an overall accuracy performance more than 98%. The DSSL-DTIs can be further used to predict large-scale new drug-target interactions. The highly ranked candidate DTIs obtained from DSSL-DTIs are also confirmed in the DrugBank database and in the literature, which demonstrates the effectiveness of our method.
机译:药物重新定位是通过鉴定潜在的药物 - 靶靶相互作用(DTI)来回收新适应症的现有药物的方法。然而,由于缺乏已知的DTI并且没有实验真正的阴性药物 - 靶相互作用样品,在预测药物和靶蛋白之间的新关联是一个具有挑战性的问题。此外,基因组序列和化学结构数据的体积以指数的方式生长,这消耗了相对太多的时间和努力。由于这些原因,我们提出了一种基于深度半监督学习的新的计算方法,称为DSSL-DTI,可以使用大型数据集和Spark-H2O平台准确地预测基因组时代的新DTI。首先,我们使用堆叠的autoencoders将高维功能转换为低维表示。然后,我们应用另一个无人监督的堆叠自动介质模型,用于初始化监督深度神经网络模型的权重。与其他最先进的方法相比,所有在药物 - 银行的相同参考数据集上应用,我们发现我们的方法优于这些技术,总精度性能超过98 %。 DSSL-DTI可以进一步用于预测大规模的新药物目标相互作用。在药物库数据库和文献中也确认了从DSSL-DTI获得的高度排名候选DTI,并在文献中证实了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号