...
首页> 外文期刊>BMC Bioinformatics >A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
【24h】

A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration

机译:通过异构数据集成的药物重新定位的双层无监督聚类方法

获取原文
           

摘要

Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates.
机译:药物重新定位是识别现有药物的新用途的过程。计算药物重新定位方法可以通过自动分析药理学网络中的关系来减少药物开发的时间,成本和风险。药理学网络是大而异质的。将药物分为小组可以简化大型药理学网络,这些亚组也可以用作重新定位药物的起点。在本文中,我们提出了一种用于药物重新定位的双层药物无监督的聚类方法,整合异质药物数据谱:药物化学,毒性疾病,药物基因,药物 - 蛋​​白和药物副作用关系。所提出的药物排雷方法是三倍; (i)基于使用越来越多的自组织地图(GSOM)基于其均匀谱的聚类药物; (ii)基于药物 - 药物关系基于前一步的聚类药物,考虑到三个最先进的图形聚类方法; (iii)推断药物重新定位候选人并为每个已识别的候选人分配置信价值。在本文中,我们使用GSOM参考解剖治疗化学(ATC)分类来比较我们的双层聚类方法对两个现有的异构数据集成方法。我们的方法分别产生标准化的互信息(NMI)和标准化互信息(SMI)分别为0.66和36.11,而两种现有方法产生0.60和0.64和22.26和33.59的NMI。此外,当我们的方法能够识别用于药物重新定位的有用群集时,两种现有方法未能产生有用的集群分离。此外,我们提供四种预测结果(Chlorthaldone,Indomethacin,Metformin和Thiroidazine)的临床证据,以支持我们所提出的方法可以可靠地用于推断ATC代码和药物重新定位。提出的双层无监督的聚类方法适用于药物聚类,并实现异构数据集成。它还可以参考ATC治疗分类来识别可靠的重新定位药物候选者。通过多聚类算法一致地识别的重新定位药物候选者和高信心具有更高的有效重新定位候选者的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号