首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Iteratively collective prediction of disease-gene associations through the incomplete network
【24h】

Iteratively collective prediction of disease-gene associations through the incomplete network

机译:通过不完整的网络迭代地集体预测疾病-基因的关联

获取原文

摘要

The prediction of links between genes and disease is still one of the biggest challenges in the field of human health. Almost all state-of-the-art studies on the prediction of gene-disease links focuson a single pair of links, ignoring the associations and interactions among different types of links. Moreover, the biological information networks are usually incomplete. In this paper, we study the similarity measure to be used on two different types of nodes, based on the metapaths between them (Wsrm). Then an iterative self-updating approach for link prediction using heterogeneous information network is proposed to fit the incompletion of the network (ISL), which is a semi-supervised learning formula. Using the biological integrated network constructed from OMIM and HumanNet dataset (30,896 nodes and 1,200,166 edges) we applied our framework. The area under the receiver operating characteristic is 0.941, indicating that our approach significantly outperforms the state-of-the-art gene-disease link prediction approaches. Moreover, the sensitivity analysis signifies that our approach is robust. Consequently, our proposed framework demonstrates an efficient and accurate approach for link prediction between genes and diseases. In addition, during iteration, the accuracy of the result gradually increases. The example dataset and the implementation of our approach is avaliable at https://github.com/xymeng16/ISL.
机译:基因与疾病之间联系的预测仍然是人类健康领域最大的挑战之一。几乎所有关于基因疾病链接预测的最新研究都集中在一对链接上,而忽略了不同类型链接之间的关联和相互作用。而且,生物信息网络通常是不完整的。在本文中,我们基于两个节点之间的元路径(Wsrm),研究了要在两种不同类型的节点上使用的相似性度量。然后提出了一种基于异构信息网络的迭代自更新链路预测方法,以适应网络(ISL)的不完善性,这是一种半监督的学习公式。使用从OMIM和HumanNet数据集(30,896个节点和1,200,166个边缘)构建的生物集成网络,我们应用了我们的框架。接收器工作特性下的面积为0.941,这表明我们的方法大大优于最新的基因-疾病链接预测方法。此外,敏感性分析表明我们的方法是可靠的。因此,我们提出的框架展示了一种有效且准确的方法来预测基因与疾病之间的联系。另外,在迭代过程中,结果的准确性逐渐提高。示例数据集和我们方法的实现可从https://github.com/xymeng16/ISL获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号