首页> 外文会议>International conference on data integration in the life sciences >Mining Linked Open Data: A Case Study with Genes Responsible for Intellectual Disability
【24h】

Mining Linked Open Data: A Case Study with Genes Responsible for Intellectual Disability

机译:关联开放数据的挖掘:以负责智力障碍的基因为例的研究

获取原文

摘要

Linked Open Data (LOD) constitute a unique dataset that is in a standard format, partially integrated, and facilitates connections with domain knowledge represented within semantic web ontologies. Increasing amounts of biomedical data provided as LOD consequently offer novel opportunities for knowledge discovery in biomedicine. However, most data mining methods are neither adapted to LOD format, nor adapted to consider domain knowledge. We propose in this paper an approach for selecting, integrating, and mining LOD with the goal of discovering genes responsible for a disease. The selection step relies on a set of choices made by a domain expert to isolate relevant pieces of LOD. Because these pieces are potentially not linked, an integration step is required to connect unlinked pieces. The resulting graph is subsequently mined using Inductive Logic Programming (ILP) that presents two main advantages. First, the input format compliant with ILP is close to the format of LOD. Second, domain knowledge can be added to this input and considered by ILP. We have implemented and applied this approach to the characterization of genes responsible for intellectual disability. On the basis of this real-world use case, we present an evaluation of our mining approach and discuss its advantages and drawbacks for the mining of biomedical LOD.
机译:链接开放数据(LOD)构成了标准格式的,部分集成的唯一数据集,并促进了与语义Web本体中表示的领域知识的连接。因此,以LOD形式提供的越来越多的生物医学数据为生物医学中的知识发现提供了新的机会。但是,大多数数据挖掘方法既不适合LOD格式,也不适合考虑领域知识。我们在本文中提出了一种选择,整合和挖掘LOD的方法,目的是发现引起疾病的基因。选择步骤取决于领域专家做出的一系列选择,以隔离相关的LOD。由于这些零件可能未链接,因此需要集成步骤来连接未链接的零件。随后使用归纳逻辑编程(ILP)挖掘得到的图形,该图形具有两个主要优点。首先,符合ILP的输入格式接近LOD的格式。其次,可以将领域知识添加到此输入中,并由ILP考虑。我们已经实现了这种方法,并将其应用于表征智力障碍的基因。在此实际应用案例的基础上,我们对我们的采矿方法进行了评估,并讨论了其在生物医学LOD采矿中的优缺点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号