【24h】

Matching Biomedical Ontologies with Compact Evolutionary Algorithm

机译:匹配具有紧凑型进化算法的生物医学本体

获取原文

摘要

Literature Based Discovery (LBD) aims at building bridges between existing literatures and discovering new knowledge from them. Biomedical ontology is such a literature that provides an explicit specification on biomedical knowledge, i.e., the formal specification of the biomedical concepts and data, and the relationships between them. However, since biomedical ontologies are developed and maintained by different communities, the same biomedical information or knowledge could be defined with different terminologies or in different context, which makes the integration of them becomes a challenging problem. Biomedical ontology matching can determine the semantically identical biomedical concepts in different biomedical ontologies, which is regarded as an effective methodology to bridge the semantic gap between two biomedical ontologies. Currently, Evolutionary Algorithm (EA) is emerging as a good methodology for optimizing the ontology alignment. However, EA requires huge memory consumption and long runtime, which make EA-based matcher unable to efficiently match biomedical ontologies. To overcome these problems, in this paper, we define a discrete optimal model for biomedical ontology matching problem, and utilize a compact version of Evolutionary Algorithm (CEA) to solve it. In particular, CEA makes use of a Probability Vector (PV) to represent the population to save the memory consumption, and introduces a local search strategy to improve the algorithm's search performance. The experiment exploits Anatomy track, Large Biomed track and Disease and Phenotype track provided by the Ontology Alignment Evaluation Initiative (OAEI) to test our proposal's performance. The experimental results show that CEA-based approach can effectively reduce the runtime and memory consumption of EA-based matcher, and determine high-quality biomedical ontology alignments.
机译:文学基于的发现(LBD)旨在建立现有文献之间的桥梁,并从他们身上发现新知识。生物医学本体是一种关于生物医学知识的明确规范,即生物医学概念和数据的正式规范,以及它们之间的关系。然而,由于生物医学本体由不同的社区开发和维护,因此可以在不同的术语或不同的背景下定义相同的生物医学信息或知识,这使得它们的整合成为一个具有挑战性的问题。生物医学本体匹配可以确定不同生物医学本体中的语义相同的生物医学概念,被认为是弥合两种生物医学本体之间的语义差距的有效方法。目前,进化算法(EA)作为优化本体对齐的良好方法。但是,EA需要巨大的内存消耗和长期运行时,这使得基于EA的匹配不能有效地匹配生物医学本体。为了克服这些问题,在本文中,我们为生物医学本体匹配问题定义了一个离散的最佳模型,并利用了一种紧凑的进化算法(CEA)来解决它。特别地,CEA利用概率向量(PV)来表示节省存储器消耗的人群,并引入本地搜索策略以提高算法的搜索性能。该实验利用本体对准评估倡议(OAEI)提供的解剖轨道,大型生物曲线和疾病和表型轨​​道来测试我们的提案的表现。实验结果表明,基于CEA的方法可以有效地降低基于EA的运行时间和内存消耗,并确定高质量的生物医学本体对齐。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号