...
首页> 外文期刊>Sadhana >PSOM 2 —partitioning-based scalable ontology matching using MapReduce
【24h】

PSOM 2 —partitioning-based scalable ontology matching using MapReduce

机译:PSOM 2-使用MapReduce的基于分区的可扩展本体匹配

获取原文
   

获取外文期刊封面封底 >>

       

摘要

The growth and use of semantic web has led to a drastic increase in the size, heterogeneity and number of ontologies that are available on the web. Correspondingly, scalable ontology matching algorithms that will eliminate the heterogeneity among large ontologies have become a necessity. Ontology matching algorithms generally do not scale well due to the massive number of complex computations required to achieve matching. One of the methods used to address this problem is the use of partition-based systems to reduce thematching space. In this paper, we propose a new partitioning-based scalable ontology matching system called PSOM 2 . We have designed a new neighbour-based intra-similarity measure to increase the quality of the clusterset formation for the partition-based ontology matching process. These sets of clusters or sub-ontologies are matched across the input ontologies to identify matchable cluster pairs, based on anchors that are efficiently discovered through a new light-weight linguistic matcher (EI-sub). However, in order to further increase the efficiency of the time-consuming anchor discovery process we have designed a Map Reduce-based EI-sub process where anchors are discovered in distributed and parallel fashion. Experiments on benchmark OAEI(Ontology Alignment Evaluation Initiative) large scale ontologies demonstrate that the new PSOM 2 system achieves, on an average, 31% decrease in entropy of the clusters and 54.5% reduction in overall run time. Based on the experimental results, it is evident that the new PSOM 2 achieves better quality clusters and a major reduction in execution time, leading to an effective and scalable ontology matching system.
机译:语义Web的发展和使用已导致Web上可用本体的大小,异构性和数量急剧增加。相应地,将需要消除大本体之间的异质性的可扩展本体匹配算法。本体匹配算法通常由于实现匹配所需的大量复杂计算而不能很好地扩展。解决此问题的方法之一是使用基于分区的系统来减少匹配空间。在本文中,我们提出了一种新的基于分区的可扩展本体匹配系统,称为PSOM 2。我们设计了一种新的基于邻居的内部相似性度量,以提高基于分区的本体匹配过程的集群集形成质量。基于通过新的轻量级语言匹配器(EI-sub)有效发现的锚点,可在输入本体之间对这些群集或子本体集进行匹配,以标识可匹配的群集对。但是,为了进一步提高耗时的锚发现过程的效率,我们设计了基于Map Reduce的EI子过程,该过程以分布式和并行方式发现锚。在基准OAEI(本体比对评估计划)大规模本体上进行的实验表明,新的PSOM 2系统平均使簇的熵降低31%,总体运行时间降低54.5%。根据实验结果,很明显,新的PSOM 2实现了更好的质量簇并大大减少了执行时间,从而导致了有效且可扩展的本体匹配系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号