首页> 外文期刊>VINE >Interoperating ontologies of organizational memory through hybrid unsupervised data mining
【24h】

Interoperating ontologies of organizational memory through hybrid unsupervised data mining

机译:通过混合无监督数据挖掘实现组织内存的互操作性本体

获取原文
获取原文并翻译 | 示例
       

摘要

Purpose The purpose of this paper is to present an automated ontology mapping and mergingalgorithm, namely OntoDNA, which employs data mining techniques (FCA, SOM, K-means) to resolveontological heterogeneities among distributed data sources in organizational memory andsubsequently generate a merged ontology to facilitate resource retrieval from distributed resourcesfor organizational decision making. Design/methodology/approach – The OntoDNA employs unsupervised data mining techniques(FCA, SOM, K-means) to resolve ontological heterogeneities to integrate distributed data sources inorganizational memory. Unsupervised methods are needed as an alternative in the absence of priorknowledge for managing this knowledge. Given two ontologies that are to be merged as the input, theontologies' conceptual pattern is discovered using FCA. Then, string normalizations are applied totransform their attributes in the formal context prior to lexical similarity mapping. Mapping rules areapplied to reconcile the attributes. Subsequently, SOM and K-means are applied for semanticsimilarity mapping based on the conceptual pattern discovered in the formal context to reduce theproblem size of the SOM clusters as validated by the Davies-Bouldin index. The mapping rules arethen applied to discover semantic similarity between ontological concepts in the clusters and theontological concepts of the target ontology are updated to the source ontology based on the mergingrules. Merged ontology in a concept lattice is formed. Findings – In experimental comparisons between PROMPT and OntoDNA ontology mapping andmerging tool based on precision, recall and f-measure, average mapping results for OntoDNA is 95.97percent compared to PROMPT's 67.24 percent In tetras of recall, OntoDNA outperforms PROMPT on allthe paired ontology except for one paired ontology. For the merging of one paired ontology, PROMPTfails to identify the mapping elements. OntoDNA significantly outperforms PROMPT due to theutilization of FCA in the OntoDNA to capture attributes and the inherent structural relationships amongconcepts. Better performance in OntoDNA is due to the following reasons. First, semantic problems suchas synonymy and polysemy are resolved prior to contextual clustering. Second, unsupervised data miningtechniques (SOM and K-means) have reduced" problem size. Third, string matching performs better thanPROMPT's linguistic-similarity matching in addressing semantic heterogeneity, in context it alsocontributes to the OntoDNA results. String matching resolves concept names based on similarity betweenconcept names in each cluster for ontology mapping. Linguistic-similarity matching resolves conceptnames based on concept-representation structure and relations between concepts for ontology mapping. Originality/value – The OntoDNA automates ontology mapping and merging without the need ofany prior knowledge to generate a merged ontology. String matching is shown to perform better thanlinguistic-similarity matching in resolving concept names. The OntoDNA will be valuable fororganizations interested in merging ontologies from distributed or different organizational memories.For example, an organization might want to merge their organization-specific ontologies withcommunity standard ontologies.
机译:目的本文的目的是提出一种自动的本体映射和合并算法,即OntoDNA,它使用数据挖掘技术(FCA,SOM,K-means)来解决组织内存中分布式数据源之间的本体异质性,并随后生成合并的本体以方便从分布式资源中检索资源以进行组织决策。设计/方法/方法– OntoDNA采用无监督数据挖掘技术(FCA,SOM,K-means)来解决本体异质性,从而将分布式数据源集成到组织内存中。在没有先验知识来管理此知识的情况下,需要无监督方法作为替代。给定两个要合并的本体作为输入,使用FCA发现本体的概念模式。然后,在词汇相似度映射之前,将字符串规范化应用于形式上下文中的属性转换。应用映射规则以协调属性。随后,基于在正式语境中发现的概念模式,将SOM和K-means应用于语义相似性映射,以减少Davies-Bouldin索引验证的SOM集群的问题大小。然后将映射规则应用于发现集群中本体概念之间的语义相似性,并根据合并规则将目标本体的本体概念更新为源本体。形成了概念格中的合并本体。研究结果–在基于精度,召回率和f度量的PROMPT与OntoDNA本体映射和合并工具之间的实验比较中,OntoDNA的平均映射结果为95.97%,而PROMPT的平均映射结果为67.24%。一对配对的本体。对于合并一对本体,PROMPT无法识别映射元素。由于在OntoDNA中利用FCA捕获概念之间的属性和固有的结构关系,OntoDNA的性能明显优于PROMPT。由于以下原因,OntoDNA的性能更好。首先,在上下文聚类之前解决诸如同义词和多义性之类的语义问题。第二,无监督数据挖掘技术(SOM和K-means)减小了“问题”的大小。第三,字符串匹配在解决语义异质性方面比PROMPT的语言相似性匹配要好,在上下文中它也有助于OntoDNA结果。字符串匹配基于每个集群中用于本体映射的概念名称之间的相似性。语言相似度匹配基于概念表示结构和本体映射的概念之间的关系来解析概念名称。原创性/价值– OntoDNA自动进行本体映射和合并,而无需任何先验知识即可生成本体合并的本体。字符串匹配在解析概念名称方面表现出比语言相似性匹配更好的性能。OntoDNA对于有兴趣从分布式或不同组织内存中合并本体的组织非常有价值。例如,一个组织可能希望合并其特定于组织的本体无线社区标准本体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号