Interoperating ontologies of organizational memory through hybrid unsupervised data mining

Ching-Chieh Kiu; Chien-Sing Lee

首页> 外文期刊>VINE >Interoperating ontologies of organizational memory through hybrid unsupervised data mining

【24h】

Interoperating ontologies of organizational memory through hybrid unsupervised data mining

机译：通过混合无监督数据挖掘实现组织内存的互操作性本体

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Purpose The purpose of this paper is to present an automated ontology mapping and mergingalgorithm, namely OntoDNA, which employs data mining techniques (FCA, SOM, K-means) to resolveontological heterogeneities among distributed data sources in organizational memory andsubsequently generate a merged ontology to facilitate resource retrieval from distributed resourcesfor organizational decision making. Design/methodology/approach – The OntoDNA employs unsupervised data mining techniques(FCA, SOM, K-means) to resolve ontological heterogeneities to integrate distributed data sources inorganizational memory. Unsupervised methods are needed as an alternative in the absence of priorknowledge for managing this knowledge. Given two ontologies that are to be merged as the input, theontologies' conceptual pattern is discovered using FCA. Then, string normalizations are applied totransform their attributes in the formal context prior to lexical similarity mapping. Mapping rules areapplied to reconcile the attributes. Subsequently, SOM and K-means are applied for semanticsimilarity mapping based on the conceptual pattern discovered in the formal context to reduce theproblem size of the SOM clusters as validated by the Davies-Bouldin index. The mapping rules arethen applied to discover semantic similarity between ontological concepts in the clusters and theontological concepts of the target ontology are updated to the source ontology based on the mergingrules. Merged ontology in a concept lattice is formed. Findings – In experimental comparisons between PROMPT and OntoDNA ontology mapping andmerging tool based on precision, recall and f-measure, average mapping results for OntoDNA is 95.97percent compared to PROMPT's 67.24 percent In tetras of recall, OntoDNA outperforms PROMPT on allthe paired ontology except for one paired ontology. For the merging of one paired ontology, PROMPTfails to identify the mapping elements. OntoDNA significantly outperforms PROMPT due to theutilization of FCA in the OntoDNA to capture attributes and the inherent structural relationships amongconcepts. Better performance in OntoDNA is due to the following reasons. First, semantic problems suchas synonymy and polysemy are resolved prior to contextual clustering. Second, unsupervised data miningtechniques (SOM and K-means) have reduced" problem size. Third, string matching performs better thanPROMPT's linguistic-similarity matching in addressing semantic heterogeneity, in context it alsocontributes to the OntoDNA results. String matching resolves concept names based on similarity betweenconcept names in each cluster for ontology mapping. Linguistic-similarity matching resolves conceptnames based on concept-representation structure and relations between concepts for ontology mapping. Originality/value – The OntoDNA automates ontology mapping and merging without the need ofany prior knowledge to generate a merged ontology. String matching is shown to perform better thanlinguistic-similarity matching in resolving concept names. The OntoDNA will be valuable fororganizations interested in merging ontologies from distributed or different organizational memories.For example, an organization might want to merge their organization-specific ontologies withcommunity standard ontologies.

机译：目的本文的目的是提出一种自动的本体映射和合并算法，即OntoDNA，它使用数据挖掘技术（FCA，SOM，K-means）来解决组织内存中分布式数据源之间的本体异质性，并随后生成合并的本体以方便从分布式资源中检索资源以进行组织决策。设计/方法/方法– OntoDNA采用无监督数据挖掘技术（FCA，SOM，K-means）来解决本体异质性，从而将分布式数据源集成到组织内存中。在没有先验知识来管理此知识的情况下，需要无监督方法作为替代。给定两个要合并的本体作为输入，使用FCA发现本体的概念模式。然后，在词汇相似度映射之前，将字符串规范化应用于形式上下文中的属性转换。应用映射规则以协调属性。随后，基于在正式语境中发现的概念模式，将SOM和K-means应用于语义相似性映射，以减少Davies-Bouldin索引验证的SOM集群的问题大小。然后将映射规则应用于发现集群中本体概念之间的语义相似性，并根据合并规则将目标本体的本体概念更新为源本体。形成了概念格中的合并本体。研究结果–在基于精度，召回率和f度量的PROMPT与OntoDNA本体映射和合并工具之间的实验比较中，OntoDNA的平均映射结果为95.97％，而PROMPT的平均映射结果为67.24％。一对配对的本体。对于合并一对本体，PROMPT无法识别映射元素。由于在OntoDNA中利用FCA捕获概念之间的属性和固有的结构关系，OntoDNA的性能明显优于PROMPT。由于以下原因，OntoDNA的性能更好。首先，在上下文聚类之前解决诸如同义词和多义性之类的语义问题。第二，无监督数据挖掘技术（SOM和K-means）减小了“问题”的大小。第三，字符串匹配在解决语义异质性方面比PROMPT的语言相似性匹配要好，在上下文中它也有助于OntoDNA结果。字符串匹配基于每个集群中用于本体映射的概念名称之间的相似性。语言相似度匹配基于概念表示结构和本体映射的概念之间的关系来解析概念名称。原创性/价值– OntoDNA自动进行本体映射和合并，而无需任何先验知识即可生成本体合并的本体。字符串匹配在解析概念名称方面表现出比语言相似性匹配更好的性能。OntoDNA对于有兴趣从分布式或不同组织内存中合并本体的组织非常有价值。例如，一个组织可能希望合并其特定于组织的本体无线社区标准本体。

著录项

来源
《VINE》 |2009年第4期|共23页
作者
Ching-Chieh Kiu; Chien-Sing Lee;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类图书馆自动化、网络化;
关键词
Storage management; Knowledge management; Data handling; Merging;

机译：存储管理;知识管理;数据处理;合并;
入库时间 2022-08-18 20:07:20

相似文献

外文文献
中文文献
专利

1. Interoperating ontologies of organizational memory through hybrid unsupervised data mining [J] . Ching-Chieh Kiu, Chien-Sing Lee VINE . 2009,第4期

机译：通过混合无监督数据挖掘实现组织内存的互操作性本体
2. Transforming Open Data to Linked Open Data Using Ontologies for Information Organization in Big Data Environments of the Brazilian Government: the Brazilian Database Government Open Linked Data - DBgoldbr [J] . Victorino Marcio, de Holanda Maristela Terto, Ishikawa Edison, Knowledge Organization . 2018,第6期

机译：在巴西政府的大数据环境中使用用于信息组织的本体将开放数据转换为链接的开放数据：巴西数据库政府开放的链接数据-DBgoldbr
3. A hybrid approach of neural network and memory-based learning to data mining [J] . Chung-Kwan Shin, Ui Tak Yun IEEE Transactions on Neural Networks . 2000,第3期

机译：神经网络和基于记忆的学习的混合方法用于数据挖掘
4. Learning Objects Reusability and Retrieval through Ontological Sharing: A Hybrid Unsupervised Data Mining Approach [C] . Kiu, Ching-Chieh, Lee, . 2007

机译：通过本体共享学习对象的可重用性和检索：混合无监督数据挖掘方法
5. A Scalable Physics-based Data Modeling Framework to Unsupervised High-Dimensional Data Mining. [D] . Huang, Hao. 2014

机译：可扩展的基于物理的数据建模框架，可实现无监督的高维数据挖掘。
6. Construction of protein phosphorylation networks by data mining text mining and ontology integration: analysis of the spindle checkpoint [O] . Karen E. Ross, Cecilia N. Arighi, Jia Ren, 2013

机译：通过数据挖掘文本挖掘和本体集成构建蛋白质磷酸化网络：纺锤体检查点的分析
7. Mining for Lexons: Applying Unsupervised Learning Methods to Create Ontology Bases [O] . Marie-laure Reinberger, Peter Spyns, Walter Daelemans, 2003

机译：Lexon的挖掘：应用无监督学习方法创建本体基础

Interoperating ontologies of organizational memory through hybrid unsupervised data mining

摘要

著录项

相似文献

相关主题

期刊订阅